Avsnitt

  • How Machine Translation Makes Localization Easy on Frazier

    In addition to audio description, VIDEO TO VOICE is actively involved in other fields surrounding accessibility for digital media. To break down language barriers, Frazier includes a neural machine translation service. With lower costs and fewer delays, this option is a popular choice for clients wanting to create video content in multiple languages.

    Jokes about online translation tools are so last decade. When the first real-time machine translation services emerged in the early 2000s, they were ridiculed for producing questionable results. At best amusing, and at worst wildly offensive, translations created this way would often be too literal and stray away from the intended meaning of the original text.

    But as technology has improved in recent years, there have been noticeably fewer “translation fail” memes doing the rounds on social media. The latest developments in machine translation involve neural networks. These are subsets of machine learning that try to mimic how the human brain works during the translation process.

    Intrigued? Read on to find out everything you need to know about the language service industry, where providers are having trouble, and how machine translation makes localization easy.

    Why is translation important?

    It's simple – we all want access to content and information that is naturally communicated in our own language.

    From a business standpoint, effective communication helps companies improve their brand image, boost customer satisfaction, and increase productivity of staff.

    As well as keeping existing customers happy, translated content can help companies expand into other markets. This is backed up by findings in the 2020 Common Sense Advisory Report “Can't Read, Won't Buy”:

    Weiterlesen

  • In addition to providing technological solutions for audio description, VIDEO TO VOICE is actively involved in other fields surrounding accessibility for digital media. For some time, VIDEO TO VOICE has been working in close partnership with the Fraunhofer Institute for Integrated Circuits (IIS) on topics such as automated mixing and MPEG-H Audio. The following article provides detailed analysis of the tools that make up MPEG-H Audio, a technology developed by Fraunhofer IIS to improve accessibility and deliver the best sound experience possible.

    This is the second part of our series exploring the object-based approach to audio production and Fraunhofer IIS's involvement in developing MPEG-H Audio system. Part one focused on the fundamentals in traditional channel-based audio, the accessibility problems this approach causes, and the solutions that object-based audio can provide.

    With the basics covered, it's now time to take a closer look at the ground-breaking technology behind MPEG-H Audio and the various tools content creators can take advantage of. Let's start with the MPEG-H Authoring Suite...

    What does the MPEG-H Authoring Suite involve?

    Authoring in this sense is the step where metadata is created to enable playback devices to deliver on the three key principles of interactivity, immersive sound, and universal delivery.

    Metadata is information that describes an audio object's existence, position, and function.

    The MPEG-H Authoring Suite (MAS) is the place where content creators can create metadata, such as setting up presets, enabling interactivity options for viewers, and defining positions and properties of audio objects.

    On the MAS, content creators can also monitor how their mix will be rendered in different layouts and make any necessary changes.

    Weiterlesen

  • Saknas det avsnitt?

    Klicka här för att uppdatera flödet manuellt.

  • Dialog+: The Game Changer for Easier-to-Follow TV Dialogue

    In addition to providing technological solutions for audio description, VIDEO TO VOICE is actively involved in other fields surrounding accessibility for digital media. For some time, VIDEO TO VOICE has been working in close partnership with the Fraunhofer Institute for Integrated Circuits (IIS) on topics such as automated mixing and MPEG-H Audio. The following article provides a fascinating glimpse at Dialog+, a production technology developed by Fraunhofer IIS to make TV dialogue easier to understand.

    It's a frustration many of us have experienced while watching TV: We can't quite hear what the actor is mumbling during an important scene, so lose track of the plot for the rest of the show. Likewise, when the music, sound effects or background noises are too loud, the dialogue becomes difficult to follow, prompting us to turn up the volume to an uncomfortable level or switch over.

    To combat this issue, the Fraunhofer Institute for Integrated Circuits (IIS) teamed up with public broadcasters in developing Dialog+ – an innovative AI-based solution for making TV dialogue easier to understand. In essence, Dialog+ provides viewers with an enhanced version of the dialogue which they can select as an alternative to the original audio mix.

    In late 2020, German television networks Westdeutscher Rundfunk (WDR) and Bayerischer Rundfunk (BR) conducted viewer testing to see how Dialog+ would be received. But before we examine the results, let's take a closer look at why TV dialogue is difficult to follow in the first place.

    Weiterlesen

  • TTS AD is the abbreviation for text-to-speech audio description. Now you may have heard text to speech and audio description as separate entities, but TTS AD is a relatively new concept that is starting to turn heads in the world of accessible media. Speed is the name of the game with TTS AD, as media professionals seek cost and time-effective solutions for making their programming accessible to low-vision audiences.

    So let's take a closer look at what TTS AD is, how it is made, and what role it will play in the future of accessible media.

    TTS AD: A Viable Solution for Audio Describing Content?First, can you quickly explain to me what text to speech is?

    Text to speech (TTS) converts written text into a synthesised spoken voice.

    It is sometimes referred to as "read aloud" technology.

    TTS is an increasingly popular accessibility feature on computers and phones for reading out digital text.

    You can find TTS in many places today such as internet browsers, smart assistants, e-book readers, and word processors.

    And what is audio description?

    Audio description for film and television is a form of narration that uses verbal descriptions to provide information on visual aspects of a media production.

    In other words, a pre-recorded voice over track describes what is happening in a video, TV show or film.

    The audio description should be neatly intertwined with the production's original dialogue and soundtrack.

    So TTS AD is a combination of the two?

    That's right.

    TTS AD is audio description that is read aloud using synthetic voices through text to speech.

    Who is the audio description intended for?

    Weiterlesen

  • Audio description software is increasing in popularity

    With tight deadlines and limited budgets to worry about, audio describers and production companies are turning to technological solutions to lower costs and save time. But what's involved in audio description software, and what features are included to help users?

    We'll be using our own tools as an example to show you the ins and outs of audio description software.

    Remind me, what is audio description?

    The audio description is a narration spoken during quieter moments of a video.

    The audio description provides additional information that would otherwise not be clear from the dialogue.

    This often includes descriptions of the location, characters' facial expressions or the type of scene.

    Here's an example of an audio description made with our software:

    Who needs audio description?

    The audio description is primarily intended for people with vision loss.

    Around 250 million people worldwide have a visual impairment, with sight issues becoming more prevalent as we age.

    Despite its importance, only a small percentage of productions are provided with audio description.

    Who makes the audio description?

    Put simply, audio description production is usually broken down into the following stages:

    script writingvoice recordingaudio post-production

    Different professionals are normally required for each step, but some audio describers can take on all three tasks if they have a home studio.

    Got it. So why do we need audio description software?

    Audio description production is typically a complicated process.

    Audio description scriptwriters need to deal with fiddly timecoding and export issues, which slows down progress.

    Clients usually ask for corrections to the script, so there's also a lot of back and forth on the phone or by email.

    Weiterlesen

  • Hashtags are a great way to expand the reach of your company or brand. By adding hashtags to a post, you're making your content discoverable and giving other social media users the chance to engage with you on a common interest or theme.

    However, many people and businesses are forgetting one important thing when writing their hashtags. It's time to learn about the wonders of camel case...

    First of all, what is camel case?

    In camel case, the first letter of each word is capitalized when writing compound words or phrases; the first word isn't uppercase.

    You may also see this practice stylized as camelCase. Here are a few common examples:

    eBay eLearning iPhone macOS vCard

    When you also capitalize the first word, it's known as upper camel case or pascal case, as seen here:

    BlackBerry HarperCollins LaserJet LinkedIn YouTube So why the cute name?

    This way of writing is called camel case because the capital letters are taller than the lowercase ones.

    The rise in the middle of the compound resembles a hump on the back of a camel.

    What has this got to do with hashtags?

    On social media, longer hashtags using multiple words have become increasingly popular.

    In most instances, people write out the entire hashtagged phrase in lowercase like this:

    #videotovoicerocks

    However, this way of writing hashtags is problematic for many social media users.

    What sorts of problems do lowercase hashtags create?

    Lowercase multiple-word hashtags are an issue for screen readers.

    A screen reader is a form of assistive technology that reads out text and image content as speech or renders it as braille.

    Weiterlesen

  • Learn how to write alt text and image descriptions for blind users

    Alt text and image descriptions are vital for web accessibility. Ignoring these features makes visual content inaccessible to website visitors with low vision. As most people will click away from a site they cannot use, it's time for businesses, bloggers, and webmasters to get smart with alt text and image descriptions.

    Before we get started, what does web accessibility mean?

    It's a good question, and you can check out our web accessibility guide for a detailed answer.

    In short, web accessibility means designing a website in a way that doesn't prevent anyone from being able to use it.

    Therefore, people with disabilities or impairments shouldn't have any trouble on your site.

    Where does writing alt text and image descriptions come into it?

    Alt text and image descriptions remove hurdles that stop people from accessing your content.

    What is alt text?

    Alt text is the abbreviation for “alternative text”.

    Alt text briefly describes what is in an image, such as text and the most important details.

    You'll sometimes see the alt text when an image fails to load.

    In HTML view, you can provide alternative text for your pictures by adding alt="describe your image" within the image tag.

    In many content management systems, you can add alt text by simply clicking on the image and entering a short explanation.

    Most social media sites, such as Twitter, have a feature for adding alt text.

    What is an image description?

    An image description is more detailed than alt text, so that someone can find out more about what is happening in the image.

    Weiterlesen

  • It's a difficult game of catch-up to expand the presence of audio description online.

    While audio description is becoming increasingly embraced in the world of TV, film, and video on demand services, there remains a severe lack of availability online. As inaction continues, this black hole in the online media universe continues to grow wider. The blind and visually impaired aren't the only ones missing out on the opportunities audio description can provide, either…

    April 17, 2015 was a big day for audio description, when Marvel's Daredevil became the first show on Netflix to include an audio description track. This meant low-vision viewers could select an option to have a narrator describe everything on screen that they would otherwise miss. Featuring blind superhero Matt Murdock as the spin-off's main protagonist, it was an apt choice for the visually impaired community.

    Since then, the video on demand provider has been steadily increasing the number of productions that feature audio description. According to the American Council of the Blind (ACB), Netflix has 1,272 film and TV titles with audio description on its platform in the United States, accounting for around a quarter of its total content. There's still a lot of work to do, but the trend is positive.


    Staying state side, the situation is even rosier for blind movie-goers. Since 2018, federal regulations require cinemas to have the necessary equipment to provide an audio description in the English language. In most cases, visually impaired people access the narration track through special headphones. As such, nearly all first-run movies are now accessible to the blind.

    Weiterlesen

  • After a long day, parking ourselves in front of a screen is often the best cure to deal with life’s stresses. Whether it’s flicking through channels, catching up with our favourite YouTubers, or binging on the latest Netflix series, we are spoilt for choice when it comes to what we can watch. It is now second nature to load up movies, sporting events, news, and television shows at the touch of a button. Access to audiovisual media has become our basic right.

    It would be quite a shock, then, if our viewing options were abruptly slashed to a comparatively minute selection. In the digital age, there would be justifiable cause for outrage. And yet, lack of access to media is the grim reality facing around 30 million blind and partially sighted people living in Europe today.

    The low-vision population enjoys watching television and going to the movies, just like anyone else. However, people who fall into this category also need to have a complete picture of the on-screen action, which is why productions require an audio description.

    What is audio description?

    In short, an audio description is a spoken narrative detailing everything that is happening on screen. During gaps in dialogue, a voice artist describes the setting as well as actors’ movements, gestures, and facial expressions so that blind and partially sighted viewers can stay on top of the action.

    The concept of audio description was first developed by American professor Gregory Frazier in 1974. Since then, the technology has been limited to movie and TV production due to the laborious production process, which is usually based on the following workflow:

    Weiterlesen