Close Menu
TechCentralTechCentral

    Subscribe to the newsletter

    Get the best South African technology news and analysis delivered to your e-mail inbox every morning.

    Facebook X (Twitter) YouTube LinkedIn
    WhatsApp Facebook X (Twitter) LinkedIn YouTube
    TechCentralTechCentral
    • News

      War of words erupts over home affairs database fee hike

      24 June 2025

      Don’t expect Starlink in South Africa anytime soon

      24 June 2025

      Finally! Tribunal unpacks why it blocked Vodacom’s Vumatel deal

      24 June 2025

      Samsung to unveil new folding phones at July event

      24 June 2025

      Capital Appreciation banks on payments to offset software slump

      24 June 2025
    • World

      Mira Murati’s Thinking Machines hits $10-billion valuation

      24 June 2025

      Watch | Starship rocket explodes in setback to Musk’s Mars mission

      19 June 2025

      Trump Mobile dials into politics, profit and patriarchy

      17 June 2025

      Samsung plots health data hub to link users and doctors in real time

      17 June 2025

      Beijing’s chip champions blacklisted by Taiwan

      16 June 2025
    • In-depth

      Meta bets $72-billion on AI – and investors love it

      17 June 2025

      MultiChoice may unbundle SuperSport from DStv

      12 June 2025

      Grok promised bias-free chat. Then came the edits

      2 June 2025

      Digital fortress: We go inside JB5, Teraco’s giant new AI-ready data centre

      30 May 2025

      Sam Altman and Jony Ive’s big bet to out-Apple Apple

      22 May 2025
    • TCS

      TechCentral Nexus S0E3: Behind Takealot’s revenue surge

      23 June 2025

      TCS | South Africa’s Sociable wants to make social media social again

      23 June 2025

      TCS+ | AfriGIS’s Helen Hulett on how tech can help resolve South Africa’s water crisis

      18 June 2025

      TechCentral Nexus S0E2: South Africa’s digital battlefield

      16 June 2025

      TechCentral Nexus S0E1: Starlink, BEE and a new leader at Vodacom

      8 June 2025
    • Opinion

      South Africa pioneered drone laws a decade ago – now it must catch up

      17 June 2025

      AI and the future of ICT distribution

      16 June 2025

      Singapore soared – why can’t we? Lessons South Africa refuses to learn

      13 June 2025

      South Africa risks being left behind as stablecoins reshape global finance

      6 June 2025

      Beyond the box: why IT distribution depends on real partnerships

      2 June 2025
    • Company Hubs
      • Africa Data Centres
      • AfriGIS
      • Altron Digital Business
      • Altron Document Solutions
      • Altron Group
      • Arctic Wolf
      • AvertITD
      • Braintree
      • CallMiner
      • CambriLearn
      • CYBER1 Solutions
      • Digicloud Africa
      • Digimune
      • Domains.co.za
      • ESET
      • Euphoria Telecom
      • Incredible Business
      • iONLINE
      • Iris Network Systems
      • LSD Open
      • NEC XON
      • Network Platforms
      • Next DLP
      • Ovations
      • Paracon
      • Paratus
      • Q-KON
      • SevenC
      • SkyWire
      • Solid8 Technologies
      • Telit Cinterion
      • Tenable
      • Vertiv
      • Videri Digital
      • Wipro
      • Workday
    • Sections
      • AI and machine learning
      • Banking
      • Broadcasting and Media
      • Cloud services
      • Contact centres and CX
      • Cryptocurrencies
      • Education and skills
      • Electronics and hardware
      • Energy and sustainability
      • Enterprise software
      • Fintech
      • Information security
      • Internet and connectivity
      • Internet of Things
      • Investment
      • IT services
      • Lifestyle
      • Motoring
      • Public sector
      • Retail and e-commerce
      • Science
      • SMEs and start-ups
      • Social media
      • Talent and leadership
      • Telecoms
    • Events
    • Advertise
    TechCentralTechCentral
    Home » In-depth » Google’s ‘Babel fish’ heralds future of translation

    Google’s ‘Babel fish’ heralds future of translation

    By Editor9 January 2012
    Twitter LinkedIn Facebook WhatsApp Email Telegram Copy Link
    News Alerts
    WhatsApp
    Ashish Venugopal

    In Douglas Adams’s famous Hitchhiker’s Guide to the Galaxy series of science-fiction books, interstellar species use Babel fish — “small, yellow, leech-like” creatures that feed on “brain-wave energy” — to translate speech in real time.

    A team of developers at Google is working on the real thing, using statistical models to translate different languages, including Afrikaans, on the Web and on mobile phones, using voice input and output as well as text.

    TechCentral sat down with Google Translate research scientist Ashish Venugopal at Google’s headquarters in Silicon Valley last week and asked him about the stumbling blocks to effective real-time translation and the future of the technology. This is an edited transcript of that interview.

    TechCentral: How many languages does Google Translate now support?

    Ashish Venugopal: There are 63 languages supported. That’s a lot of languages. How do we get all that data in there? If we tried manually to give the system those languages, it would be a hopeless task. The only possible way we could do this is to harness the power of machine computation. We build statistical models that are automatically training themselves and learning all the time. As people translate new content on the Web, our systems pick this up and it adds the words. The system is constantly reading and analysing the Web. It’s a statistical approach. The idea is that once we learn the essential model of how to speak a word, and we can apply that to every word. We haven’t memorised every word.

    Are there any difficult languages that make it hard to get translation right?

    Yes, there are some incredibly tough languages. If your language is very different from English, for example, then it will be very difficult to translate it to English. We use English as an intermediate language and so if you were translating from Russian to Japanese, we’d translate the Russian to English and then to Japanese.

    When we talk about a “tough” language, it’s one that is really different compared to English. There are languages that are very different in multiple dimensions.

    The first question to ask is, what is the order of words from in one language compared to English. In English, we’d put the subject first, then the verb and then the object, whereas the Japanese have the subject first, then the object and finally the verb. We have to teach computers how to recognise this reordering pattern.

    We don’t tell the computer how to translate every sentence. We give it general patterns to look for. When it sees new data, it uses those patterns, matches that to data and then comes up with a model that it uses to translate sentences.

    When we say languages are harder, they’re harder because of the ordering of words, they’re harder because there may be different notions of what a word even is. In English, you say you put the phone on the table — “phone” and “table” are objects and “on” is an additional word that explains what’s happening. In other languages, the “on” could be glued onto the word “phone” or “table” and we have to teach the computer that “on” could be connected to the object or be separate from it.

    All these issues get easier when there’s more source data. We launch languages when we feel they are adding value to somebody. We have “alpha” or experimental languages where we were just able to launch the system, as opposed to it being fluent and correct. The alpha languages tend to have less source data available online.

    What are the main stumbling blocks to this technology and what will be possible in the future?

    We are really reliant on the source data. The first stumbling block for a new language is, is there data on the Web? Once there’s enough content on the Web and as we build our system … on average it works really well. On average, you’ll be very impressed with it. But every once in a while you’ll be irritated with it.

    Because of the statistical approach, you may enter something and get some crazy translation. What we are trying to do is limit those crazy translations and ensure in all cases we are providing a reasonable translation.

    This really comes from the fact that this is a statistical system. We’ve built it so you can literally put anything into it. We will translate anything you give us. It might be good or it might be bad, but on average it will be quite impressive.

    What we are really working on now is clipping the bottom end of the cases where we make mistakes. We see these issues in languages that are very different compared to English. Russian, for example, adds a lot of information to words and they get longer and longer and when we translate we sometimes make mistakes there.

    In the future, in a reasonably short time, we will take machine translation for granted, as part of our everyday lives. I mean that from an 80-20 standpoint, where 80% of the use cases we’ll be able to address effectively. The last 20% will be incredibly hard. That speaks to the fact that machine translation won’t be a substitute for a human translator.

    No one is going to take an important political speech and put it into machine translation to publish it in 20 different languages. Our goal is not to create artificial intelligence; our goal is to provide an 80% solution where you’ll be able to understand the political speech’s point, but not it’s rhetoric, not it’s beauty necessarily.

    Is the future of this technology instant voice translation using devices like mobile phones to facilitate real-time translation of conversations?

    We can do that already, but not simultaneously. It’s not an immediate goal. It’s a matter of where we are focusing now. There’s still more work to be done on the quality side before we can start to develop this continuous form of operation.

    Will you continue to do translation in the cloud (online on servers) or will it move down to devices like phones as they get more powerful?

    We make all our decisions purely based on quality. We want to ensure the highest quality translations are delivered to our users in the shortest possible time and that’s leaning towards the cloud for now, but that might change.

    What sort of computing power does Google Translate require?

    We use the full power of Google’s search engine. The reason Google Translate exists is because of the investments made in search. We sit on top that search infrastructure.

    Do you have a team of linguists working all over the world?

    We have a team of statisticians, all working right over there [points and laughs]. It’s less linguistically orientated. There are linguistic ideas that influence our decisions. To give you an example, when I was working on the last set of Indian languages that were launched, I didn’t use any linguistic knowledge; I used Wikipedia and my grandmother. So, it’s Wikipedia, my grandmother and statistics. That’s what we use to put a language together.  — Duncan McLeod, TechCentral

    • Subscribe to our free daily newsletter
    • Follow us on Twitter or on Google+ or on Facebook
    • Visit our sister website, SportsCentral (still in beta)


    Ashish Venugopal Google Google Translate
    Subscribe to TechCentral Subscribe to TechCentral
    Share. Facebook Twitter LinkedIn WhatsApp Telegram Email Copy Link
    Previous ArticleToshiba Portégé R830 review: not quite the Air
    Next Article New sub-sea cables to drive down broadband prices

    Related Posts

    Siemens is battling Big Tech for AI supremacy in factories

    24 June 2025

    Apple shifts its AI strategy

    23 June 2025

    Stolen phone? Samsung now buys you an hour to lock it down

    18 June 2025
    Company News

    Communication costs exploding? Telviva has a fix for UK-SA teams

    24 June 2025

    Section 18A deductions and BEE points – a strategic choice for business compliance in 2025

    24 June 2025

    Huawei Watch Fit 4 Series: beauty, brains and a battery that won’t quit

    24 June 2025
    Opinion

    South Africa pioneered drone laws a decade ago – now it must catch up

    17 June 2025

    AI and the future of ICT distribution

    16 June 2025

    Singapore soared – why can’t we? Lessons South Africa refuses to learn

    13 June 2025

    Subscribe to Updates

    Get the best South African technology news and analysis delivered to your e-mail inbox every morning.

    © 2009 - 2025 NewsCentral Media

    Type above and press Enter to search. Press Esc to cancel.