    How SA start-up wants to ‘reinvent search’

    By Editor, 2 February 2012
    [Photo: Carl and Sally Greyling]

    In an unassuming house on a golf estate in Centurion, south of Pretoria, nearly 30 Dell desktop computers run 24 hours a day in a makeshift server room. The machines are crawling Web feeds of breaking news along with all of the text of the US Library of Congress.

    This is the home of technology start-up Gatfol (pronounced gat-fole, not like the Afrikaans gatvol), which hopes to make search as intuitive as speech.

    Run on a shoestring budget by husband-and-wife team Carl and Sally Greyling, Gatfol has gradually taken shape over the past nine years. They hope to launch their first product in a fortnight — an applet that will work with Internet browsers and turn any search bar into a powerful tool that can handle far longer and more complex queries than currently possible.

    Inspired by artificial intelligence (AI) projects such as CYC and Princeton’s WordNet, the Greylings are attempting to make it possible for people to talk to machines, not in code or carefully constructed search terms, but using ordinary language. Carl Greyling says he likes to think of the project as “language talking to data”.

    Greyling started his career as an auditor and continues to do accounting work to make ends meet. He studied psychology and neurology at Unisa with the original aim of creating artificial intelligence software robots, or “bots”, for the likes of online community Second Life.

    “I started switching to data and search about a year ago,” he says.

    Greyling says it’s “hugely difficult” to use everyday language for search because it’s hard to carry every permutation of spoken language in a search engine.

    He looked at current and past semantic intelligence systems and found that for the most part their structures are either hierarchical or based on Boolean logic. “It’s been that way for the past 40 years, ever since early AI.”

    He says the problem that crops up in every AI system is the sheer number of possible permutations. Greyling says that if one takes a sentence like “I like to work on my computer in the morning” as an example, and considers possible replacements for the pronouns or nouns or verbs that would still make the sentence grammatically correct, the result is in the region of 10 to the power of eight possible permutations.
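
The arithmetic behind that estimate can be sketched as follows. The per-slot counts below are made-up assumptions, chosen only to show how quickly the product of substitutions grows; they are not figures supplied by Greyling.

```python
from math import prod

# Hypothetical number of grammatically valid substitutes for each replaceable
# word in "I like to work on my computer in the morning". These counts are
# illustrative assumptions, not measurements.
substitutes_per_slot = {
    "I": 10,          # pronouns and names
    "like": 20,       # verbs of preference
    "work": 50,       # activities
    "my": 10,         # possessives
    "computer": 100,  # objects one can work on
    "morning": 10,    # times of day
}

# Each slot varies independently, so the counts multiply.
total_variants = prod(substitutes_per_slot.values())
print(f"{total_variants:,} possible sentences")
```

Because the counts multiply rather than add, even a modest list of alternatives per word produces a total in the region of 10 to the power of eight.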

    “Simply describing the contents of a room could generate billions of permutations,” he says. The problem is, in order to remain workable, search engines require an engine that is small enough to process requests quickly.

    “You need an engine that’s tiny; literally a few megabytes,” Greyling says. He began looking at three-dimensional matrices and other mathematical systems to solve the problem and eventually stumbled onto the 100-year-old work of Russian mathematician Andrey Markov.

    “Markov’s chain analysis looks at strings or words in pairs — the words on either side of one another — and from this you can build up larger strings.”

    On the back of Markov’s approach, Greyling developed a tiny engine based on a two-dimensional matrix, “where every single concept in the matrix links to every other”. This, he says, gives the engine the power to consider trillions of permutations while remaining lightweight.
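
A minimal sketch of the general idea, assuming a plain first-order Markov model over adjacent word pairs stored in a two-dimensional matrix. This is the textbook technique rather than Gatfol's engine, whose details are not public; the function names and the two-sentence corpus are hypothetical.

```python
def build_pair_matrix(sentences):
    """Count how often each word is directly followed by each other word."""
    vocab = sorted({w for s in sentences for w in s.lower().split()})
    index = {word: i for i, word in enumerate(vocab)}
    # matrix[i][j] = number of times vocab[i] is followed by vocab[j]
    matrix = [[0] * len(vocab) for _ in vocab]
    for s in sentences:
        words = s.lower().split()
        for left, right in zip(words, words[1:]):
            matrix[index[left]][index[right]] += 1
    return vocab, index, matrix


def likely_next(word, vocab, index, matrix):
    """Return the observed probability of each word that follows `word`."""
    row = matrix[index[word]]
    total = sum(row)
    return {vocab[j]: count / total for j, count in enumerate(row) if count}


corpus = [
    "I like to work in the morning",
    "I like to read in the evening",
]
vocab, index, matrix = build_pair_matrix(corpus)
print(likely_next("the", vocab, index, matrix))
```

The matrix grows with the square of the vocabulary, not with the number of possible sentences, which is why a pairwise model can stay small while still covering a very large space of word sequences.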

    He says the approach isn’t entirely new but that, where it has been employed, people have still used a word’s placement in a text to determine equivalences for it. He says this is ineffective because it cannot contend with factors like context.

    Conversation, says Greyling, is notoriously difficult. He uses the example of the sentence “my mother is in hospital with cancer” and says a standard response would be something like “I’m sorry to hear that”, which is based on an understanding of the words “mother”, “hospital”, and “cancer”. But were one to replace “mother” with “Obama”, that response would no longer be the right one.

    He says in order to deal with plain, conversational language, engines must be able to deal with “multi-word equivalence”. Greyling believes he’s made a breakthrough in this regard and has a provisional patent in the US for his approach.

    The approach looks at a phrase from multiple viewpoints. He likens it to looking at an image of a landscape as a whole while looking at each tree or cloud or other element in close focus simultaneously. This approach forms the basis of the applet he hopes will be available within weeks.

    Gatfol operates with little to no funding. The backbone of the operation consists of 30 second-hand Dell PCs purchased from HSBC when the bank was going through an upgrade cycle. Running in parallel, the machines process about 2TB of Web data a month. Connectivity comes in the form of a leased line from MWeb.

    [Image: Russian mathematician Andrey Markov]

    Greyling says he can’t keep the PCs’ cathode-ray tube monitors on all the time because they not only use a large amount of electricity but also generate an enormous amount of heat. He monitors the temperature in the server room using digital and mercury thermometers and uses one wall-mounted air-conditioning unit, a freestanding unit and table fans to keep temperatures down.

    Most nights Greyling sleeps in the makeshift server room because he has become so used to the sounds and temperatures that even with the screens off he can tell if a machine has crashed or entered into an infinite loop.

    A power outage of even a few seconds means Greyling has to spend a full working day getting all of the machines up and running again because he has to find the operation with which they were busy when the electricity went out.

    Though he has a small generator, he says it’s only sufficient to power three or four machines. Nevertheless, he’s glad for the equipment he has because it allows him to keep working on a project to which he’s dedicated many years of his life.

    Each machine looks at a different element of the data, while a master machine looks at all of them simultaneously. “We take every matrix we have and give it a different focus on the data that comes in. That’s what the brain does; it deals with the general picture and detail at the same time.”

    At one stage, Greyling ran 55 chat bots worldwide. From that experience, he got his inspiration for a project that could deal with “human language intelligence”.

    He says that, even after 40 years of people working on the problem, the best example we have is Apple’s Siri voice recognition software. “Siri is good but it’s still struggling with two-word combinations. A phrase like ‘alcohol poisoning’ still gives you liquor stores nearby.”

    Gatfol works by breaking a long phrase into roughly a thousand packets, each of which is run through Gatfol’s matrices to produce semantically equivalent phrases – that is, phrases with the same meaning. These are then ranked and the best of them used for the search query.
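
The pipeline described above (split into packets, look up equivalents, rank) might look roughly like this. It is a toy sketch under stated assumptions: the `EQUIVALENTS` table, its scores and the function names are all hypothetical stand-ins for Gatfol's matrices, and the packets here are simply one- and two-word windows.

```python
# Hypothetical equivalence table: packet -> [(equivalent phrase, score)].
# A real system would derive these from data; here they are hand-written.
EQUIVALENTS = {
    "vibrant": [("exuberant", 0.9), ("lively", 0.8)],
    "outgoing personality": [("joie de vivre", 0.7), ("sociable nature", 0.85)],
}


def packets(phrase, max_len=2):
    """Split a phrase into every run of 1..max_len consecutive words."""
    words = phrase.lower().split()
    return [
        " ".join(words[i:i + n])
        for n in range(1, max_len + 1)
        for i in range(len(words) - n + 1)
    ]


def ranked_equivalents(phrase, table, top=3):
    """Collect equivalents for every packet and keep the best-scoring ones."""
    candidates = []
    for packet in packets(phrase):
        candidates.extend(table.get(packet, []))
    candidates.sort(key=lambda pair: pair[1], reverse=True)
    return [text for text, _score in candidates[:top]]


best = ranked_equivalents("vibrant outgoing personality", EQUIVALENTS)
print(best)
```

Only the top-ranked equivalents are passed on, which keeps the number of searches issued per query bounded however many packets the phrase produces.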

    “To search in parallel is not so much of a problem,” says Greyling. “[Database query language] SQL can do it, so volume-wise there’s not such an enormous load on databases, but it produces far more relevant results.”

    Greyling says that, for example, were one to search a dating site for someone with a “vibrant, outgoing personality”, his engine would return results that mentioned “joie de vivre” and “exuberance” because, although these aren’t the same words, they’re “semantically similar”.
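
To illustrate how a set of equivalent phrases could be searched in one SQL query, here is a small sketch using Python's built-in `sqlite3` module. The `profiles` table, its contents and the `search_profiles` helper are invented for the example, and the list of equivalent phrases is supplied by hand rather than generated by any engine.

```python
import sqlite3


def search_profiles(conn, phrases):
    """Return names of profiles whose bio contains any of the given phrases."""
    if not phrases:
        return []
    # One LIKE condition per phrase, joined with OR; values are bound as
    # parameters rather than pasted into the SQL string.
    clause = " OR ".join("bio LIKE ?" for _ in phrases)
    params = [f"%{p}%" for p in phrases]
    rows = conn.execute(
        f"SELECT name FROM profiles WHERE {clause} ORDER BY name", params
    )
    return [name for (name,) in rows]


# Hypothetical dating-site data, held in an in-memory database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE profiles (name TEXT, bio TEXT)")
conn.executemany(
    "INSERT INTO profiles VALUES (?, ?)",
    [
        ("Alex", "Known for my joie de vivre and love of hiking"),
        ("Sam", "Quiet reader who enjoys chess"),
        ("Thandi", "Friends say my exuberance is contagious"),
    ],
)

literal = search_profiles(conn, ["vibrant, outgoing personality"])
expanded = search_profiles(
    conn, ["vibrant, outgoing personality", "joie de vivre", "exuberance"]
)
print(literal, expanded)
```

The literal query matches nothing, while the expanded query finds the two profiles that express the same idea in different words.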

    He says this notion of semantic similarity is of particular interest to institutions like US security agencies that trawl the Web looking for blog posts or tweets or anything else that might prove useful in preemptively detecting an assassination or terrorist attack.

    “Say someone wants to plant a bomb, you won’t get obvious words, perhaps, but you might find ‘this is my final act’ or ‘solving the problem’ or something similar.”

    Eventually, Greyling hopes the approach can also be used for the problem of image recognition, which, he says, is still predominantly tag based. “Look at a window — it could be a painting with a border, or a mirror, or a window, or any number of other things.”

    He says the search engine could be used for all manner of queries and may also be useful in instances where people are looking for something that has subsequently come to be called something else, like when someone searches for an antique car part on eBay, the online auctions site.

    “In the case of something like Google, the data is already there, but we have such primitive tools to get it out.”

    Though the applet will be free, Greyling plans to create a selection of application programming interfaces for business and hopes to monetise those. He also hopes to move the crawling work to the cloud when funding allows for it.

    According to Greyling, the biggest challenge Gatfol has faced is funding. He says it’s almost impossible to sell an idea and that investors want a “proof of concept”.

    It’s his hope that the applet, when it is released, will encourage investors that have expressed some interest to look at the project more seriously and pique others’ curiosity.  — Craig Wilson, TechCentral
