Skip to content
  • Products

    Products

    Astro - Online Category Tracking

    Online market share and category insights for consumer brands and manufacturers

    Tradewinds

    Cube's strategic e-commerce market data offering for investors, internet platforms, and corporates.
  • Technology

    Technology

    Why Cube?

    We achieve best-in-class data accuracy and granularity through statistical and machine learning models

    Data Collection

    Multiple data sources ensure accurate, high-quality insights by triangulating proprietary and external data.

    Data Security

    At Cube, safeguarding data integrity and confidentiality is our foremost priority.

    Product Tagging

    We have the most comprehensive classification system in the industry across platforms and countries
  • Insights

    Insights

    Cube Pulse - Articles

    Stay updated with our latest research, insights, and perspectives on hot topics in e-commerce

    Research reports

    Free to download research reports to help you understand the e-commerce landscape better

    E-commerce Glossary

    We've put all e-commerce terminology in one place so you can find the answers you are searching for

    E-commerce Platform Take-rate tracker

    This tracker provides up-to-date insights into current e-commerce platform take-rates by country and product category, offering a clear view of the marketplace dynamics.

    E-commerce Category Tree

    Cube's coverage follows a unified, standardized category tree across countries and platforms.

    OSCX index

    A benchmark of e-commerce seller sentiment across Southeast Asia and other key Shopee markets
  • Community

    Community

    Shopper Panel

    Join our Online Shopper Panel and answer paid surveys about your habits and opinions about e-commerce

    Seller Panel

    Join the Cube Data Partnership to help us build better insights for the ecosystem
  • Company

    Company

    About us

    Learn more about our mission, credentials, and how we work.

    Our Team

    We recruit great people and help them become better. Collaborate, Innovate, and Lead with Data at Cube

    Careers

    If you love working with data, hold a high bar for quality, and enjoy working with super-smart and collaborative people, Cube is the perfect place for you.

    Contact us

    Send us a question, ask to be contacted, or book a meeting with our expert team today

    Cube in the news

    Cube is frequently quoted in news stories about e-commerce. See our latest coverage here

    Security

    Explore how Cube protects data through enterprise-grade security, strict controls, and recognized compliance standards.

    Trust Center

    Highlights of our high-level risk mitigation strategy, industry best practices, and commitment to continuous improvement.
Contact us
  • Solution
    • E-commerce market data & insights
    • Tradewinds – Strategic Market Data for Southeast Asia E-Commerce
  • Technology
    • Why Cube Asia?
    • Data Collection
    • Data Security
    • Product Tagging
  • Insights
    • Cube Pulse – Articles
    • Research Reports
    • E-Commerce Glossary
    • Shopee Take-Rate Tracker
    • E-commerce Category Tree
    • OSCX Index
  • Community
    • Shopper Panel
    • Seller Panel
  • Company
    • About us
    • Our Team
    • Career
    • Cube in the news
  • Contact us

AI reality check: Can ChatGPT and Google Bard produce reliable e-commerce insights?

  • By Simon Torring
  • July 17, 2023
Link

 

Context: Breakthrough capabilities, but risks of misinformation

So much has been written about ChatGPT since its launch that most of us have probably wondered how AI will enhance or challenge our jobs. A recent BBC story carried this ominous quote – “Workers that don’t work with AI are going to find their skills [become] obsolete … it’s imperative to work with AI to stay employed.” Amidst all these “change or you shall be replaced” warnings, it’s easy to empathize for Steven Schwartz, a New York lawyer who relied on ChatGPT for research, only to realize later that “six of the submitted cases appear to be bogus judicial decisions with bogus quotes and bogus internal citations.”

AI systems such as ChatGPT and Google Bard no doubt possess remarkable capabilities in terms of speed, information processing, natural language understanding, and responsive communication. However, it is less clear just how good they are for research. In this post, we delve into this specific question, and try to use data to evaluate their performance within the specific context of the Southeast Asian e-commerce landscape.

 

Beyond the Surface: Analyzing ChatGPT and Bard’s ecommerce knowledge 

Every day, our analysts research news websites, company reports, and government publications for any new announcements or information relating to e-commerce in Asia. 

To gauge the performance of ChatGPT and Bard in research, we conducted an assessment using 50 data points related to the Southeast Asian e-commerce market, including the gross merchandise value (GMV) of different countries and categories. To ensure fairness, all the data points queried were for the year 2020, considering ChatGPT’s knowledge limitation up until September 2021.

 

The results were tagged in three simple buckets:

  • Green – reliability; AI was able to generate answers and point to real sources where the data existed
  • Yellow – hallucinations*; AI either quoted a wrong data point or invented a source that doesn’t exist
  • Gray – honesty; AI acknowledged that it did not know the answer

* Hallucinations refer to the creation of nonexistent sources and information, where the AI fabricates facts and details instead of admitting its lack of knowledge. Within our research we saw 2 types of hallucinations i) source provided but data not found ii) fabricated data point with no source provided.

The ideal color mix would have been green and gray – for there can only be 2 possibilities, either something is available, or it isn’t. And yet, as highlighted in the chart above, both models provided many manufactured answers. 

 

Truths vs Fiction: Insights reveal reliability gaps and hallucination galore

What we observed:

  • The findings revealed that both ChatGPT and Bard had accuracy rates below 20%, indicating significant room for improvement. The results did however improve between May and June, suggesting the teams behind the models are hard at work trying to improve them over time. 
  • ChatGPT’s honesty showed improvement as it refused to provide answers in 50% of the cases, up from 32% in May. This suggests that the company may be taking steps to reduce hallucinations. However, there was a noticeable negative trend in reliability, with the share of reliable (green) answers dropping from 14% to 4%.
  • As for Bard we observed that the AI never indicated that it doesn’t know an answer, hallucinating >80% of the time. Furthermore, in 50% of cases, Bard reported data without any source. Bard’s warning about its accuracy upon sign-up is spot on – the high volume of fabrication undermines its suitability as a research tool for now.

 

We’ve highlighted one particular egregious example of hallucinations below. We asked Google Bard to look through YouTube transcripts to see if there is any data available about Lazada, and it referred to a video interview with the Lazada Philippines CEO that it claimed was uploaded on 28 May 2023, had over 1,000 views, and at least 3 comments.

 

Remarkably however, all 3 of these data points, which could all be easily verified, were incorrect – the interview had happened a full year ago (on 04 April 2022), has fewer than 448 views, and only 1 comment!

 

Decoding the trends: Our theory of factors behind AI performance challenges

In our quest to understand the factors contributing to the observed results, we identified three potential reasons:

  • Generative Nature of Language Models: ChatGPT and Bard, being language models, are primarily trained as a generative tool and do not possess the ability to differentiate between fact and fiction. Simon Willison, a software developer, explains that large language models rely on statistical probability from their training data to merely predict the next word, which can lead to confabulation. Benj Edwards’ article on why AI models hallucinate is also a good read.  
  • Overfitting: Overfitting is a common issue in machine learning, where a model becomes excessively tailored to the training data, making it difficult to generalize to new or unseen data. This phenomenon could contribute to the inconsistency and lack of accuracy observed in AI systems.
  • Lack of Contextual Understanding: AI systems currently struggle with contextual understanding, which encompasses elements like common sense, nuanced details, emotions, social dynamics, and human behavior. These limitations hinder their ability to accurately interpret and predict outcomes, particularly in areas that require a comprehensive understanding. Ted Chiang’s New Yorker piece gives a great insight into the limitations of AI.

 

Final word: Generative AI tools can’t replace human researchers just yet

Our research indicates that popular generative AI models suffer from extensive hallucinations, providing confident yet fabricated information. This makes them hard to rely on and forces analysts to exercise plenty of caution and skepticism when using them for e-commerce research.

Although generative AI continues to evolve rapidly, it is not yet a substitute for human analysts and researchers. Just how much longer would that continue to be the case? On that question our guess is probably as good as yours. 

Related Articles

Shopee

January 13, 2026

LLMs at Cube: Google Gemini leads as of end-2025, but the race is not yet over

TikTok Shop

February 24, 2025

The best is yet to come: TikTok Shop’s pivotal 2024, and what’s in store for 2025
Top three mobile commerce that is popular in Malaysian market

In the news, Lazada, Shopee, TikTok Shop

February 5, 2025

Go big or go home: Signs of consolidation in Southeast Asia e-commerce

Become part of our e-commerce community

Join 2,000+ other leaders and experts to stay informed about the latest news, insights, and updates

  • info@cube.asia
  • Cube

Products

  • Astro - Online Category Tracking
  • Tradewinds

Community

  • Shopper Panel
  • Seller Panel

Technology

  • Why Cube?
  • Data Collection
  • Data Security
  • Product Tagging

Insights

  • Cube Pulse - Articles
  • Research reports
  • E-Commerce Glossary
  • Take-rate tracker
  • E-commerce Category Tree
  • OSCX Index

Company

  • About us
  • Our Team
  • Careers
  • Contact us
  • Cube in the news
  • Security
  • Trust Center

Copyright 2025 Cube. All Rights Reserved.

Your message has been successfully sent

We appreciate that you’ve taken the time to write us. We’ll get back to you very soon. Please come back and see us often.