Safely manage your Zendesk from the AI assistant you already use, via the Deltastring MCP. Beacon configuration platform
← Back to news

Two-Thirds of Malaysia’s AI Customer Service Chatbots Fail to Understand How We Talk

Entermind's Enterprise Chatbot Quality Index evaluated 24 Malaysian chatbots across e-commerce, fintech, travel, telecom and financial services, revealing that two-thirds fail basic comprehension tests—specifically the ability to understand colloquial language, handle topic shifts, and operate multilingually. The assessment tested each chatbot against 26 standardised criteria grouped into five categories: Comprehension, Access, Experience, Functional Capability, and Safeguards. Only eight chatbots passed both language and slang tests, with Touch 'n Go (82.3%) and Boost (74%) leading the e-commerce segment, whilst the financial services sector averaged just 38.3%, where only Ryt Bank demonstrated genuine conversational capability. The pattern is stark: 78% of chatbots passing at least four of seven Comprehension tests scored above 60% overall, whilst all six triage-layer bots passed Access but failed Comprehension entirely. This reveals comprehension as the sharpest differentiator between functional and dysfunctional implementations.

The implications for CX teams are immediate and troubling. When chatbots cannot understand natural language or handle context shifts, users are forced into rigid menu pathways, forced repetition, and escalation loops—precisely the friction that drives customers toward human agents. The whitepaper documents this directly: users report having to re-explain entire issues after escalation, and evaluators consistently noted chatbots functioning as "glorified search bars" rather than problem-solvers. For teams already managing Zendesk, Freshdesk or Salesforce Service Cloud implementations, this raises a critical question: if your organisation has deployed a chatbot that fails comprehension tests, are you actually reducing agent workload or simply creating a frustration layer that increases escalations and repeat contacts? The data suggests the latter is far more common.

The deeper concern is architectural. None of the 24 chatbots demonstrated cross-session memory—meaning customers are treated as strangers on every interaction, forcing re-explanation of context and history. This is a solvable problem, as evidenced by Ryt Bank's 80,000+ monthly transactions processed through natural language with a sub-1.5% hallucination rate, yet it remains unaddressed across the market. For support leaders evaluating AI investments, the question becomes whether your vendor roadmap includes genuine memory and context retention, or whether you're purchasing a system that will compound your escalation burden. The Malaysian market's failure rate suggests that scale and brand recognition alone do not guarantee chatbot quality—only deliberate investment in language understanding, error handling, and persistent context does.