Vectara Launches Factual Consistency Score Powered by Upgraded Hughes Hallucination Evaluation Model to Enhance Transparency in GenAI Responses
26.3.2024 16:00:00 EET | Business Wire | Press release
Vectara, the trusted Generative AI product platform, announced the inclusion of a Factual Consistency Score (FCS) for all generative responses. The score is based on an evolved version of the Hughes Hallucination Evaluation Model (HHEM), the #1 hallucination detection model on Hugging Face with 100,000+ downloads since its launch in November 2023. The associated Hallucination Leaderboard is now the industry standard for how LLMs benchmark their average factual consistency. By providing real-time, end-to-end RAG observability, Vectara’s Retrieval Augmented Generation-as-a-service (RAGaaS) platform offers an industry-first feature for GenAI response transparency. The metric provides visibility into the factual consistency of summarized responses within the platform and lets users set their own thresholds for response acceptance based on a detailed accuracy score.
With average hallucination rates of LLMs on the market ranging from 3% to 16.2%, the risk of unknown inaccuracies in their responses remains a major concern, preventing widespread business adoption of this powerful technology. Vectara mitigates this ambiguity for enterprises by providing a Factual Consistency Score that grades how likely a generated response is to be factually consistent rather than a hallucination. Only with a standardized, scientifically calculated method for grading responses can businesses responsibly introduce GenAI into business-critical applications. Users can set thresholds for response acceptance based on a detailed accuracy score, giving product teams the flexibility to act on this information according to their preferences.
Vectara’s Factual Consistency Score sets a new benchmark for real-time hallucination detection in GenAI, offering superior performance, affordability, and speed, and marking a significant step forward in trust. Its efficiency and effectiveness enable businesses to deploy GenAI into critical product use cases without worrying about exposure to liabilities that might arise from hallucinated responses.
Vectara's Factual Consistency Score equips developers with the capability to refine and enhance a wide range of applications, from internal Q&A systems to the quality of interactions with end consumers. The strength of this score lies in its calibration, making it interpretable as a direct probability—for instance, a score of 0.98 indicates a 98% probability of factual consistency. This contrasts sharply with many contemporary ML classifiers that disregard calibration, thus sacrificing clarity and direct interpretability.
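As an illustration of how a calibrated score might be used to gate responses, here is a minimal sketch in Python. It does not call Vectara's API; the `ScoredResponse` class, the `accept` and `split_by_threshold` functions, and the default threshold of 0.9 are all hypothetical names and values chosen for this example. The only assumption taken from the release is that the score is a probability between 0 and 1, where 0.98 means a 98% probability of factual consistency.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class ScoredResponse:
    """Hypothetical container: a generated answer plus its consistency score."""
    text: str
    factual_consistency: float  # assumed calibrated probability in [0, 1]


def accept(response: ScoredResponse, threshold: float = 0.9) -> bool:
    """Return True when the score meets or exceeds the chosen threshold."""
    if not 0.0 <= response.factual_consistency <= 1.0:
        raise ValueError("score must be a probability between 0 and 1")
    if not 0.0 <= threshold <= 1.0:
        raise ValueError("threshold must be between 0 and 1")
    return response.factual_consistency >= threshold


def split_by_threshold(
    responses: List[ScoredResponse], threshold: float = 0.9
) -> Tuple[List[ScoredResponse], List[ScoredResponse]]:
    """Partition responses into (accepted, rejected) lists, preserving order."""
    accepted = [r for r in responses if accept(r, threshold)]
    rejected = [r for r in responses if not accept(r, threshold)]
    return accepted, rejected


if __name__ == "__main__":
    # Illustrative scores only; these are not real model outputs.
    batch = [
        ScoredResponse("Answer grounded in the retrieved documents.", 0.98),
        ScoredResponse("Answer with a weakly supported claim.", 0.62),
    ]
    shown, withheld = split_by_threshold(batch, threshold=0.9)
    print(len(shown), "accepted;", len(withheld), "rejected")
```

Because the score is described as calibrated, the threshold can be read directly as a risk tolerance: a product team that accepts responses at 0.9 and above is accepting answers with at least a 90% estimated probability of being factually consistent. A stricter application could raise the threshold, and a rejected response could be withheld, regenerated, or flagged for review.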
"Integrating Vectara's Factual Consistency Score into the Yobi app will revolutionize how we handle AI transparency and accuracy for business use cases. By providing visibility and accountability into answers provided by our platform, we can stay true to our commitment to responsible AI that enterprises can depend on,” said Ahmed Reza, Founder and CEO of the Yobi app. “As a Co-Innovate Partner with Vectara, we're thrilled to see such advanced technology directly incorporated into the Vectara platform.”
The advanced HHEM that powers the Factual Consistency Score gives greater visibility than previously released open-sourced versions, offering enhanced accuracy and extended language support. This initiative is part of Vectara's commitment to transparency and control, empowering businesses with the autonomy to manage AI responses effectively.
"Just as we were early in pioneering RAG to enhance the relevance and quality of generated content, we are once again at the forefront of responsible AI by being completely open about our efforts to mitigate hallucinations in Generative AI," said Amr Awadallah, co-founder and CEO of Vectara. "By providing our customers with real-time access to factual consistency scores, we're not just engineering trust; we're handing over the control, enabling them to make informed decisions on how to utilize the responses generated by our RAGaaS platform."
About Vectara
Vectara is an end-to-end platform for embedding powerful generative AI features into applications with extraordinary results. As an end-to-end Retrieval Augmented Generation (RAG) platform, Vectara delivers the shortest path to a correct answer/action through a safe, secure, and trusted entry point. Vectara never trains on your data, allowing businesses to embed generative AI capabilities without the risk of data or privacy violations. To learn more, visit vectara.com.
View source version on businesswire.com: https://www.businesswire.com/news/home/20240326712242/en/
Contact information
Carly Bourne
carly@bulleitgroup.com
423-443-0449
