Ministry of Testing Coventry
Testing LLM-Based Applications: Strategies, Risks & Automated Evaluations
Book now- 08/05/2025
- 6.00pm
- Sainsbury's Store Support Centre, Ansty Park, Warwickshire
About our speaker: Bill Matthews
• Owner & Consultant - Target Testing Ltd
• Technical Lead - Voy Finance
• Strategic Business Consultant
About the talk:
As Large Language Models (LLMs) become integral to modern applications, software testers must adapt their strategies to assess these AI-driven systems effectively. This talk explores the unique challenges of testing LLM-based applications, including their probabilistic nature, hallucinations, bias, and prompt sensitivity.
We will discuss key capabilities and risks that testers should focus on, methodologies for designing robust evaluation frameworks, and automation techniques to streamline testing. Attendees will also learn how to leverage LLMs as judges for qualitative assessments and explore tools that support scalable, systematic testing. Whether you're new to AI testing or looking to enhance your approach, this session will equip you with practical insights to ensure the reliability and fairness of LLM-based applications.
This talk explores the unique challenges of testing LLM-based applications, including their probabilistic nature, hallucinations, bias, and prompt sensitivity.
Hosted by
