AI & LLM Testing Automation Course with Python, DeepEval and CI/CD

AI & LLM Testing Automation Course with Python, DeepEval & CI/CD – Live Training

This AI & LLM Testing Automation Course equips QA engineers and software testers with the skills required to test and validate modern AI, Generative AI, and LLM-based applications. The course begins with Python foundations for test automation, followed by in-depth coverage of AI and LLM testing challenges, RAG (Retrieval-Augmented Generation) evaluation, and AI model testing strategies. Learners will understand different types of AI testing, including bias, fairness, and safety along with industry-standard LLM evaluation metrics such as faithfulness, relevancy, robustness, and toxicity. The program provides hands-on training with Playwright for UI automation, DeepEval, and OpenAI APIs, including local LLM testing using Ollama. It also covers AI test automation, CI/CD integration using GitHub Actions, and concludes with real-world case studies and a capstone project focused on end-to-end AI and LLM application testing.

About the Instructor:

About the Trainer: Kavya

Kavya is an accomplished Quality Engineering and Test Automation Professional with over 7 years of industry experience in software testing, test automation, and quality assurance practices. She has worked extensively on building and maintaining automation frameworks, implementing UI and API test automation, and integrating automated testing into CI/CD pipelines for enterprise applications.

She has successfully trained 200+ students and professionals, helping them upskill in automation testing, AI testing, and modern QA methodologies. With a strong focus on AI, LLM, and Generative AI testing, Kavya brings real-world insights into testing intelligent systems, including AI model evaluation, bias and safety testing, and automation strategies.

Known for her practical and learner-focused teaching approach, Kavya simplifies complex concepts using hands-on examples, real project scenarios, and industry best practices, enabling learners to confidently apply their skills in real-world testing environments and advance their QA careers.

Live Sessions Price:

For LIVE sessions – Offer price after discount is ~~129 USD 109 ~~USD~~~~ 119 USD Or ~~15000 INR~~ ~~12900 INR~~ 9900 Rupees.

Enroll For Free Demo

Free Demo On:

Indian Timings: 8th January @ 9 PM – 10 PM (IST)/

U.S Timings: 8th January @ 10:30 AM – 11:30 AM (EST)/

U.K Timings: 8th January @ 3:30 PM – 4:30 PM (BST)

Class Schedule:

For Participants in India: Monday, Tuesday, Thursday & Friday @ 9:00 PM – 10:00 PM (IST)

For Participants in the US: Monday, Tuesday, Thursday & Friday @ 10:30 AM – 11:30 PM (EST)

For Participants in the UK: Monday, Tuesday, Thursday & Friday @ 3:30 PM – 4:30 PM (BST)

What students have to say about Kavya:

I joined this course mainly to understand how AI and LLM testing actually works in real projects. The way Python, Playwright, and DeepEval were explained made it very practical. The hands-on sessions helped a lot. – Faria Aziz

What I liked most is that this is not just theory. We worked on real AI testing scenarios like bias, hallucination, and evaluation metrics. The capstone project gave me confidence to apply this at work. – Arjun Mehta

I was new to AI testing, but the trainer started from the basics and slowly moved to advanced topics. The modules on RAG testing and CI/CD integration were especially useful. – Vibha

Good course for QA engineers who want to move into AI and Generative AI testing. The Playwright automation and LLM evaluation frameworks were explained clearly with examples. – Sarah Johnson

The sessions on OpenAI APIs and running LLMs locally using Ollama were something I had not seen in other courses. It gave a clear idea of how AI testing is done in enterprise environments – Vikram Patel

This course helped me understand the difference between traditional testing and AI/LLM testing. Topics like safety and fairness were covered in a simple and practical way. – Robert Anderson

Overall a well-structured and job-relevant course. The combination of Python, AI testing concepts, automation tools, and real-world use cases makes it worth attending. – Sangeetha Roy

Salient Features:

40+ Hours of Live Training along with recorded videos
Lifetime access to the recorded videos
Course Completion Certificate

Who can enroll for this course?

Software Testers and QA Engineers who want to upskill in AI, LLM, and Generative AI testing
Automation Test Engineers looking to expand into AI test automation using Python and Playwright
Manual Testers aiming to transition into AI and LLM testing roles
QA Leads and Test Managers seeking practical knowledge of AI model evaluation, bias, and safety testing
Developers and SDETs involved in building or testing AI and LLM-based applications
Professionals working on chatbots, RAG systems, and AI-driven products
Engineering graduates and freshers interested in starting a career in AI testing and automation

What will I learn by the end of this course?

Build a strong foundation in Python for AI and test automation, including data handling, logging, and modular code
Understand AI, Generative AI, LLM, and RAG architectures from a testing and quality perspective
Identify and handle unique AI/LLM testing challenges such as non-deterministic outputs, hallucinations, bias, and privacy risks
Apply LLM evaluation metrics like faithfulness, relevancy, robustness, toxicity, and latency
Automate AI and LLM application testing using Playwright, and DeepEval
Test OpenAI-powered applications and run LLMs locally using Ollama for private AI testing
Integrate AI test automation into CI/CD pipelines using GitHub Actions
Gain hands-on experience through real-world case studies and complete an end-to-end AI testing capstone project

Course syllabus:

Module 0: Introduction to AI and testing LLM Applications (1 hr)

Introduction to AI & LLM Applications
What is RAG (Retrieval-Augmented Generation)?
Why RAG matters in AI applications: search, QA, summarization, decision support
RAG Architecture Overview
What is LLM and its application

Module 1: Unique Testing Challenges in AI/LLM (1 hr)

Differences between traditional software and AI systems testing
Probabilistic, non-deterministic outputs
Risks: hallucinations, bias, toxicity, privacy concerns
Ethical and legal implications
Regulatory landscape and compliance basics

Module 2: Types of Testing in AI/LLM Applications (2 hrs)

Functional testing principles in AI contexts
Bias and fairness testing: detecting and mitigating unwanted bias
Safety and ethical testing: toxicity, refusal, harmful content prevention
Performance and scalability testing for AI inference services
Usability and accessibility testing for AI user interfaces

Module 3: Evaluation Metrics for AI/LLM Outputs (1 hr)

Understanding evaluation metrics: what to measure and why
Core metrics: faithfulness, relevancy, completeness
Fairness, bias scores, stereotype detection
Safety metrics: toxicity, refusal rate
Robustness and consistency
Latency and scalability measures
Sentiment, readability, coherence, and ﬂuency
Privacy and compliance metrics

Module 4: Running LLM locally (2 hrs)

Introduction to Ollama: What Ollama is & why a private AI matters and Key use case
Setup: System requirements & installation and Basic conﬁguration
Exploring Models: Pre-trained models & switching between them, Key parameters (temperature, max tokens)
Using Ollama: Text generation, summarization, Q&A and Prompt tips & exporting outputs
Exposing via API: Running locally and API calls & script integration

Module 5: Introduction to OpenAI (4 hrs)

What is OpenAI
How to generate Open AI key
Creating a custom evaluation using Open AI

Module 6: Introduction to Python (4+ hrs)

Introduction to Python and installation setup
Understanding variables, data types, and operators
Control ﬂow: conditional statements and loops
Working with functions and reusable modules
Lists, tuples, sets, and dictionaries in test data handling
String manipulation and regular expressions
File handling and JSON data parsing
Exception handling, debugging, and logging best practices
Virtual environments (venv, pip)
Writing clean, modular Python code
Mini hands-on tasks for QA automation

Module 7: AI/LLM Testing Frameworks and Tools (10+ hrs along with hands-on)

Overview of popular testing frameworks
Playwright for frontend UI automation (4 hrs)
- What is Playwright and why it’s popular
- Setting up playwright
- Core Playwright Concepts – Locators, assertions,..
DeepEval for LLM evaluation (4 hrs)
What DeepEval is and where it ﬁts in the AI testing stack
Key concepts: metrics, test cases, evaluators
Faithfulness, relevancy, etc
Core metrics evaluation
Golden datasets and expected outputs
LLM as Judge
Rule based evaluation

Module 8: Hands-On Setup and Test Automation

Environment setup: js, Python virtualenv, API keys
Writing and running Playwright UI tests
Implementing DeepEval Python test scripts
Automating test suites for continuous integration

Module 9: Integrating AI/LLM Testing into CI/CD Pipelines (5+ hrs)

Overview of CI/CD concepts for AI apps
Connecting tests to GitHub Actions
Automating evaluations on model updates or frontend changes
Monitoring test results and alerting mechanisms

Module 10: Case Studies and Industry Practices (1 hr)

Analysis of real AI system failures and lessons learned
Successful AI testing implementations
Ethical AI and responsible innovation in testing context
Future trends: multimodal AI, self-supervised evaluation

Module 11: Capstone Project and Assessment (8+ hrs)

Learners design and implement a full testing pipeline
Test automation covering UI and model outputs
Evaluate a publicly available LLM or AI chatbot system
Submit report and demo with lessons learned
Peer review and instructor feedback

How can I enroll for this course?

Enroll For Free Demo

For any other details, Call me or Whatsapp me on +91- 90529 03733

Live Sessions Price:

For LIVE sessions – Offer price after discount is ~~129 USD 109 ~~USD~~~~ 119 USD Or ~~15000 INR~~ ~~12900 INR~~ 9900 Rupees.

Events

AI & LLM Testing Automation Course with Python, DeepEval and CI/CD

Event Information

AI & LLM Testing Automation Course with Python, DeepEval & CI/CD – Live Training

AI Powered Automation Testing training with Playwright JavaScript/TypeScript– Live Training Day 3

Mastering Git, Github & Jenkins for Software Testers: Hands-On Guide to Version Control, Continuous Integration & Automated Testing – Live Training

Leave A Reply Cancel reply

Buy Ticket