๐Ÿ”ฌLLM Evaluation

Prompt engineering, model metrics, performance benchmarking & data quality

22
Topics
66
Quiz Qs
7
Tabs/Topic

๐Ÿงช Prompt Testing

Prompt Templates
3 quiz Qs
Few-Shot Prompting
3 quiz Qs
Chain-of-Thought Prompting
3 quiz Qs
System Prompt Design
3 quiz Qs
Adversarial Prompting
3 quiz Qs
Prompt Injection Testing
3 quiz Qs

๐Ÿ“Š Model Evaluation

Accuracy, Precision & Recall
3 quiz Qs
BLEU & ROUGE Metrics
3 quiz Qs
Hallucination Detection
3 quiz Qs
Benchmarks: MMLU & HumanEval
3 quiz Qs
Bias & Fairness in AI
3 quiz Qs
Human Eval & RLHF
3 quiz Qs

โšก Performance Testing

Latency Benchmarking
3 quiz Qs
Throughput & Concurrency
3 quiz Qs
Token Budget Optimization
3 quiz Qs
Streaming Performance
3 quiz Qs
Cost Estimation & Optimization
3 quiz Qs

๐Ÿ“‹ Data Quality Testing

Training Data Validation
3 quiz Qs
Data Drift Detection
3 quiz Qs
Label Quality Assessment
3 quiz Qs
Feature Distribution Analysis
3 quiz Qs
Outlier Detection
3 quiz Qs
๐Ÿง  Score
0/0
correct