Annotation Worksheets
Interactive exercises to understand annotation challenges
About These Worksheets
These interactive worksheets help you understand the challenges of annotation for various NLP tasks. Each worksheet contains hands-on exercises where you'll annotate real examples, compare your decisions with classmates, and reflect on sources of disagreement.
Instructor Dashboard
View real-time student responses, agreement statistics, and text answers during class.
Open DashboardTraditional NLP Tasks
13 worksheetsToxicity Detection
Label comments as toxic or not toxic. Explore how sarcasm and context affect interpretation.
ClassificationSentiment Analysis
Determine positive, negative, or neutral sentiment. Handle mixed emotions and implicit opinions.
ClassificationSarcasm Detection
Identify sarcastic statements. Learn why context and tone are crucial for detection.
ClassificationEmotion Detection
Classify text into emotions like joy, anger, fear. Handle multi-label and intensity challenges.
ClassificationStance Detection
Determine if text supports, opposes, or is neutral toward a target. Explore implicit stances.
ClassificationNamed Entity Recognition
Identify and classify entities like people, organizations, locations. Handle boundary decisions.
Sequence LabelingEvent Extraction
Identify events and their participants. Understand triggers and argument structure.
Sequence LabelingSemantic Role Labeling
Label who did what to whom. Identify agents, patients, and other semantic roles.
Sequence LabelingCoreference Resolution
Link pronouns and mentions to entities. Handle ambiguous references and bridging.
Sequence LabelingTemporal Annotation
Mark time expressions and temporal relations. Normalize dates and durations.
Sequence LabelingRelation Extraction
Identify relationships between entities. Handle implicit and multiple relations.
ClassificationCausal Relations
Identify cause-effect relationships. Distinguish correlation from causation.
ClassificationWord Sense Disambiguation
Choose the correct meaning of ambiguous words. Explore sense granularity challenges.
ClassificationLLM-Related Tasks
12 worksheetsLLM Preference (RLHF)
Compare model outputs and choose the better response. The foundation of RLHF training.
LLMHallucination Detection
Identify when LLMs generate false or unsupported information. Critical for AI safety.
LLMPrompt Quality Assessment
Evaluate prompt clarity, specificity, and safety. Learn what makes prompts effective.
LLMSummarization Evaluation
Assess summary quality on faithfulness, relevance, and coherence dimensions.
EvaluationTranslation Quality
Evaluate machine translation accuracy, fluency, and meaning preservation.
EvaluationText Simplification
Evaluate if simplified text preserves meaning while improving readability.
EvaluationCode Generation Review
Evaluate LLM-generated code for correctness, efficiency, and best practices.
LLMQuestion Answering
Select answer spans and handle unanswerable questions. Build QA datasets.
ClassificationDialogue Act Classification
Label conversational intents: questions, statements, requests. Build chatbot training data.
ClassificationArgumentation Mining
Identify claims, premises, and argument structure. Evaluate reasoning quality.
ClassificationBias Detection
Identify various forms of bias in text: gender, racial, political, and more.
LLMImplicit Hate Speech
Detect subtle, coded, and implicit forms of hateful content.
Classification