Worksheet: A Hard NLP Sequence Labeling Task — Event Extraction

Understand why sequence labeling is harder than classification
Course: Natural Language Annotation for Machine Learning Task Type: Sequence labeling (event triggers + arguments)
Author: Jin Zhao

Background

You are annotating data for an AI system that extracts events from news text.

An event consists of:

Event Type: ATTACK
An action involving physical or violent harm against a person, group, or location.

Annotation Labels

Trigger labels:

B-ATTACK beginning of attack trigger
I-ATTACK continuation of trigger
O not part of a trigger

Argument labels:

B-AGENT / I-AGENT who carried out the attack
B-TARGET / I-TARGET who/what was attacked
B-PLACE / I-PLACE where it happened

Part 1: Warm-up

Q1

Which of the following could be valid attack triggers? (Check all that apply.)

Part 2: Sequence Labeling

Sentence 1
"The militants launched an attack on the military base in the city."
The
militants
launched
an
attack
on
the
military
base
in
the
city
Q1

Which word(s) did you choose as the event trigger? Why?

Part 3: Boundary Ambiguity

Sentence 2
"The militants carried out a deadly attack in the capital."
The
militants
carried
out
a
deadly
attack
in
the
capital
Q1

Is the trigger:

Part 4: Trigger Ambiguity

Sentence 3
"Fighting erupted near the border late Monday."
Fighting
erupted
near
the
border
late
Monday
Q1

Does this sentence contain an ATTACK event?

Part 5: Argument Ambiguity

Sentence 4
"The army shelled rebel positions outside the town."
The
army
shelled
rebel
positions
outside
the
town
Q1

Which argument roles were hardest to decide?

Part 6: Reflecting on Disagreement

Looking back at your annotations in Parts 2–5, consider where other annotators might make different choices.

Q1

Where do you think annotators would disagree most?

Q2

Which guideline change would most improve agreement?

Part 7: Why Event Extraction Is Hard

Q1

Why is event extraction harder than sequence labeling tasks like NER? (Check all that apply.)

Part 8: Aspectual Verbs and Event Boundaries

Aspectual words describe the beginning, continuation, or end of an event, rather than the event itself.

Common aspectual words: began, started, continued, stopped, resumed, ended

Whether these should be included as part of an event trigger is often unclear and guideline-dependent.

Sentence 5
"The army began shelling rebel positions near the town."
The
army
began
shelling
rebel
positions
near
the
town
Q1

Which word(s) did you label as the ATTACK trigger?

Q2

What does "began" contribute to the meaning of the event?

Sentence 6
"The fighting continued throughout the night."
The
fighting
continued
throughout
the
night
Q3

Does this sentence describe:

Part 9: Aspectual Decisions

Q1

Should aspectual words like began, continued, ended be:

Q2

What guideline rule would reduce disagreement the most?

Part 10: Aspect, Intent, and Event Reality

Aspect vs. Core Event

Sentence 7
"The army attacked rebel positions near the town."
Sentence 8
"The army was preparing to attack rebel positions near the town."
Q1

Which sentence(s) contain an ATTACK event?

Q2

What is the key difference between these sentences?

Continuation vs. New Events

Sentence 9
"The fighting stopped after international pressure."
Sentence 10
"The fighting resumed the next morning."
Q3

How many ATTACK events are described here?

Part 11: Guideline Design and Modeling Consequences

Q1

Which rule would you adopt for this project?

Q2

Suppose you include began, continued, stopped as part of ATTACK triggers. Which outcomes are likely? (Check all that apply.)

Q3

If instead you exclude aspectual words from triggers, what might the model fail to learn?