Relation extraction identifies semantic relationships between entities mentioned in text, creating structured knowledge from unstructured data.
Relation Triple: (Subject Entity, Relation Type, Object Entity)
Example: "Steve Jobs founded Apple" → (Steve Jobs, FOUNDED, Apple)
| Relation | Description | Example |
|---|---|---|
| WORKS_FOR | Employment | (John, WORKS_FOR, Google) |
| LOCATED_IN | Physical location | (Paris, LOCATED_IN, France) |
| BORN_IN | Birthplace | (Einstein, BORN_IN, Germany) |
| FOUNDED | Created organization | (Gates, FOUNDED, Microsoft) |
| SPOUSE | Married to | (Obama, SPOUSE, Michelle) |
| SUBSIDIARY_OF | Owned by | (Instagram, SUBSIDIARY_OF, Meta) |
PERSON ORGANIZATION LOCATION DATE MISC
Identify all relation triples in the sentence:
Which relations are explicitly stated vs. inferred?
| Relation | Explicit or Inferred? |
|---|---|
| (Dr. Sarah Chen, WORKS_FOR, MIT) | |
| (Dr. Sarah Chen, PUBLISHED_IN, Nature) | |
| (Dr. Sarah Chen, PROFESSION, professor) |
What is the correct direction for the acquisition relation?
How should co-founding be represented?
What about the location relation?
How should these relations be annotated?
| Statement | Relation Status |
|---|---|
| (John, WORKS_FOR, Google) | |
| (John, WORKS_FOR, Meta) | |
| (Mary, WORKS_FOR, Apple) |
Extract all relation triples (aim for at least 5):
Compare your annotations with your group. Where did you disagree?
Why is relation extraction difficult?
Relation extraction turns unstructured text into structured knowledge, but requires many annotation decisions.