
DIPRE


References

1. R0 ← Seed
A small set of trusted relation instances R0, representing the target relation, is
provided by a human. In Brin's experiment, only five seed instances, each pairing a book
with its author, were given.
2. O ← Occurrences (R0, D)
The relation tuples in R0 from step 1 are searched for in D, where D is the collection
of all web pages.
3. P ← GeneratePatterns (O)
Form generalized patterns from the set of occurrences identified in step 2. Brin notes
that overgeneralization at this stage can produce a large number of bad patterns, and
that the higher the coverage of the patterns, the better the result.
4. Use the patterns generated in step 3 to extract more relation instances from D; these serve as the new seeds.
5. If no more new relations can be learned from D, stop. Otherwise go to step 2.

The algorithm is illustrated in Figure 2.3.1 1.
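
The steps above can be sketched in a few lines of Python. This is a minimal illustration under simplifying assumptions, not Brin's implementation: D is a small in-memory list of strings rather than the web, a pattern is just a (prefix, middle, suffix) triple of literal text, and all function names (find_occurrences, generate_patterns, extract, dipre) and the parameters ctx and max_iters are hypothetical. To limit overgeneralization, a pattern is kept only if at least two occurrences support it and its prefix and suffix are non-empty.

```python
import os
import re
from collections import defaultdict


def find_occurrences(tuples, docs, ctx=10):
    """Step 2: find each (first, second) pair in docs and record its context."""
    occurrences = []
    for first, second in tuples:
        # The text between the two fields is captured as the "middle".
        rx = re.compile(re.escape(first) + r"(.{1,20}?)" + re.escape(second))
        for doc in docs:
            for m in rx.finditer(doc):
                prefix = doc[max(0, m.start() - ctx):m.start()]
                suffix = doc[m.end():m.end() + ctx]
                occurrences.append((prefix, m.group(1), suffix))
    return occurrences


def generate_patterns(occurrences):
    """Step 3: group occurrences by middle and generalize their contexts."""
    by_middle = defaultdict(list)
    for prefix, middle, suffix in occurrences:
        by_middle[middle].append((prefix, suffix))
    patterns = set()
    for middle, contexts in by_middle.items():
        if len(contexts) < 2:
            continue  # a single occurrence is too weak to generalize from
        # Longest common suffix of the prefixes, longest common prefix of the suffixes.
        reversed_prefixes = [p[::-1] for p, _ in contexts]
        prefix = os.path.commonprefix(reversed_prefixes)[::-1]
        suffix = os.path.commonprefix([s for _, s in contexts])
        if prefix and suffix:  # reject patterns with empty (overly general) context
            patterns.add((prefix, middle, suffix))
    return patterns


def extract(patterns, docs):
    """Step 4: apply each pattern to docs and collect the matching tuples."""
    found = set()
    for prefix, middle, suffix in patterns:
        rx = re.compile(
            re.escape(prefix) + r"(.+?)" + re.escape(middle)
            + r"(.+?)" + re.escape(suffix)
        )
        for doc in docs:
            for m in rx.finditer(doc):
                found.add((m.group(1), m.group(2)))
    return found


def dipre(seeds, docs, max_iters=5):
    """Steps 1 to 5: grow the relation until no new tuples are learned."""
    relation = set(seeds)  # step 1: R0 <- Seed
    for _ in range(max_iters):
        occurrences = find_occurrences(relation, docs)
        patterns = generate_patterns(occurrences)
        new_tuples = extract(patterns, docs) - relation
        if not new_tuples:
            break  # step 5: nothing new can be learned, stop
        relation |= new_tuples
    return relation


if __name__ == "__main__":
    # Toy documents invented for illustration only.
    docs = [
        "<li><b>Isaac Asimov</b>, <i>The Robots of Dawn</i></li>",
        "<li><b>Charles Dickens</b>, <i>Great Expectations</i></li>",
        "<li><b>Jane Austen</b>, <i>Emma</i></li>",
    ]
    seeds = {
        ("Isaac Asimov", "The Robots of Dawn"),
        ("Charles Dickens", "Great Expectations"),
    }
    for pair in sorted(dipre(seeds, docs)):
        print(pair)
```

In this toy run the two seed tuples share the same surrounding markup, so one pattern is induced and it picks up the third, unseeded pair. The max_iters cap is a safeguard added for the sketch; the algorithm as described stops only when no new relations are found.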