Seol mar théacs é seo: Information extraction: algorithms and prospects in a retrieval context