Seol mar théacs é seo: Compression schemes for mining large datasets :