Feature Extraction (FE)

Feactures from Dicts

movie_entry = [{'category': ['thriller', 'drama'], 'year': 2003}, {'category': ['animation', 'family'], 'year': 2011}, {'year': 1974}]

vec.fit_transform(movie_entry).toarray()

vec.get_feature_names() == ['category=animation', 'category=drama', 'category=family', 'category=thriller', 'year']

vec.transform({'category': ['thriller'], 'unseen_feature': '3'}).toarray()

Dict Vectorizer - NLP applications

Feature Hashing

Implementation