scikit-learn vectorizer passing tokens