You are viewing a single comment's thread from:
RE: *TrufflePig*: A Bot based on Natural Language Processing and Machine Learning to support Content Curators and Minnows
Short update on the roadmap:
I want to conduct further experiments with different ML regressors as well as feature encodings. I already made some experiments using Doc2Vec instead of LSI. But this was not very fruitful. A more thorough investigation may improve the bot's judgment further.
I did this and improved the bot slightly. From now on the LSA is not only computed over tokens, but over bigrams of tokens as well.
I also tried trigrams and 4grams as well as skip-grams, but they did not improve the bot's performance.
I'm currently working on the @trufflepig call a pig feature, afterwards I'll focus on the recommendation system.