Use machine learning and natural language processing techniques to analyze the changes made in a project, and classify them in:
- Small / unimportant fix
- Big / important fix
- Small / important feature
- Big / important feature
For this project I will
- Generate a basic corpus of labeled data from a different set of project related with openSUSE
- Evaluate the best features to make a proper classification: n-gram, PoS tag, TF-IDF (with and without stemmer)
- Evaluate and measure the best classification model: Naive Bayes, Linear SVM, Max Entropy, ...
Looking for mad skills in:
nlp machinelearning git github
This project is part of:
Hack Week 10 Hack Week 11 Hack Week 12
For certain directories (e.g. his own documents...
It is well-known that two git commits within a ...
During Hack Week 7 I worked on an archive of Qt...
QDirStat - Qt-based directory statistics: KDirStat without any KDE, now based on Qt 5 by shundhammer
This is about porting the old KDE 3 based KDirs...
Wayland would replace X11 in the future (maybe ...