I have finished the Weka tutorial I was doing earlier. I have also done some work in document analysis, which could be useful for the data sets for this project. The document analysis included training an algorithm so it could tell whether a passage was about a given topic or not.
I have also joined the Coursera machine learning course. I am currently on Week 1, watching videos and reading the information about this course.
Finally, I am now trying out some algorithms on sample student performance data sets from Kaggle. One of them is IBk, a nearest-neighbor classifier. While it works on the data set, its error rate is very high, so I am looking for other algorithms that could work better on the data set. I should also be able to obtain the actual data set for the final project soon.