DATE
October 25, 2017
TIME
2:30pm
LOGISTICS
Skype
AGENDA
Discuss 3rd year deliverables and deadlines
PARTICIPANTS
Guokan Shang (Linagora/LIX), Polykarpos Meladianos, Antoine Tixier (LIX)
MINUTES
GENERAL
- by March 20, we will have to deliver 2nd versions of the offline and real-time systems (code + documentation)
- for the review in April, we will also have to prepare demos and oral presentations
- Tom (LINAGORA) is leaving - we should find an interlocutor to replace him as a point of contact
OFFLINE SYSTEM
- the 2nd version of the offline system will consist in the abstractive model developed by the 2 rd year X students and followed-up by Guokan
- the offline system will eventually have to offer support for French - would be best if we could have it ready for the review in April. We need French WordNet, French language model, French POS tagger/stopwords/word embeddings
- Guokan will contact Abdel to get access to the French language model
- Guokan will see if he can use http://alpage.inria.fr/~sagot/wolf.html as a French replacement for WordNet
- we most likely will be able to use the same restAPI for the new offline system
- Guokan will have the submodularity function done by the end of the week
- AT MOST before Christmas break, the experiments and the paper for the 2nd version of the offline system will be done/ready for submission
ONLINE SYSTEM
- the 2nd version of the real-time system will consist in the previous version with the following improvements: use text from Cryptpad and Hublin chat, perform word sense disambiguation before sending requests to APIs for recommendation
- integration with Cryptpad and Hublin should be finalized during the barcamp in January. Calibrate with the relevant persons sufficiently early.
- Antoine and Pol will look for an off-the-shelf solution to perform word sense disambiguation in English and French. The following resource was suggested by Guokan: http://www.nltk.org/howto/wsd.html
BOTH SYSTEMS
- eventually, packaging the offline and real-time systems with Docker would work better for Linagora. Check that with the relevant person
- Antoine will send an email to Abdel to follow-up on the use of utterance annotations for French (could be used either for research or for production if the tool that generates the annotations is available)