Discussions

Ask a Question
Back to all

What is the most effective way to process the premise of this “non-standard” French text?

Hello TextMine Community,
I am currently trying to build a pipeline to process long Vietnamese text — the input data often has missing punctuation, contains slang and spelling errors. The goals are:

  1. Identify the main topic (topic modeling),
  2. Extract proper named entities (Named Entity Recognition),
  3. Analyze implied sentiment (implicit sentiment analysis).
    I am wondering:
    What is the most effective way to process the premise of this “non-standard” French text?

Edited by Tap Road 1 day ago