Honza Tyl · 1 min read · Archive 2018

A New Challenge!

Recently, an interesting competition took place on Kaggle (https://www.kaggle.com/c/jigsaw-toxic-comment-classificatio…) to create a detector capable of recognising insults, toxic and obscene remarks, and so forth – the Toxic Comment Classification Challenge.

I found out about it late, but I still managed to write a deep neural network based on LSTM + FastText (the algorithm's performance would have earned a gold medal in the Kaggle rankings). A colleague from Alpha Industries translated the training dataset into Czech (70 megabytes of text!) and deployed it on an Amazon server, and you can now try it out here: www.detector.alphai.cz.
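The post doesn't include the model itself, but an LSTM classifier over FastText-style word vectors could be sketched roughly as follows. This is a minimal PyTorch sketch under stated assumptions, not the author's actual network: the layer sizes, the max-pooling step, and the randomly initialised embedding (which would be loaded from FastText vectors in practice) are all illustrative choices; the six sigmoid outputs follow the challenge's standard label set (toxic, severe toxic, obscene, threat, insult, identity hate).

```python
import torch
import torch.nn as nn

class ToxicityClassifier(nn.Module):
    """Bidirectional LSTM over word embeddings with one sigmoid
    output per toxicity label (multi-label classification)."""

    def __init__(self, vocab_size, embed_dim=300, hidden_dim=64, num_labels=6):
        super().__init__()
        # In practice this matrix would be initialised from pre-trained
        # FastText vectors; here it is random, purely for illustration.
        self.embedding = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, hidden_dim,
                            batch_first=True, bidirectional=True)
        self.head = nn.Linear(2 * hidden_dim, num_labels)

    def forward(self, token_ids):
        embedded = self.embedding(token_ids)      # (batch, seq, embed_dim)
        outputs, _ = self.lstm(embedded)          # (batch, seq, 2*hidden_dim)
        pooled = outputs.max(dim=1).values        # max-pool over the sequence
        return torch.sigmoid(self.head(pooled))   # per-label probabilities

model = ToxicityClassifier(vocab_size=50_000)
batch = torch.randint(0, 50_000, (2, 20))  # two comments, 20 tokens each
probs = model(batch)                       # shape (2, 6), values in [0, 1]
```

A network like this would be trained with binary cross-entropy, one loss term per label, since a single comment can be both obscene and an insult at the same time.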

The algorithm is not perfect; however, it works reasonably well in both Czech and English.

Here's a task for you – can you find a sentence, or even a longer text, that the algorithm evaluates as non-vulgar (non-toxic), yet is actually offensive?

Original source: wordpress
