What is an Autoencoder and How Does it Compress Data?
I continue my study of artificial intelligence in the 'Russian school'. It's a tough but good school. Solving the autoencoder task took me 10 minutes to reach an error rate of 6.7%, but three days of experimentation to bring the error down to 5%.

The first task involves training a network called an encoder to encode an image of a human face as efficiently as possible into a few numbers (in this case, 32 numbers). Then, I train a second network called a decoder to attempt to reconstruct the image from those few numbers. Essentially, it's a clever, lossy super-compression of the image.
32 numbers is very little to describe an image with. The network must focus only on what is important. How do you like the results?
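The idea above can be sketched in a few lines of code. This is a minimal illustration, not the article's actual setup: the article presumably uses deeper networks trained on face images, while here both the encoder and decoder are single linear layers trained by plain gradient descent on random data, with the image size (256 pixels) chosen arbitrarily. Only the 32-number bottleneck matches the text.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: "images" flattened to 256 pixels, a 32-number code
n_pixels, code_dim, n_samples = 256, 32, 200
X = rng.normal(size=(n_samples, n_pixels))  # stand-in for a face dataset

# Encoder and decoder as single linear layers (a sketch; real autoencoders
# stack several nonlinear, often convolutional, layers)
W_enc = rng.normal(scale=0.05, size=(n_pixels, code_dim))
W_dec = rng.normal(scale=0.05, size=(code_dim, n_pixels))

def reconstruction_loss(X, W_enc, W_dec):
    code = X @ W_enc       # compress each image to 32 numbers
    recon = code @ W_dec   # try to rebuild the image from those 32 numbers
    return ((recon - X) ** 2).mean()

lr = 0.01
losses = []
for step in range(200):
    code = X @ W_enc
    recon = code @ W_dec
    err = recon - X
    # Gradients of the mean squared error (constant factors folded into lr)
    grad_dec = code.T @ err / n_samples
    grad_enc = X.T @ (err @ W_dec.T) / n_samples
    W_dec -= lr * grad_dec
    W_enc -= lr * grad_enc
    losses.append(reconstruction_loss(X, W_enc, W_dec))
```

After training, `X @ W_enc` is the 32-number code and the reconstruction error (`losses[-1]`) is lower than at the start. Because 32 numbers cannot hold all 256 pixels, the reconstruction is necessarily lossy, which is exactly the blurring the results below show.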


This one turned out quite nicely. The face reconstructed from just 32 numbers resembles the original and appears only slightly blurred.

A somewhat poorer result is likely due to the uneven distribution of the dataset. The network inaccurately reconstructs a wider nose and full, slightly open mouth.

A fairly nice reconstruction. It seems to have somewhat softened the sharp features.

Here, the glasses took quite a hit. Their structure is too complex for such strong compression. The expression of the mouth was also rendered as something more generic.
Original source: wordpress