What can TensorFlow really do?

Published in

Oursky Team

6 min readFeb 14, 2018

TL;DR: Google’s open-source TensorFlow library helps developers build computational graphs and contains free additional modules that make AI/ML software development easier for consumer products and mobile apps. Developers require knowledge of Python or C++. This piece covers 3 common modules and possible business applications:

Convolutional Neural Networks (CNN) for image recognition
Sequence-to-Sequence (Seq2Seq) models for human language
Large-scale linear models for data analysis

What’s in TensorFlow?

Open-sourced in 2015, TensorFlow is a framework by Google for creating deep learning models. Deep Learning is one of several categories of machine learning (ML) models that use multi-layer neural networks. The TensorFlow library allows users to perform functions by creating a computational graph.

TensorFlow took the world by storm because it is free, (relatively) easy to use and gives developers with entry-level machine learning backgrounds access a powerful library instead of building all their AI models from scratch. TensorFlow and its accompanying modules make ML/AI software development for both mobile apps and backend services easier.

How does TensorFlow fit into AI and Machine learning?

Tensorflow is one of many machine learning libraries (other examples include CNTK and Theano). Machine learning is a field of computer science that gives computers the ability to learn without being explicitly programmed.[1] For example, AlphaGo Zero‘s AI taught itself to play Go and outperformed its predecessor, AlphaGo, which had defeated the world champion in Go. Machine learning is useful for tasks where explicit algorithms don’t yield good results, such as user screening, sorting high context data and clustering for predictions and profiling.

Practical examples include detecting fraud or data breaches, email filtering, optical character recognition (OCR), and ranking.

In the next section, we’ll demystify the computational graph, which is essential for understanding how TensorFlow works in human-speak.

Trigger Warning: Computational graphs and a bit of theory

*Image source:* *Deep Learning from Scratch*.

What do developers need to do to use TensorFlow?

TensorFlow was created with processing power limitations in mind (check TensorFlow Lite and TensorFlow Mobile), making it easier for mobile and web developers to make use of the library and create AI-powered features for consumer products. Developers with a basic background in neural networks can use the framework for data sets, estimators, training and inference.

Developers with no background in neural networks may want to start with a higher-level neural network API, such as Keras.io. Written in Python, Keras is capable of running on top of TensorFlow, CNTK, and Theano and is good for easy and fast prototyping.

To start with TensorFlow, a developer needs to know:

Python or C++
A little bit about arrays (specifically the numpy.array, which is provided by Numpy, a numerical computation library for Python, that Tensorflow uses to deal with an array/matrix)

3 Common Applications for TensorFlow

The many modules included in the TensorFlow library can perform a variety of functions known as AI/ML. Below are three commonly used modules:

Convolutional Neural Networks (CNN) for image recognition and processing
Sequence-to-Sequence (Seq2Seq) models for human language-related features
Large-scale linear models for data analysis and simple behavioural predictions

Below, we’ll list out some examples for commercial and consumer applications:

Convolutional Neural networks (CNN) and Image Recognition

*We used a* *dilib* *for our* *facial landmark detection* *on a client app.*

One of the many applications for CNNs is to help machines recognize images and make them meaningful in a way that is intuitive to humans. For example, this means having a computer recognizing the coloured dots in an image combine to become a cake. In order to check accuracy, an AI can be trained on a training data where correct answers are already available. The CNN will learn to identify an image, check if it is correct, and improve its ability to make sense of the coloured clusters. But a true performance is giving AI a new dataset to see how it performs.

*Oursky compared two machine learning libraries to help a client use image classification for inventory matching.*

Transfer learning is a shortcut technique that takes a fully-trained AI model and repurpose it for new image classification rather than retraining an AI model from scratch every time. ImageNet is a good resource for providing an image training data set.

CNN can be applied in a range of fields such as:

image classification for business applications like our client case of matching and sorting inventory
medical diagnostics to match patient image data with known visual symptoms for medical conditions
line sensors for sports for juding in-out based on high resolution image data
risk assessment for applications (such as for insurance or a credit card)

The benefits of starting early with AI/ML is that early adopters will have larger datasets to work with than competitors years from now.

Sequence to Sequence Models and human language

*Implementing TensorFlow’s Sequence-to-Sequence framework for* *chatbot conversation*.

Sequence-to-Sequence (Seq2Seq) models use recurrent neural networks as a building block by feeding lots of sentence pairs during model training so that we can generate one sentence from another sentence. These sentence pairs can be anything. For example, when it is words from two different languages, the model can be used for translations. When it is a pair of conversational messages, the model can be used for chatbots.

One interesting point is that we just need to feed in a raw word sequence and we can get back a word sequence output with mostly correct grammar (not perfect, but at least understandable). This means that seq2seq models can learn the language model from the training sample and implicitly learn it during training, which usually is not the case in traditional NLP where the language model needs to be explicitly taught.

*Oursky trained a* *Cantonese chatbot* *from scratch using local forum data.*

Language modeling (in short, the probability a word is going to appear given previous sequence of words) is key to many interesting problems such as:

Speech recognition for real-time subtitling
AI chatbots, such as our Cantonese Chatbot for Telegram
Virtual assistants like Apple’s Siri, Amazon’s Alexa, and the Google Assistant
Machine translation
Summarize documents
Batch image captioning

Large-Scale Linear models

Large-scale linear models are more simple than complex neural networks with many layers. But going on the principle of using the right tool for the right job, linear models are still powerful when applied for the appropriate cases. Compared to deep neural nets, linear models train more quickly, can work well on large feature sets, and be interpreted and debugged more easily. As a general approach, linear models are effective for data crunching and making simple predictions to help businesses with large datasets.

Linear models can be used for binary classification (predicting a or b outcome), multiclass classification (predicting one of multiple outcomes) and regression (predicting a numeric value). These models can be used for evaluating big data, such as census data or financial data.

As an example, Amazon ML uses linear models in ecommerce to help make suggestions for sellers such as:

Will the customer buy this product or not buy this product? (Binary Classification)
Is this product a computer, smartphone, or accessory? (Multiclass Classification)
What is an appropriate price for this car model and vintage? (Regression)

Of course, we’ve just scraped the surface of what AI/ML can do. If you would like to learn more about how AI-powered features can be incorporated into your product, get in touch.

😻 At Oursky we’re all about helping brands and entrepreneurs make their ideas happen. Contact us if you’re looking for a partner to help build your next digital product.

Originally published at blog.oursky.com on February 14, 2018.