Generative AI — A quick overview of GPT / ChatGPT algorithm and its applications

3 min read · Mar 5


Generative AI has taken the world by storm. Text and image generation models such as ChatGPT and Stable Diffusion are breaking out and disrupting industries across the board.

GPT is a family of neural networks that generate human-like text. GPT-3, the third-generation Generative Pre-trained Transformer, is a neural network model trained on internet data to generate many kinds of text. Developed by OpenAI, it needs only a small amount of input text to produce large volumes of relevant, sophisticated machine-generated text.

GPT-3's architecture is a decoder-only transformer network with a 2048-token context window and a then-unprecedented size of 175 billion parameters. The model was trained with generative pre-training: it learns to predict the next token from the tokens that precede it. The model demonstrated strong zero-shot and few-shot learning on many tasks.
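To make "predict the next token from previous tokens" concrete, here is a minimal sketch of autoregressive generation. It is not GPT-3: the transformer is swapped for a toy bigram counter, and the names `train_bigram` and `generate` and the tiny corpus are made up for illustration. Only the generation loop and the idea of a bounded context window carry over.

```python
from collections import Counter, defaultdict


def train_bigram(tokens):
    """Count how often each token follows each previous token (toy stand-in for a trained model)."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts


def generate(counts, prompt, max_new_tokens=5, context_len=2048):
    """Greedy autoregressive decoding: repeatedly append the most likely next token.

    Assumes a non-empty prompt. A real model conditions on the whole context;
    this toy only looks at the last token in it.
    """
    out = list(prompt)
    for _ in range(max_new_tokens):
        context = out[-context_len:]  # only the most recent tokens are visible
        followers = counts.get(context[-1])
        if not followers:
            break  # the toy model has never seen this token followed by anything
        out.append(followers.most_common(1)[0][0])
    return out


# Hypothetical training text, pre-split into word-level "tokens".
corpus = "the cat sat on the mat . the cat sat on the rug .".split()
model = train_bigram(corpus)
print(generate(model, ["the"], max_new_tokens=3))  # ['the', 'cat', 'sat', 'on']
```

Each generated token is fed back in as input for the next step, which is why a short prompt can be extended into a long passage.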

GPT-3.5 supports a very wide range of language tasks, including text generation, language understanding, information mining, question answering, language translation, code generation, content generation for literature or marketing, text summarization, entity extraction, sentiment analysis, and text classification.

ChatGPT, launched in November 2022, is a conversational interface built on GPT-3.5. The dialogue format makes it possible for ChatGPT to answer follow-up questions, admit its mistakes, challenge incorrect premises, and reject inappropriate requests.
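One plausible way a dialogue format supports follow-up questions is by placing earlier turns back into the model's input, so the next reply is conditioned on the whole conversation. The sketch below illustrates that idea only; ChatGPT's actual prompt format is not described here, and `build_prompt`, the speaker labels, and the word-based token count are all assumptions.

```python
def build_prompt(history, user_message, max_tokens=2048):
    """Join prior turns with the new message, dropping the oldest turns if over budget.

    `history` is a list of (speaker, text) pairs. Token counting is a crude
    whitespace split, which a real tokenizer would not match.
    """
    turns = history + [("User", user_message)]

    def n_tokens(turn):
        _speaker, text = turn
        return len(text.split()) + 1  # +1 as a rough allowance for the speaker label

    # Keep at least the newest turn, even if it alone exceeds the budget.
    while len(turns) > 1 and sum(n_tokens(t) for t in turns) > max_tokens:
        turns = turns[1:]

    return "\n".join(f"{speaker}: {text}" for speaker, text in turns) + "\nAssistant:"


history = [
    ("User", "What is GPT-3?"),
    ("Assistant", "A large language model."),
]
print(build_prompt(history, "How many parameters does it have?"))
```

Because the earlier question stays in the prompt, "it" in the follow-up can be resolved to GPT-3. With a small budget, older turns fall out of the window and that context is lost.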

ChatGPT was fine-tuned (an approach to transfer learning) from an improved version of OpenAI’s GPT-3 known as “GPT-3.5”. The fine-tuning process combined supervised learning and reinforcement learning in a process called reinforcement learning from human feedback (RLHF). RLHF helped produce more human-like answers, reduce biased and negative text, and make it harder for users to manipulate the chatbot.
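The "human feedback" part of RLHF is commonly described as training a reward model on human comparisons between pairs of answers, then using that reward signal in the reinforcement learning step. The snippet below is a hedged sketch of the pairwise preference loss often used for such a reward model. The function name and the example scores are hypothetical, and nothing here is taken from OpenAI's actual training code.

```python
import math


def preference_loss(reward_chosen, reward_rejected):
    """Pairwise loss -log(sigmoid(r_chosen - r_rejected)).

    Small when the human-preferred answer scores higher than the rejected
    one, large when the ordering is reversed.
    """
    diff = reward_chosen - reward_rejected
    # -log(sigmoid(d)) is algebraically equal to log(1 + exp(-d)).
    return math.log1p(math.exp(-diff))


# Hypothetical reward-model scores for two candidate answers to one prompt.
print(preference_loss(2.0, 0.0))  # reward model agrees with the human label
print(preference_loss(0.0, 2.0))  # reward model disagrees, so the loss is larger
```

Minimizing this loss over many labeled pairs pushes the reward model to rank answers the way human raters did.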

