Navigating the High Cost of AI Compute
While ChatGPT uses a specific type of reinforcement learning called "Reinforcement Learning from Human Feedback (RLHF)", at a high level it is an example of a Large Language Model (LLM). A16Z has published a very useful analysis of the cost of training LLMs:
https://a16z.com/2023/04/27/navigating-the-high-cost-of-ai-compute