I've been subjecting chatbots to a set of real-world programming tests for two years now. There are two I recommend if you're looking for AI coding help - and several to avoid.
Move over, DeepSeek. Seattle-based nonprofit AI lab Ai2 has released a benchmark-topping model called Tulu3-405B.