It can be frustrating to get an AI application working amazingly well 80% of the time and failing miserably the other 20%. How can you close the gap and create something that you rely on? Chris and Daniel talk through this process, behavior testing, and the flow from prototype to production in this episode. They also talk a bit about the apparent slow down in the release of frontier models. :link: https://practicalai.fm/295
Ch | Start | Title | Runs |
---|---|---|---|
01 | 00:00 | Welcome to Practical AI | 00:46 |
02 | 00:57 | Sponsor: Fly | 02:29 |
03 | 03:33 | Thanksgiving preparations | 01:24 |
04 | 04:57 | Agents in production | 01:30 |
05 | 06:27 | AI ceiling & current hype | 02:11 |
06 | 08:39 | Level of transformation | 02:10 |
07 | 10:49 | Current models are mostly good enough | 05:56 |
08 | 16:55 | Sponsor: Timescale | 02:17 |
09 | 19:34 | Robust AI workflows | 05:04 |
10 | 24:39 | Finding the right workflow | 06:08 |
11 | 30:47 | Transition from notebook to code | 03:02 |
12 | 34:06 | Sponsor: Eight Sleep | 02:34 |
13 | 36:44 | Testing and integrating | 03:22 |
14 | 40:07 | Sketching out a good framework | 07:17 |
15 | 47:23 | Roles have shifted | 01:57 |
16 | 49:20 | Outro | 00:46 |
Last updated: Dec 12 2024 at 16:20 UTC