
The frontier labs are relentless in their launch of new models with Google Deepmind launching Gemini-3.1 Flash-Lite and OpenAI launching GPT-5.4 with impressive performance for their class. Cognition which makes Devin and Windsurf showed an early preview of their model SWE-1.6 which has a huge jump over their previous SWE-1.5 and offers 950 tokens/second inference via their Cerebras partnership
Anthropic's new feature of importing preferences/context from other AI providers into Claude is its salvo to be more attractive to consumers with continuous improvements to Claude Code as well as Cowork and also taking advantage of more organisations offering skills in Anthropic format such as Dune, Coinmarketcap
Readers who think others in their family, friends and acquaintances who are curious in knowing more about rapidly evolving AI tools/services/use cases and would benefit from being subscribers of this weekly newsletter are encouraged to share this publication link to them and invite them to subscribe
The following messages were posted on the 'All Things AI' Telegram group from Sunday 1st Mar 2026 to Saturday 7th Mar 2026
Anthropic created a process to bring over preferences and context from other AI providers to Claude
For those who are using other providers and fear starting from scratch with Claude, recommend going through the below landing page and read it completely to understand the process
https://claude.com/import-memory
1 | Cognition Unveils SWE-1.6 with 55.8% SWE-Bench Pro Score The new model jumps from SWE-1.5's 40.1% on the benchmark, thanks to reinforcement learning that used 100 times more compute on thousands of NVIDIA GB200 chips—while keeping its speedy 950 tokens per second. It edges out open-source rivals and matches top closed models like Claude Opus 4.6, with internal tests showing solve rates climb from 52.4% to 68.7% on engineering tasks. Tweaks are planned for quirks like overthinking. Team members shared their excitement, with CEO Scott Wu calling it a big milestone and researchers praising the 'insane team' effort |
1 | Chintan Turakhia, Senior Director of Engineering at Coinbase, in conversation with Clairo Vo, founder of ChatPRD and host of the 'How I AI' podcast Chintan led the transformation of a 1,000-plus-engineer organization to embrace AI tools at scale. When tasked with rewriting Coinbase’s self-custody wallet into a consumer social app in just six to nine months, Chintan turned to AI as a force multiplier |
1 | via Tom Ho Got Qwen 3.5 to run on iphone locally, testing on simulator for now |
1 | via Ben Jammin If anyone uses OpenClaw, here's a Morning Briefing use case of mine that you may be interested in It scrapes a bunch of different sources for me and compiles everything for me into bite sized info that I can read quickly: - My Newsletters - Youtube competitor videos - Product hunt launches - Hot Reddit Topics I'd be happy to help anyone set this kind of stuff up ![]() |
1 | OpenAI Launches GPT-5.4 as Top Model for Professional Tasks The new GPT-5.4 rolls out immediately via API as gpt-5.4 and gpt-5.4-pro, plus in Codex and gradually to ChatGPT users on Plus, Team, Pro, Enterprise, and Edu plans. It shines in agentic tasks with native computer use—interpreting screenshots, generating code, and controlling mouse or keyboard across apps Benchmarks show it leading rivals like Claude 4.6 and Gemini 3.1 Pro, with testers like Matt Shumer calling it the world's best and coding 'essentially solved.' |
1 | via Fazri Zubair Just started using 5.4 Extra High and evaluating its ability in code generation. We'll keep you guys posted. Anyone have some early notes or results? |
2 | response to the above from Coop The first few tasks I was thinking it was better than Opus, but after giving it some more full features it didn’t beat Opus for me. I will test it with e2e testing with Playwrite / Stagehand this week as I hear that’s where it excels besides frontend design which I also haven’t tested yet. My code base is 10m LOC though so I am expecting a high bar to 5.4m compared to most. |
Below is my personal website which aggregates links to many of my socials as well as the various content and community that I curate. Feel free to share this link to others who you think may find this content/community useful to them
The cover image of this newsletter via generated via the Nano Banana 2 model within the Freepik tool via the following prompt
A futuristic stadium in the year 2056, surrounded by grassy fields and crowds walking on bridges to reach it. The building is white with black details. In front, you can see an ocean with some boats around it. A huge bridge leads from one side over that river to another island where other buildings stand. High resolution, hyper-realistic, high detail, sharp focus, depth of field, volumetric lighting, global illumination.

The frontier labs are relentless in their launch of new models with Google Deepmind launching Gemini-3.1 Flash-Lite and OpenAI launching GPT-5.4 with impressive performance for their class. Cognition which makes Devin and Windsurf showed an early preview of their model SWE-1.6 which has a huge jump over their previous SWE-1.5 and offers 950 tokens/second inference via their Cerebras partnership
Anthropic's new feature of importing preferences/context from other AI providers into Claude is its salvo to be more attractive to consumers with continuous improvements to Claude Code as well as Cowork and also taking advantage of more organisations offering skills in Anthropic format such as Dune, Coinmarketcap
Readers who think others in their family, friends and acquaintances who are curious in knowing more about rapidly evolving AI tools/services/use cases and would benefit from being subscribers of this weekly newsletter are encouraged to share this publication link to them and invite them to subscribe
The following messages were posted on the 'All Things AI' Telegram group from Sunday 1st Mar 2026 to Saturday 7th Mar 2026
Anthropic created a process to bring over preferences and context from other AI providers to Claude
For those who are using other providers and fear starting from scratch with Claude, recommend going through the below landing page and read it completely to understand the process
https://claude.com/import-memory
1 | Cognition Unveils SWE-1.6 with 55.8% SWE-Bench Pro Score The new model jumps from SWE-1.5's 40.1% on the benchmark, thanks to reinforcement learning that used 100 times more compute on thousands of NVIDIA GB200 chips—while keeping its speedy 950 tokens per second. It edges out open-source rivals and matches top closed models like Claude Opus 4.6, with internal tests showing solve rates climb from 52.4% to 68.7% on engineering tasks. Tweaks are planned for quirks like overthinking. Team members shared their excitement, with CEO Scott Wu calling it a big milestone and researchers praising the 'insane team' effort |
1 | Chintan Turakhia, Senior Director of Engineering at Coinbase, in conversation with Clairo Vo, founder of ChatPRD and host of the 'How I AI' podcast Chintan led the transformation of a 1,000-plus-engineer organization to embrace AI tools at scale. When tasked with rewriting Coinbase’s self-custody wallet into a consumer social app in just six to nine months, Chintan turned to AI as a force multiplier |
1 | via Tom Ho Got Qwen 3.5 to run on iphone locally, testing on simulator for now |
1 | via Ben Jammin If anyone uses OpenClaw, here's a Morning Briefing use case of mine that you may be interested in It scrapes a bunch of different sources for me and compiles everything for me into bite sized info that I can read quickly: - My Newsletters - Youtube competitor videos - Product hunt launches - Hot Reddit Topics I'd be happy to help anyone set this kind of stuff up ![]() |
1 | OpenAI Launches GPT-5.4 as Top Model for Professional Tasks The new GPT-5.4 rolls out immediately via API as gpt-5.4 and gpt-5.4-pro, plus in Codex and gradually to ChatGPT users on Plus, Team, Pro, Enterprise, and Edu plans. It shines in agentic tasks with native computer use—interpreting screenshots, generating code, and controlling mouse or keyboard across apps Benchmarks show it leading rivals like Claude 4.6 and Gemini 3.1 Pro, with testers like Matt Shumer calling it the world's best and coding 'essentially solved.' |
1 | via Fazri Zubair Just started using 5.4 Extra High and evaluating its ability in code generation. We'll keep you guys posted. Anyone have some early notes or results? |
2 | response to the above from Coop The first few tasks I was thinking it was better than Opus, but after giving it some more full features it didn’t beat Opus for me. I will test it with e2e testing with Playwrite / Stagehand this week as I hear that’s where it excels besides frontend design which I also haven’t tested yet. My code base is 10m LOC though so I am expecting a high bar to 5.4m compared to most. |
Below is my personal website which aggregates links to many of my socials as well as the various content and community that I curate. Feel free to share this link to others who you think may find this content/community useful to them
The cover image of this newsletter via generated via the Nano Banana 2 model within the Freepik tool via the following prompt
A futuristic stadium in the year 2056, surrounded by grassy fields and crowds walking on bridges to reach it. The building is white with black details. In front, you can see an ocean with some boats around it. A huge bridge leads from one side over that river to another island where other buildings stand. High resolution, hyper-realistic, high detail, sharp focus, depth of field, volumetric lighting, global illumination.
2 | Paul Graham, the OG of Y Combinator hinting at the much improved capabilities of Replit v4 |
3 | via Yat Siu had an opportunity to tell Peter that crypto isn't bad for AI at all, on stage he actually said "I don't hate crypto" but do think the scammers out there scarred him very badly |
4 | via Simon Davis I created a guide to give your agent ultimate context for free by automatically hooking up your OpenClaw to notes from every meeting and discussion. Hope some of you find this useful. |
via Marc McGinley |
2 | via Tom Ho MCP, x402, claude code & Openclaw skills for CoinMarketCap |
3 | Anthropic Rolls Out Voice Mode for Claude Code Developers activate it via '/voice' and hold the spacebar to speak, streaming transcripts right into their code editor without overwriting text. The feature shines for quick ideas, refactoring tasks, and accessibility |
5 | New venture SecretSauce from my friends Simon Davis and Benjamin Chevalier Small blurb from the Forbes article which introduces their product ===== Rather than prompting from scratch each time, users upload brand assets or share a website. The system builds what the company calls a “codex”, a memory layer that encodes visual identity, tone and product rules. via Simon Davis SecretSauce encodes your brand once and uses that intelligence to produce on-brand content by default. No prompting, no fixing, no starting from zero every time. Built on a system originally developed for live games at massive scale, it gets smarter with every interaction. The longer you use it, the better it knows your brand, and the harder it becomes to replicate with anything else. I'll make sure we bump anyone from this group for the free beta. You can sign up here. |
2 | From Google Announcing Gemini 3.1 Flash-Lite! ️ Our fastest and most cost-efficient Gemini 3 series model yet. A 45 % increase in output speed and it outperforms 2.5 Flash. It also has dynamic thinking levels to match task complexity. |
3 | check out the updated skill-creator with built-in support for test generation (e.g., to measure + optimize tricky things like skill trigger rate). available in Claude Code as plugin, https://claude.ai and Cowork. |
4 | via Coop |
5 |
2 | Tanishq Kumar, Tri Dao, and Avner May from Together Compute introduce Speculative Speculative Decoding (SSD) for up to 2x faster LLM inference Researchers Tanishq Kumar, Tri Dao, and Avner May from Together Compute released SSD, a new LLM inference algorithm. SSD speculates verification outcomes in parallel to enable asynchronous drafting and verification, eliminating overhead from the small draft model in traditional speculative decoding by preemptively predicting and preparing likely verification paths. The method achieves up to 2x speedup over the strongest existing inference engines. |
3 | In the leadup to the highly awaited Replit Agent v4 release, Amjad Masad and Replit release a ~53 min documentary of the 'behind the scenes' leadup to Agent v3 which launched on Sep 21st 2025 |
2 | Netflix has acquired interpositive, a start-up founded by Ben Affleck that makes AI-powered tools for filmmakers. The system builds AI models from a film’s dailies to assist with postproduction tasks like color, relighting and VFX while keeping filmmakers “at the center of the process.” Bela Bajaria, Netflix’s CCO, says the tech will provide creatives “more choices, more control and more protection for their vision.” Ben Affleck has also been active in talk-shows and media appearances articulating his view on AI in the context of film making |
3 | via Robby Yung |
4 | via Ben Jammin Anthropic Report Labor market impacts of AI: A new measure and early evidence https://cdn.sanity.io/files/4zrzovbb/website/dc7bcd0224644fce97cecb7f9e68dcd8434b35f1.pdf |
5 | approx hour long conversation between Lisa Huang creator of Gemini Gems alongwith Aakash Gupta ===== Gemini Gems, Claude Projects, custom GPTs. If you're not using any of them, you're working harder than you need to. The creator of Gemini Gems walked me through her entire setup: |
2 | Paul Graham, the OG of Y Combinator hinting at the much improved capabilities of Replit v4 |
3 | via Yat Siu had an opportunity to tell Peter that crypto isn't bad for AI at all, on stage he actually said "I don't hate crypto" but do think the scammers out there scarred him very badly |
4 | via Simon Davis I created a guide to give your agent ultimate context for free by automatically hooking up your OpenClaw to notes from every meeting and discussion. Hope some of you find this useful. |
via Marc McGinley |
2 | via Tom Ho MCP, x402, claude code & Openclaw skills for CoinMarketCap |
3 | Anthropic Rolls Out Voice Mode for Claude Code Developers activate it via '/voice' and hold the spacebar to speak, streaming transcripts right into their code editor without overwriting text. The feature shines for quick ideas, refactoring tasks, and accessibility |
5 | New venture SecretSauce from my friends Simon Davis and Benjamin Chevalier Small blurb from the Forbes article which introduces their product ===== Rather than prompting from scratch each time, users upload brand assets or share a website. The system builds what the company calls a “codex”, a memory layer that encodes visual identity, tone and product rules. via Simon Davis SecretSauce encodes your brand once and uses that intelligence to produce on-brand content by default. No prompting, no fixing, no starting from zero every time. Built on a system originally developed for live games at massive scale, it gets smarter with every interaction. The longer you use it, the better it knows your brand, and the harder it becomes to replicate with anything else. I'll make sure we bump anyone from this group for the free beta. You can sign up here. |
2 | From Google Announcing Gemini 3.1 Flash-Lite! ️ Our fastest and most cost-efficient Gemini 3 series model yet. A 45 % increase in output speed and it outperforms 2.5 Flash. It also has dynamic thinking levels to match task complexity. |
3 | check out the updated skill-creator with built-in support for test generation (e.g., to measure + optimize tricky things like skill trigger rate). available in Claude Code as plugin, https://claude.ai and Cowork. |
4 | via Coop |
5 |
2 | Tanishq Kumar, Tri Dao, and Avner May from Together Compute introduce Speculative Speculative Decoding (SSD) for up to 2x faster LLM inference Researchers Tanishq Kumar, Tri Dao, and Avner May from Together Compute released SSD, a new LLM inference algorithm. SSD speculates verification outcomes in parallel to enable asynchronous drafting and verification, eliminating overhead from the small draft model in traditional speculative decoding by preemptively predicting and preparing likely verification paths. The method achieves up to 2x speedup over the strongest existing inference engines. |
3 | In the leadup to the highly awaited Replit Agent v4 release, Amjad Masad and Replit release a ~53 min documentary of the 'behind the scenes' leadup to Agent v3 which launched on Sep 21st 2025 |
2 | Netflix has acquired interpositive, a start-up founded by Ben Affleck that makes AI-powered tools for filmmakers. The system builds AI models from a film’s dailies to assist with postproduction tasks like color, relighting and VFX while keeping filmmakers “at the center of the process.” Bela Bajaria, Netflix’s CCO, says the tech will provide creatives “more choices, more control and more protection for their vision.” Ben Affleck has also been active in talk-shows and media appearances articulating his view on AI in the context of film making |
3 | via Robby Yung |
4 | via Ben Jammin Anthropic Report Labor market impacts of AI: A new measure and early evidence https://cdn.sanity.io/files/4zrzovbb/website/dc7bcd0224644fce97cecb7f9e68dcd8434b35f1.pdf |
5 | approx hour long conversation between Lisa Huang creator of Gemini Gems alongwith Aakash Gupta ===== Gemini Gems, Claude Projects, custom GPTs. If you're not using any of them, you're working harder than you need to. The creator of Gemini Gems walked me through her entire setup: |

This Week in All Things AI - Inaugural Edition
Greetings friends, Whilst you may have subscribed to my low-volume newsletter which focused on informing what had changed in the various public Notion pages that I curate, I decided to experiment with a new publication/newsletter which I hope to send on a weekly basis that aggregates what was posted the previous week on the 'All Things AI' telegram group Some of the reasons that I want to do this experimentThere are folks who have mentioned to me that they are not on Telegram and don't wish t...

This Week in All Things AI - Week 35-2025
Sunday 24th August 2025 and Saturday 30th August 2025

This Week in All Things AI - Week 39-2025
Sunday 21st September 2025 to Saturday 27th September 2025

This Week in All Things AI - Inaugural Edition
Greetings friends, Whilst you may have subscribed to my low-volume newsletter which focused on informing what had changed in the various public Notion pages that I curate, I decided to experiment with a new publication/newsletter which I hope to send on a weekly basis that aggregates what was posted the previous week on the 'All Things AI' telegram group Some of the reasons that I want to do this experimentThere are folks who have mentioned to me that they are not on Telegram and don't wish t...

This Week in All Things AI - Week 35-2025
Sunday 24th August 2025 and Saturday 30th August 2025

This Week in All Things AI - Week 39-2025
Sunday 21st September 2025 to Saturday 27th September 2025
>800 subscribers
>800 subscribers
Share Dialog
Share Dialog
No comments yet