summarize your thread for the model, not the other way around. neynar's lookup-cast-conversation-summary returns a compact convo summary plus recent interaction history in one call so your agent doesn't need to fetch every cast or manage per-user state.
for example, when building a reply bot for long threads call the summary endpoint, feed the summary to your llm, and generate focused replies with far fewer tokens and fewer race conditions.