Mac runtime artifact Parked 2026-06-17

ANE M8 Core ML Chain

This artifact maps the boundary between owning the model and using Apple’s acceleration stack. It is parked because capability comes from our model/eval loop; Core ML is a deployment target, not the product center.

Headline Numbers

ANE decode

17 tok/s Qwen3 28-block layer-chunked chain

FoundationModels context

4096 too small for real tool catalogs

Action grounding

25% FoundationModels BFCL agentic full catalog

Competitive Context

System Metric Score Size / Class Comparable? Readout
TinyGPT Qwen3 Core ML chain ANE decode 17 tok/s 28-block Qwen3 path Direct Runtime experiment only; capability still comes from TinyGPT weights and evals.
Apple FoundationModels action grounding / context 25% / 4096 tokens Apple on-device model Directional Useful as a free local floor, but too weak to be the specialist capability dependency.
TinyGPT active MLX path specialist eval readiness active owned weights Directional Preferred competition lane: model quality first, runtime optimization second.

Direct rows share this artifact's eval setup. Directional rows are useful market context but should not be read as leaderboard claims.

Platform stance

PathDecisionReason
Apple FoundationModelsRouting floor onlyWeak action grounding and short context
Our weights -> Core MLOptional futureBattery/perf optimization if capability is already solved
MLX/TinyGPT runtimeActiveOwn the model and eval gate

Release Blockers

Capability dependency rejected

Apple's model cannot be the differentiator; it is a free local floor at best.

Unblock: Only revive Core ML when a shipped TinyGPT specialist needs a battery/runtime optimization.

Evidence

Next Release Action

Leave parked. Keep the numbers public as boundary-mapping evidence.