Add coreml_compute_plan.py: report which CoreML ops dispatch to ANE / GPU / CPU by john-rocky · Pull Request #19252 · pytorch/executorch

john-rocky · 2026-05-01T05:53:25Z

Summary

CoreML decides at compile/load time which device each MIL operation will
execute on, and coremltools 9.0+ exposes that through MLComputePlan.
The recurring question on the issue tracker is "why isn't my model
running fully on the ANE?" — for example:

llama model is not fully lowered to ANE (coreml backend) #4091 — llama model is not fully lowered to ANE
CoreML model is crashing on iPhone GPU, but not on iPhone CPU or macOS GPU #11541 — CoreML model is crashing on iPhone GPU, but not on iPhone CPU or macOS GPU
ANE compile OOMs on certain input shapes #8439 — ANE compile OOMs on certain input shapes
CPU Overhead After ANE Execution #8445 — CPU Overhead After ANE Execution

Today the only way for an ExecuTorch user to answer it is to break out
Swift / Xcode. This PR adds a Python wrapper around MLComputePlan so
the answer is one shell command:

$ python coreml_compute_plan.py --model_path my_model.mlpackage \
      --compute_units cpu_and_ne --show_non_ane

=== my_model.mlpackage ===
  ANE:   412 / 480 ( 85.8%)
  CPU:    68 / 480 ( 14.2%)

  Non-ANE op types:
       32  ios17.cast
       18  ios17.gather
       12  ios17.reshape
        6  ios17.constexpr_blockwise_shift_scale

Inputs supported:

Input	Behavior
`.pte`	Extract every Core ML partition into a tempdir, then analyze each.
`.mlpackage`	Compile to `.mlmodelc` in a tempdir, then analyze.
`.mlmodelc`	Analyze directly.

The PTE path reuses the same JSON/named-data extraction logic that
extract_coreml_models.py uses, and is inlined into the script so it can
be run against a plain CoreML model without depending on the executorch
package.

Test plan

Added test_coreml_compute_plan.py covering:

_device_name(...) for None and a stub MLNeuralEngineComputeDevice.
_COMPUTE_UNIT_CHOICES mapping (cpu_and_ne / all).
analyze_one(...) end-to-end on a tiny relu(x @ x.T) + x.sum()
mlpackage built with coremltools.convert(...): returns rows for
every dispatched op, with a main function and the expected MIL op
types (matmul, relu, add, reduce_sum).

$ python -m pytest examples/apple/coreml/scripts/test_coreml_compute_plan.py -v
============================== 7 passed in 3.68s ===============================

I also ran the script against a few hand-built .mlpackage and
.mlmodelc files on macOS 26 with coremltools 9.0 and verified the
output matches what MLComputePlan returns directly.

Authored with Claude.

CoreML decides at compile/load time which device each MIL operation will execute on; that decision is exposed through MLComputePlan in coremltools 9.0+. This script wraps it so users can answer 'why isn't my model running on the ANE?' without writing Swift, which is the recurring question behind issues like pytorch#4091, pytorch#11541, and pytorch#8439. Inputs supported: * .pte — extracts every Core ML partition first. * .mlpackage — compiles to .mlmodelc in a tempdir. * .mlmodelc — analyzed directly. Reports per-op dispatch (ANE / GPU / CPU), an aggregate breakdown, and optionally the op types that did not get assigned to the ANE (--show_non_ane). Authored with Claude.

pytorch-bot · 2026-05-01T05:53:29Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19252

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2026-05-01T05:54:08Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

john-rocky requested a review from metascroy as a code owner May 1, 2026 05:53

meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label May 1, 2026

john-rocky mentioned this pull request May 1, 2026

Add Gemma 4 text-decoder export to CoreML #19253

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add coreml_compute_plan.py: report which CoreML ops dispatch to ANE / GPU / CPU#19252

Add coreml_compute_plan.py: report which CoreML ops dispatch to ANE / GPU / CPU#19252
john-rocky wants to merge 1 commit intopytorch:mainfrom
john-rocky:coreml/compute-plan-analyzer

john-rocky commented May 1, 2026

Uh oh!

pytorch-bot Bot commented May 1, 2026

Uh oh!

github-actions Bot commented May 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

john-rocky commented May 1, 2026

Summary

Test plan

Uh oh!

pytorch-bot Bot commented May 1, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19252

Uh oh!

github-actions Bot commented May 1, 2026

This PR needs a release notes: label

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

This PR needs a `release notes:` label