Files
kaizen/suite/auto-sync/README.md
irisz64 16a2cf3873 Squashed 'external/capstone/' changes from b102f1b8..5af28808
5af28808 Update Auto-Sync to Python 3.13 and tree-sitter-py 24.0 (#2705)
99f018ac Python binding: (#2742)
a07baf83 Auto-Sync update Sparc LLVM-18 (#2704)
81c5c93d Enable to generate legacy MC tests for the fuzzer. (#2733)
a25d4980 Add warning about naive search and replace to patch reg names. (#2728)
7ac87d17 Print immediate only memory operands for AArch64. (#2732)
c34034c8 Add x30 implicit read to the RET alias. (#2739)
95a4ca3e Update source list before installing valgrind. (#2730)
6909724e Make assertion hit warnings optional in release builds. (#2729)
fe6bdc6e Make SStream respect the CS_OPT_UNSIGNED flag. (#2723)
21ce3624 Use cs_ac_type for operand access mode in all arches and use cs_xtensa_op_type for Xtensa operand type (#2721)
df26583f clang-format: change license to BSD-3-Clause (#2724)
280b749e Remove unused files. (#2709)
87908ece Add flag for the SoftFail case of the LLVM disassembler. (#2707)
efc0ba44 Fix missing operand for smstart, due to space replaced by tab (#2720)
2ae64133 Fix missing sp register read in ret instruction (#2719)
8df252a6 Fix arm pop reg access (#2718)
14612272 ARM: fix typo, cspr -> cpsr (#2716)
f2f0a3c3 Fix LoongArch ld/st instructions register info (#2701)
829be2bf LoongArch: Compute absolute address for address operand (#2699)
42fbce6c Add jump group for generic jirl (#2698)
fc525c73 Apple AArch64 proprietary (#2692)
895f2f2e Build PDB for debugging on Windows (#2685)
5c3aef03 Version: Update to v6.0.0-alpha4 (#2682)
106f7d3b Update read/written registers for x87 comparison instructions (#2680)
ebe3ef2a Add workflow for building on Windows (#2675)
72f7d305 Revert "Add a script to compare the inc file content with the latest generate…" (#2678)
5b5c5ed8 Fix nanomips decoding of jalrc (#2672)
ae03cca4 Mips32r6_64r632 is for both mips32r6 and mips64r6 (#2673)
21178aea Add a script to compare the inc file content with the latest generated ones. (#2667)
81a6ba03 MIPS: Fix MIPS16 decoding, wrong flags and ghost registers (#2665)
98a393e3 Stringify BH fields when printing ppc details (#2663)
2607d0f3 Remove undefined constants in riscv_const.py (#2660) (#2661)
5058c634 Decode BH field in print_insn_detail_ppc (#2662)
6461ed08 Add Call group to svc, smc and hvc. (#2651)
e2f1dc8d Tms32c64x Little Endian (#2648)
5464c91d Fix build for compilers requiring explicit static for inline functions.. (#2645)
bb2f6579 Enhance shift value and types of shift instructions. (#2638)
cd282ef5 Update operand type enums of all arch modules to the one in `capstone.h` (#2633)
dc0c0909 cmake: Fix building capstone as sub-project (#2629)
cd8dd20c - Added missing files for sdist archive (#2624)
9affd99b Give the user some guidance where to add missing enumeration values. (#2639)
1bea3fab Add checks for MIPS details on cstest_py (#2640)
ace8056c Add aliases mapping for MIPS & test for id, alias_id (#2635)
1abe1868 Build Tarball before DEB/RPM package. (#2627)
0a012190 Switch to ubuntu-24.04-arm runner image (#2625)
4e0b8c48 Fix wrong version requirement of tricore instructions: (#2620)
8ac2843b chore(version): Update Version to 6.0.0-Alpha3 (#2616)
d7ef910b Rebased #2570 (#2614)
c831cd5e Fix SystemZ macro in Makefile (#2603)
30601176 Apply new EVM opcode updates (#2602)
3c4d7fc8 Add tricore tc1.8 instructions (#2595)
5f290cad Create debian and rpm package on releases (#2590)
0f09210a delete travis (#2600)
5c5f756f Downgrade labeler to v4 due to https://github.com/actions/labeler/issues/710. (#2598)

git-subtree-dir: external/capstone
git-subtree-split: 5af288083e9f03e32723f9708c305692f866b666
2025-06-26 22:15:44 +02:00

5.5 KiB

Architecture updater - Auto-Sync

Auto-Sync is the architecture update tool for Capstone. Because the architecture modules of Capstone use mostly code from LLVM, we need to update this part with every LLVM release. Auto-Sync helps with this synchronization between LLVM and Capstone's modules by automating most of it.

Please refer to intro.md for an introduction about this tool.

Install

Setup Python environment and Tree-sitter

cd <root-dir-Capstone>
# Python version must be at least 3.11
sudo apt install python3-venv
# Setup virtual environment in Capstone root dir
python3 -m venv ./.venv
source ./.venv/bin/activate

Install Auto-Sync framework

cd suite/auto-sync/
pip install -e .

Clone Capstones LLVM fork and build llvm-tblgen

git clone https://github.com/capstone-engine/llvm-capstone vendor/llvm_root/
cd vendor/llvm_root/llvm-capstone
git checkout auto-sync
mkdir build
cd build
# You can also build the "Release" version
cmake -G Ninja -DCMAKE_BUILD_TYPE=Debug ../llvm
cmake --build . --target llvm-tblgen --config Debug
cd <capstone-root>/suite/auto-sync/

Install llvm-mc and FileCheck

Additionally, we need llvm-mc and FileCheck to generate our regression tests. You can build it, but it will take a lot of space on your hard drive. You can also get the binaries here or install it with your package manager (usually something like llvm-18-dev). Just ensure it is in your PATH as llvm-mc and FileCheck (not as llvm-mc-18 or similar though!).

Architecture

Please read ARCHITECTURE.md to understand how Auto-Sync works.

This step is essential! Please don't skip it.

Update an architecture

Updating an architecture module to the newest LLVM release, is only possible if it uses Auto-Sync. Not all arch-modules support Auto-Sync yet.

Check if your architecture is supported.

ASUpdater -h

Run the updater

ASUpdater -a <ARCH>

Update procedure

  1. Run the ASUpdater script.
  2. Compare the functions in <ARCH>DisassemblerExtension.* to LLVM (search the function names in the LLVM root) and update them if necessary (some architectures don't have this file).
  3. Try to build Capstone and fix the build errors.

Post-processing steps

This update translates some LLVM C++ files to C. Because the translation is not perfect (maybe it will some day) you will get build errors if you try to compile Capstone.

The last step to finish the update is to fix those build errors by hand.

Refactor an architecture

Not all architecture modules support Auto-Sync yet. Here is an overview of the steps to add support for it.


To refactor one of them to use Auto-Sync please follow the RefactorGuide.md

Adding a new architecture

Adding a new architecture follows the same steps as above. With the exception that you need to implement all the Capstone files from scratch.

Check out an Auto-Sync supporting architectures for guidance and open an issue if you need help.

Additional details

Overview updated files

This is a rough overview what files of an architecture are updated and where they are coming from.

Files originating from LLVM (Automatically updated)

These files are LLVM source files which were translated from C++ to C Not all the listed files below are used by each architecture. But those are the most common.

  • <ARCH>Disassembler.*: Bytes to MCInst decoder.
  • <ARCH>InstPrinter.* or <ARCH>AsmPrinter.*: MCInst to asm string decoder.
  • <ARCH>BaseInfo.*: Commonly use functions and definitions.

*.inc files are exclusively generated by LLVM TableGen backends:

*.inc files for the LLVM component are named like this:

  • <ARCH>Gen*.inc (note: no CS in the name)

Additionally, we generate more details for Capstone with llvm-tblgen. Like enums, operand details and other things.

They are saved also to *.inc files, but have the CS in the name to make them distinct from the LLVM generated files.

  • <ARCH>GenCS*.inc

Capstone module files (Not automatically updated)

Those files are written by us:

  • <ARCH>DisassemblerExtension.* All kind of functions which are needed by the LLVM component, but could not be generated or translated.
  • <ARCH>Mapping.*: Binding code between the architecture module and the LLVM files. This is also where the detail is set.
  • <ARCH>Module.*: Interface to the Capstone core.

Relevant documentation and troubleshooting

LLVM file translation

For details about the C++ to C translation of the LLVM files refer to CppTranslator/README.md.

Generated .inc files

Documentation about the .inc file generation is in the llvm-capstone repository.

Troubleshooting

  • If some features aren't generated and are missing in the .inc files, make sure they are defined as AssemblerPredicate in the .td files.

    Correct:

    def In32BitMode  : Predicate<"!Subtarget->isPPC64()">,
      AssemblerPredicate<(all_of (not Feature64Bit)), "64bit">;
    

    Incorrect:

    def In32BitMode  : Predicate<"!Subtarget->isPPC64()">;
    

Formatting

  • If you make changes to the CppTranslator please format the files with black and usort
    pip3 install black usort
    python3 -m usort format src/autosync
    python3 -m black src/autosync