Feat: Integrate Gemma3 weight mappings, vLLM adapter, and E2E pipelines by RexBearIU · Pull Request #4068 · AI-Hypercomputer/maxtext

RexBearIU · 2026-06-04T14:19:51Z

Description

Integrates Gemma3 with the MaxText-vLLM adapter and end-to-end post-training pipelines, adding full weight conversions, logical partition rules, and verification scripts for post-SFT serving.

Note: This PR depends on PR #4066 (Gemma3/4 decoders fixes) and PR #4067 (LoRA restore refactoring) being merged first.

Key Changes:

Gemma3 Weight Mapping (src/maxtext/integration/tunix/weight_mapping/gemma3.py): Added full weight mappings for attention queries, keys,
values, output projections, and gated MLPs.
vLLM Adapter Integration (src/maxtext/integration/vllm/maxtext_vllm_adapter/adapter.py): Integrated context initialization and logical
partition rules for Gemma3 serving.
E2E Integration Script (tests/end_to_end/tpu/gemma3/4b/test_gemma3_lora.sh): Added end-to-end shell test script validating full-pass
generation correctness, logit matches, and successful decoding post-SFT.

Tests

Validated full post-SFT weight loading and end-to-end decoding steps on TPU.
Executed integration script:

bash tests/end_to_end/tpu/gemma3/4b/test_gemma3_lora.sh

Checklist

Before submitting this PR, please make sure (put X in square brackets):

I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
I have necessary comments in my code, particularly in hard-to-understand areas.
I have run end-to-end tests tests and provided workload links above if applicable.
I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.

codecov · 2026-06-04T14:30:08Z

Codecov Report

❌ Patch coverage is 25.00000% with 42 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/maxtext/integration/tunix/utils.py	0.00%	28 Missing ⚠️
...maxtext/integration/tunix/weight_mapping/gemma3.py	60.00%	8 Missing ⚠️
...t/integration/vllm/maxtext_vllm_adapter/adapter.py	20.00%	4 Missing ⚠️
...xtext/integration/tunix/weight_mapping/__init__.py	33.33%	2 Missing ⚠️

📢 Thoughts on this report? Let us know!

NuojCheng · 2026-06-30T22:12:28Z

+  rollout_vllm_additional_config = {
+      "maxtext_config": {
+          "model_name": config.model_name,
+          "weight_dtype": "bfloat16",


is it a good idea hardcoded weight dtype being bf16? waht about config.weight_dtype

NuojCheng · 2026-06-30T22:15:10Z

+    vllm_hf_overrides='{architectures: ["MaxTextForCausalLM"]}' \
+    hbm_utilization_vllm=0.6 \
+    prompt="Suggest some famous landmarks in London." \
+    use_chat_template=True scan_layers=false


new line for sca_layers=false

NuojCheng · 2026-06-30T22:15:38Z

+
+# Step 4: Run inference on the checkpoint generated from the previous run
+python3 -m maxtext.inference.vllm_decode \
+    --use_tunix=True \


…ving integration

RexBearIU force-pushed the jackyf/gemma3-lora-e2e-integration branch from e16648e to faf1c26 Compare June 4, 2026 14:23

RexBearIU mentioned this pull request Jun 4, 2026

feat: Add Gemma3 LoRA SFT integration and programmatic weight mapping… #4004

Closed

4 tasks

RexBearIU marked this pull request as ready for review June 4, 2026 15:02

RexBearIU requested a review from darisoy as a code owner June 4, 2026 15:02

RexBearIU changed the title ~~feat(lora): Add Gemma3 weight mappings, vLLM adapter serving, and end…~~ Integrate Gemma3 weight mappings, vLLM adapter, and E2E pipelines Jun 8, 2026

RexBearIU changed the title ~~Integrate Gemma3 weight mappings, vLLM adapter, and E2E pipelines~~ Feat: Integrate Gemma3 weight mappings, vLLM adapter, and E2E pipelines Jun 8, 2026

RexBearIU force-pushed the jackyf/gemma3-lora-e2e-integration branch from faf1c26 to 50963ee Compare June 8, 2026 07:19

RexBearIU force-pushed the jackyf/gemma3-lora-e2e-integration branch 2 times, most recently from 27fc440 to eadf862 Compare June 26, 2026 08:14

NuojCheng reviewed Jun 30, 2026

View reviewed changes

NuojCheng approved these changes Jun 30, 2026

View reviewed changes

feat(gemma3): add Gemma-3 LoRA end-to-end (E2E) post-training and ser…

f93d241

…ving integration

RexBearIU force-pushed the jackyf/gemma3-lora-e2e-integration branch from eadf862 to f93d241 Compare July 1, 2026 08:43

RexBearIU requested a review from xibinliu as a code owner July 1, 2026 08:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feat: Integrate Gemma3 weight mappings, vLLM adapter, and E2E pipelines#4068

Feat: Integrate Gemma3 weight mappings, vLLM adapter, and E2E pipelines#4068
RexBearIU wants to merge 1 commit into
mainfrom
jackyf/gemma3-lora-e2e-integration

RexBearIU commented Jun 4, 2026 •

edited

Loading

Uh oh!

codecov Bot commented Jun 4, 2026 •

edited

Loading

Uh oh!

NuojCheng Jun 30, 2026

Uh oh!

NuojCheng Jun 30, 2026

Uh oh!

NuojCheng Jun 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

RexBearIU commented Jun 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Tests

Checklist

Uh oh!

codecov Bot commented Jun 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

NuojCheng Jun 30, 2026

Choose a reason for hiding this comment

Uh oh!

NuojCheng Jun 30, 2026

Choose a reason for hiding this comment

Uh oh!

NuojCheng Jun 30, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

RexBearIU commented Jun 4, 2026 •

edited

Loading

codecov Bot commented Jun 4, 2026 •

edited

Loading