NVIDIA / TensorRT-LLM Public

Notifications You must be signed in to change notification settings
Fork 2.5k
Star 13.9k

Code
Issues 596
Pull requests 834
Discussions
Actions
Projects
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security and quality
Insights

Pull requests: NVIDIA/TensorRT-LLM

Labels 66 Milestones 1

New pull request New

834 Open 10,630 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[None][test] Remove 60 closed-bug waive entries for main

#15511 opened Jun 21, 2026 by tensorrt-cicd Collaborator • Draft

[None][test] Waive 4 failed cases for main in QA CI

#15510 opened Jun 21, 2026 by tensorrt-cicd Collaborator • Draft

[None][test] Waive 3 failed cases for main in QA CI

#15509 opened Jun 21, 2026 by tensorrt-cicd Collaborator • Draft

[None][test] Waive 7 failed cases for main in QA CI

#15507 opened Jun 20, 2026 by tensorrt-cicd Collaborator • Draft

[None][test] Waive 11 failed cases for main in QA CI

#15506 opened Jun 20, 2026 by tensorrt-cicd Collaborator • Draft

[None][test] Waive 4 failed cases for main in QA CI

#15505 opened Jun 20, 2026 by tensorrt-cicd Collaborator • Draft

[None][test] Waive 9 failed cases for main in QA CI

#15504 opened Jun 20, 2026 by tensorrt-cicd Collaborator • Draft

[None][bugfix] Fix executor test response timeout

#15502 opened Jun 19, 2026 by fallintoplace

Loading…

[None][bugfix] Fix Mamba preloaded HF model loading

#15501 opened Jun 19, 2026 by fallintoplace

Loading…

[None][fix] Make NIXL port-lock path configurable via TRTLLM_NIXL_PORT_LOCK_PATH

#15500 opened Jun 19, 2026 by CodersAcademy006

Loading…

4 tasks done

[None][test] Waive 1 failed cases for main in QA CI

#15499 opened Jun 19, 2026 by tensorrt-cicd Collaborator • Draft

[https://nvbugs/6329227][fix] Use pkgutil.extend_path to merge the two flash_attn distributions before…

#15498 opened Jun 19, 2026 by tensorrt-cicd Collaborator

Loading…

2 tasks done

[https://nvbugs/6337228][fix] In tests/unittest/tools/test_layer_wise_benchmarks.py, replace check_call with…

#15497 opened Jun 19, 2026 by tensorrt-cicd Collaborator

Loading…

2 tasks done

[https://nvbugs/6316980][fix] Added a runtime guard in FlashInferTrtllmGenAttention.is_supported using the…

#15496 opened Jun 19, 2026 by tensorrt-cicd Collaborator

Loading…

2 tasks done

[https://nvbugs/6341070][fix] Pass kv_cache_free_gpu_memory_fraction=0.5 to TRTLLMWorker.init_with_new_llm…

#15495 opened Jun 19, 2026 by tensorrt-cicd Collaborator

Loading…

2 tasks done

[https://nvbugs/6329165][fix] In TestNemotronNanoV3.test_accuracy, mocker.patch.dict GSM8K.EVALUATE_KWARGS…

#15494 opened Jun 19, 2026 by tensorrt-cicd Collaborator

Loading…

2 tasks done

[feat] gRPC: implement SubscribeKvEvents for KV-cache event streaming

#15493 opened Jun 19, 2026 by key4ng • Draft

[https://nvbugs/6341072][fix] Change the model_name fixture to "llama-models-v2/TinyLlama-1.1B-Chat-v1.0"…

#15492 opened Jun 19, 2026 by tensorrt-cicd Collaborator

Loading…

2 tasks done

[https://nvbugs/6337233][fix] In _build_fake_self, bind PyExecutor._is_stats_dummy_request onto the fake…

#15491 opened Jun 19, 2026 by tensorrt-cicd Collaborator

Loading…

2 tasks done

[https://nvbugs/6337238][fix] Test-only fix — use the existing get_hf_rope_theta helper, adapt the…

#15490 opened Jun 19, 2026 by tensorrt-cicd Collaborator

Loading…

2 tasks done

[https://nvbugs/6337231][fix] Replace all 9 self._is_stats_dummy_request(req) calls with…

#15489 opened Jun 19, 2026 by tensorrt-cicd Collaborator

Loading…

2 tasks done

[https://nvbugs/6335726][fix] In test_qwen_moe_routed_expert_multi_lora_varying_ranks, drop ranks…

#15488 opened Jun 19, 2026 by tensorrt-cicd Collaborator

Loading…

2 tasks done

[https://nvbugs/6337226][fix] Switch max(positive_hits) → min(positive_hits) in both the range and…

#15487 opened Jun 19, 2026 by tensorrt-cicd Collaborator

Loading…

2 tasks done

[https://nvbugs/6317600][fix] Add an early return at the head of _run_attention_warmup when…

#15486 opened Jun 19, 2026 by tensorrt-cicd Collaborator

Loading…

2 tasks done

[None][feat] support per-layer mixed-precision MoE serving (GLM, Qwen3-MoE)

#15485 opened Jun 19, 2026 by joshua-hill

Loading…

Previous 1 2 3 4 5 … 33 34 Next

Previous Next

ProTip! What’s not been updated in a month: updated:<2026-05-21.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!