remove redundant empty_cache in parallel forward by akaitsuki-ii · Pull Request #161 · modelscope/DiffSynth-Engine

akaitsuki-ii · 2025-09-01T13:20:35Z

每次parallel forward过程中调用empty_cache会降低推理速度，只在unload_module的时候调用即可

gemini-code-assist

Summary of Changes

Hello @akaitsuki-ii, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

此拉取请求旨在通过精简并行处理中的内存管理操作来显著提升推理速度。核心改动是移除了每次前向传播后不必要的empty_cache()调用和torch.cuda.synchronize()，从而减少了计算资源的浪费和同步延迟。现在，内存清理操作仅在模块卸载时执行，确保了效率和正确性。

Highlights

优化推理速度: 移除了并行前向传播中冗余的empty_cache()调用，将其移至模块卸载时执行，以避免不必要的性能开销。
内存管理: 删除了每次迭代后的torch.cuda.synchronize()调用，进一步减少了同步开销。
模块加载逻辑: 调整了模块加载时的参数处理，不再对init_fn和kwargs调用to_device，简化了数据传递。

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point in your pull request via creating an issue comment (i.e. comment on the pull request page) using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in issue comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

本次变更通过移除并行前向传播过程中的冗余 empty_cache 调用来优化性能。主要改动包括：将 empty_cache 调用移至更合适的 unload_module 处理器中，并从每个工作循环的热路径中移除了 empty_cache 和 torch.cuda.synchronize，这符合性能优化的目标。此外，本次变更还修正了 load_module 在处理参数时一个潜在的 to_device 调用问题。总体来看，这些改动是合理且有益的。

remove redundant empty_cache in parallel forward

7425944

akaitsuki-ii requested a review from a team September 1, 2025 13:20

gemini-code-assist bot reviewed Sep 1, 2025

View reviewed changes

qzzz95 approved these changes Sep 1, 2025

View reviewed changes

qzzz95 merged commit 665f74a into main Sep 1, 2025

qzzz95 deleted the fix_parallel branch September 1, 2025 13:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

remove redundant empty_cache in parallel forward#161

remove redundant empty_cache in parallel forward#161
qzzz95 merged 1 commit intomainfrom
fix_parallel

akaitsuki-ii commented Sep 1, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

akaitsuki-ii commented Sep 1, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants