Potential Bitblas compat with cuda 12.5

While doing ci testing for GPTQModel we found that with cuda 12.5, bitblas is generating broken compiled codes via apache/tvm. There are no errors. The end result is no runtime error but cache model gpu code (in .cached folder) generates non-sense using `backend=BACKEND.BITBLAS`: failed our PPL sanity test.  We copied over .cache/generated files from < cuda 12.5 and it works which isolates the issue to cuda 12.5/apache tvm. 


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Potential Bitblas compat with cuda 12.5 #120

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Potential Bitblas compat with cuda 12.5 #120

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions