CodeGeeX4/repodemo/llm/local/codegeex4.py

from typing import Iterator

import torch
from transformers import AutoModel, AutoTokenizer


class CodegeexChatModel:
    """Chat wrapper around a locally loaded CodeGeeX4 model."""

    # Instance attributes. This class is not a pydantic BaseModel, so they are
    # declared as plain annotations and comments rather than Field(...) defaults.
    device: str  # device to load the model on
    # tokenizer: the model's tokenizer (assigned in __init__)
    # model: the CodeGeeX model (assigned in __init__)
    temperature: float  # not set in __init__; chat methods take it per call

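    def __init__(self, model_name_or_path: str):
        super().__init__()
        # Prefer the GPU when one is available; otherwise fall back to the CPU.
        self.device = "cuda" if torch.cuda.is_available() else "cpu"
        # trust_remote_code=True runs the Python code shipped with the
        # checkpoint, so only load checkpoints from a trusted source.
        self.tokenizer = AutoTokenizer.from_pretrained(
            model_name_or_path, trust_remote_code=True
        )
        self.model = (
            AutoModel.from_pretrained(model_name_or_path, trust_remote_code=True)
            .to(self.device)
            .eval()
        )
        print("Model has been initialized.")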

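    def chat(self, prompt, temperature=0.2, top_p=0.95):
        """Return the model's reply to `prompt`.

        On failure, an "error: ..." string is returned instead of raising.
        """
        try:
            # The second element of the returned pair is discarded.
            response, _ = self.model.chat(
                self.tokenizer,
                query=prompt,
                max_length=120000,
                temperature=temperature,
                top_p=top_p,
            )
            return response
        except Exception as e:
            # Same message format as stream_chat.
            return f"error: {e}"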

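    def stream_chat(self, prompt, temperature=0.2, top_p=0.95) -> Iterator[str]:
        """Yield the model's reply to `prompt` as it is generated.

        On failure, a single "error: ..." string is yielded instead of raising.
        """
        try:
            for response, _ in self.model.stream_chat(
                self.tokenizer,
                query=prompt,
                max_length=120000,
                temperature=temperature,
                top_p=top_p,
            ):
                yield response
        except Exception as e:
            yield f"error: {e}"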
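
A minimal sketch of how a caller might consume `stream_chat`. It assumes, as in ChatGLM-style `stream_chat` implementations, that each yielded value is the full response so far rather than only the newly generated text; the code above does not confirm this. `StubChatModel` and `iter_deltas` are hypothetical names introduced for illustration, and the stub stands in for `CodegeexChatModel` so the example runs without loading a checkpoint.

```python
from typing import Iterable, Iterator


class StubChatModel:
    """Hypothetical stand-in exposing the same stream_chat signature."""

    def stream_chat(self, prompt, temperature=0.2, top_p=0.95) -> Iterator[str]:
        # Assumption: every yielded value is the whole response so far.
        text = ""
        for piece in ["def add(a, b):", "\n    return", " a + b"]:
            text += piece
            yield text


def iter_deltas(snapshots: Iterable[str]) -> Iterator[str]:
    """Yield only the text added since the previous snapshot."""
    previous = ""
    for snapshot in snapshots:
        if snapshot.startswith(previous):
            yield snapshot[len(previous):]
        else:
            # The snapshot does not extend the previous one (for example an
            # "error: ..." message), so pass it through unchanged.
            yield snapshot
        previous = snapshot


if __name__ == "__main__":
    model = StubChatModel()
    for delta in iter_deltas(model.stream_chat("write an add function")):
        print(delta, end="", flush=True)
    print()
```

With the real class, the same helper would wrap the generator returned by `stream_chat` on a `CodegeexChatModel` instance. If the model turns out to yield only new text on each step, the helper is unnecessary and the yielded values can be printed directly.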

Commit history:
2024-07-08 07:17:28 +00:00  repodome: Update dependencies and add local model
2024-07-08 08:00:04 +00:00  Fixed errors in the code, optimized the streaming chat feature, raised the maximum length limit, and added error handling.
2024-07-09 03:37:30 +00:00  fix pep8 error