Bot stuck due to high load caused by mongo #488

ihor-chaban · 2024-12-09T15:16:08Z

As the title says, since I've been using the bot with the latest changes (master HEAD, gpt-4o model), I've noticed that it sometimes becomes unresponsive.
Then I went to a server to check what was going on and I couldn't even log into my server because of the crazy-high load caused by the bot (Load Average 25+ on 1 CPU server).
When I finally logged into the server I found that the load was caused by the mongo container.
I saw it does constant read/write operations at a rate of ~45-50 M/s.
But I couldn't get any relevant logs or other useful information because even simple commands like top or ps took a while to process under such load, not to mention any docker commands.
After killing mongod --bind_ip_all process the server went alive again.
I'm the only user of my bot and I use it quite rarely so it couldn't be any justifiable load.
I've tried deleting all files and rebuilding the bot image with clean MongoDB but it always happens again after some time.
I've never seen such a problem before v1.5 and gpt-4o model, so it must be related to the latest changes.
For now, I'm reverting to older releases, even though they don't have gpt-4o model.
Has anyone else had this problem? It's hard to debug because when it happens the server is unresponsive.

The text was updated successfully, but these errors were encountered:

ihor-chaban · 2024-12-14T03:32:13Z

This is the only recent change related to DB:
752f38b#diff-d7094880f6e1845d792ec1cb547780e39276fc2fb13321ce3b52e393fc1755a7L462-R469

Probably under some conditions it gets stuck in a loop which causes DB operation to go crazy.

Unfortunately, v1.5 release and all changes after feel like a step backwards because of the amount of major issues, missed bugs and the code not being reviewed properly.

ihor-chaban · 2024-12-15T18:47:24Z

https://github.com/father-bot/chatgpt_telegram_bot/blob/main/bot/bot.py#L462-L476

if current_model == "gpt-4-vision-preview" or current_model == "gpt-4o" or update.message.photo is not None and len(update.message.photo) > 0:
    ...
    if current_model != "gpt-4o":
        ...
    task = asyncio.create_task(_vision_message_handle_fn(...
else
    task = asyncio.create_task(message_handle_fn(...

These conditions make no sense together for a number of reasons:

It will always create _vision_message_handle_fn taks if current model is "gpt-4-vision-preview" or "gpt-4o" regardless if the message has any photo included or not. There is no way to create a regular message_handle_fn task with these models selected.
update.message.photo is not None and len(update.message.photo) > 0 why check both type and length? If optional parameter is not set it will be None.
Parent condition has current_model == "gpt-4o" and then child condition is current_model != "gpt-4o", it looks odd and contradicts itself.

I would change this code block to:

        if update.message.photo:
            if current_model != "gpt-4o" and current_model != "gpt-4-vision-preview":
                current_model = "gpt-4o"
                db.set_user_attribute(user_id, "current_model", "gpt-4o")
            task = asyncio.create_task(
                _vision_message_handle_fn(
                    update, context, use_new_dialog_timeout=use_new_dialog_timeout)
            )
        else:
            task = asyncio.create_task(
                message_handle_fn()
            )

I'm not sure if this will resolve the DB overload issue, but it looks much better than the original code.
I will test it for some time to see if I run into the same issue again.

ihor-chaban · 2024-12-15T18:48:46Z

Also, why not set "gpt-4o" model by default?
This could be changed in:

https://github.com/father-bot/chatgpt_telegram_bot/blob/main/bot/bot.py#L91 -> db.set_user_attribute(user.id, "current_model", config.models["available_text_models"][-1])
https://github.com/father-bot/chatgpt_telegram_bot/blob/main/bot/bot.py#L569 -> db.set_user_attribute(user_id, "current_model", "gpt-4o")
https://github.com/father-bot/chatgpt_telegram_bot/blob/main/bot/database.py#L48 -> "current_model": config.models["available_text_models"][-1],
https://github.com/father-bot/chatgpt_telegram_bot/blob/main/bot/openai_utils.py#L28 -> def __init__(self, model="gpt-4o"):

It's pretty stupid that there is no single place to easily change the default model.
These all could be a simple parameter in config/models.yml or config/config.yml

Is there any reason to keep obsolete models in the list?

ihor-chaban changed the title ~~Bot to stuck due to high load caused by mongo~~ Bot stuck due to high load caused by mongo Dec 9, 2024

ihor-chaban mentioned this issue Dec 23, 2024

Fix DB issue and change default model to gpt-4o #490

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bot stuck due to high load caused by mongo #488

Bot stuck due to high load caused by mongo #488

ihor-chaban commented Dec 9, 2024

ihor-chaban commented Dec 14, 2024 •

edited

Loading

ihor-chaban commented Dec 15, 2024

ihor-chaban commented Dec 15, 2024 •

edited

Loading

Bot stuck due to high load caused by mongo #488

Bot stuck due to high load caused by mongo #488

Comments

ihor-chaban commented Dec 9, 2024

ihor-chaban commented Dec 14, 2024 • edited Loading

ihor-chaban commented Dec 15, 2024

ihor-chaban commented Dec 15, 2024 • edited Loading

ihor-chaban commented Dec 14, 2024 •

edited

Loading

ihor-chaban commented Dec 15, 2024 •

edited

Loading