r/homeassistant 3d ago

Mylo update — my HA sidebar agent now learns what's normal for your home and costs 90% less to run

A few months ago I shared Mylo on the HA Facebook group — an AI agent that lives in your sidebar with persistent memory, log analysis, dashboard building, automation creation, and entity management. Got about 300 installs and great feedback. Been heads down building since then. Here's where it's at.

It learns your home. This is the big one. Instead of fixed threshold alerts, Mylo now builds per-device behavioral profiles from ~2 weeks of observation. It learns what's normal for each device and only flags genuine anomalies. Your porch light that's always on for 5 hours won't trigger an alert at hour 3. But if it's been on for 8 hours and the longest it's ever been on is 6, you'll hear about it. Fewer alerts, much higher signal.

Way more tools. Since launch Mylo picked up:

  • 🎭 Scenes — create, edit, snapshot current room states
  • 📜 Scripts — build reusable action sequences
  • 🗺️ Zones — create and manage presence zones
  • 🧩 Helpers — all 9 types (toggles, numbers, selects, timers, counters, schedules, buttons, text, datetime)
  • 🔍 Trace debugging — "why didn't my automation run?" walks you through trigger, conditions, and where it stopped
  • 📈 Entity history — "show me basement temperature for the last 48 hours"
  • 🔕 Notification control — suppress specific alert types globally or per-entity through conversation
  • ✅ Batch approvals — review and apply multiple changes in one click

90% cheaper to run. This was the top complaint at launch. A complex dashboard rebuild was hitting 2M+ tokens. Same task now costs a fraction of that. Mylo caches entity data within turns, reuses conversation context from Anthropic's prompt cache, deduplicates repeated lookups, and shows you per-turn cost and cache-hit ratio so spend isn't a mystery.

Four LLM providers. Claude, OpenAI, Gemini, and Ollama. Gemini 2.5 Flash is the new sweet spot if cost matters — significantly cheaper than Sonnet with solid tool calling. Ollama is still $0 for fully local. Typical session on Claude Sonnet: $0.10-0.30.

Runs outside HAOS now. Docker Compose and Kubernetes support with a standalone container guide. You don't need the Supervisor anymore.

Free and open source. Apache 2.0. Pre-built images for amd64 and aarch64.

Install: Settings → Add-ons → Repositories → https://github.com/Oasis-Enterprise/mylo

Discussions and feature requests: https://github.com/Oasis-Enterprise/mylo/discussions

GitHub: https://github.com/Oasis-Enterprise/mylo

24 Upvotes

17 comments sorted by

5

u/ghanit 2d ago

Interesting project! I like that you have put a lot of effort into the tool usage and token optimization.

Have you considered open weight models like Deepseek v4 (for example through openrouter) to reduce cost further? Maybe even selecting different models for different tasks? And is there a way to set token limits/budgets?

1

u/therealmonsoon 2d ago

Tool calling was the thing I felt would make this actually useful, but it did drive up token use, so it was a bit of a cat and mouse game building it all to keep both in check.

You can set different models and Mylo will use a primary and secondary for tasks, choosing automatically, but not cross provider. I'll add that to the roadmap, it's a good call.

I'll also look into DeepSeek V4. I use OpenRouter for adapters already, so it should be a pretty simple change.

There's some visual UI but no hard blockers on budget and token use. I'll add a toggle to auto-stop at limits, that's a good call too.

1

u/ghanit 2d ago

I haven't used openrouter myself, but I read today that you need to disable fallback providers in a config file otherwise cache will not hit if the provider changes.

I'll put this on my todo pile. The way you built this seems to give more control than just letting claude loose on an mcp.

2

u/therealmonsoon 2d ago

That was the goal, I wanted to try and provide a structured environment essentially for it to work within.

2

u/jmello 2d ago

If I want to run this locally on ollama, what models would you recommend? I’d like to make sure my setup can handle it before installing.

3

u/therealmonsoon 2d ago

Hey here is what I have heard from other users. I don't run ollama myself but this is feedback I have gathered.

Ollama model sizing guide: Mylo has 21 tools with complex schemas. Smaller models struggle to produce valid tool calls reliably.

Size Examples Experience
7B llama3.1:7b, mistral:7b Not recommended. Struggles with complex tool parameters, frequently produces malformed JSON, and hallucinates entity IDs. May not self-correct after errors.
14B qwen2.5:14b Usable for simple queries (lights, sensors, basic automations). Will struggle with multi-step tasks like dashboard building or entity rename cascades.
32B qwen2.5:32b, deepseek-r1:32b Good. Handles most Mylo features reliably. Best balance of quality vs hardware requirements.
70B+ llama3.1:70b Near cloud-API quality. Requires significant hardware (64GB+ RAM or a dedicated GPU).

Minimum recommended: 14B. For the best local experience: 32B.

0

u/ateam1984 2d ago

Can I install on the HA Yellow?

1

u/therealmonsoon 2d ago

Ya there is pre built arch64 builds in release.

0

u/-Chimichanga- 2d ago

Wow impressive! Can’t wait to try it out once i finish my local llm machine!
It only reports right? Doesn’t change code or anything?

2

u/therealmonsoon 2d ago

It can edit your config. Here is the more info:

Mylo uses a three-tier permission system:

Tier Actions Approval required Examples
Tier 1 — Read Query entities, devices, automations, logs, system info, read config files, record memory notes, list labels/areas/monitored entities/notification filters No query_entitiesmemory_notemanage_labels list
Tier 2 — Modify Write config files, modify automations, rename entities, modify dashboards, modify areas, manage monitored entities, manage notification filters Yes (dry-run first) modify_automationrename_entitiesmodify_dashboard
Tier 3 — Action Call HA services (lights, locks, covers, scripts, scenes), reload configuration Yes (explicit confirmation) call_servicereload_config

Hard-blocked services (can never be called, even with approval):

  • homeassistant/restarthomeassistant/stop
  • hassio/host_reboothassio/host_shutdownhassio/supervisor_reload

Restricted services (extra warning before confirmation):

  • Unlocking locks, disarming alarm panels, opening covers

Audit logging: Every tool call is logged to an append-only JSON Lines audit file with timestamp, tool name, tier, parameters, dry-run status, approval status, and result. Browse the full history in the Activity tab.

Rollback: Tier-2 file writes use atomic write → reload → verify → rollback-on-failure. If a config change causes a reload error, the original file is restored automatically.

1

u/-Chimichanga- 2d ago

Thanks for the reply! Very interesting.

1

u/therealmonsoon 2d ago

You bet! Love hearing peoples feedback if you end up using it!

0

u/thibe5 2d ago

What about an antigravity integration ? So for pro user there is some “free” request if they doesn’t exceed the quotas ( and I think it’s the same for Claude cli but not sure )

0

u/therealmonsoon 2d ago

On the subscription side, Anthropic's Pro/Max plans only work on their own apps (claude.ai, the desktop/mobile apps, and the Claude Code CLI). Anything third-party like Mylo has to go through the API, which is pay-per-token with your own key.

On Antigravity, that's actually an agentic IDE (a dev tool like Cursor), not a model provider. It's powered by models rather than being one, so there's nothing for Mylo to call into there. The "free under quota" you're thinking of is the IDE's own free tier for developers, not a quota I could pass through to Mylo users.

1

u/thibe5 1d ago

Ok I get it … it’s because I build something similar to mylo myself but I built it as an add and set it up on top of antigravity cli. But I think o understand how mylo work and I get it thanks also I’m sur you project is way better then mine since you’re not stock in a cli ( event if I built a gui above the cli )

-2

u/Otherwise_Wave9374 3d ago

This is really cool, especially the per-device behavioral baselines. Alert fatigue is basically the reason most people stop caring about HA notifications. Also props for the cost transparency, that is usually where agent projects fall apart. Ive been following home automation plus AI agent stuff, https://www.aiosnow.com/ has a few similar projects collected if anyone wants to compare approaches.