🚀 Full project sync: Hotels RAG & Audit System

Major Features:
- Complete RAG system for hotel website analysis
- Hybrid audit with BGE-M3 embeddings + Natasha NER
- Universal horizontal Excel reports with dashboards
- Multi-region processing (SPb, Orel, Chukotka, Kamchatka)

📊 Completed Regions:
- Orel Oblast: 100% (36/36)
- Chukotka Autonomous Okrug: 100% (4/4)
- Saint Petersburg: 93% (893/960)
- Kamchatka Krai: 87% (89/102)

🔧 Infrastructure:
- PostgreSQL with pgvector extension
- BGE-M3 embeddings API
- Browserless for web scraping
- N8N workflows for automation
- S3/Nextcloud file storage

📝 Documentation:
- Complete DB schemas
- API documentation
- Setup guides
- Status reports
Фёдор
2025-10-27 22:49:42 +03:00
parent 0cf3297290
commit 684fada337
94 changed files with 14891 additions and 911 deletions

backup_to_s3.sh Executable file

@@ -0,0 +1,26 @@
#!/bin/bash
# Back up the git repository to S3 (TWC Storage)
BACKUP_NAME="hotels_git_backup_$(date +%Y%m%d_%H%M%S).tar.gz"
BACKUP_DIR="/tmp"
echo "📦 Creating archive..."
# Abort if the project directory is missing, so tar never archives the wrong directory
cd /root/engine/public_oversight/hotels || exit 1
tar -czf "$BACKUP_DIR/$BACKUP_NAME" \
--exclude='venv' \
--exclude='embedding_env' \
--exclude='parser_env' \
--exclude='__pycache__' \
--exclude='*.log' \
--exclude='*.xlsx' \
.
echo "☁️ Uploading to S3..."
# Uncomment and add your own S3 settings:
# s3cmd put "$BACKUP_DIR/$BACKUP_NAME" s3://your-bucket/backups/
echo "✅ Archive created: $BACKUP_DIR/$BACKUP_NAME"
echo "📊 Size: $(du -h "$BACKUP_DIR/$BACKUP_NAME" | cut -f1)"
echo "📊 Размер: $(du -h "$BACKUP_DIR/$BACKUP_NAME" | cut -f1)"
# Delete the local archive after upload (optional)
# rm "$BACKUP_DIR/$BACKUP_NAME"
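
The script leaves both the upload and the cleanup commented out. One possible way to wire them together is sketched below, so that the local archive is deleted only after a successful upload. This is a sketch under assumptions, not part of the commit: it presumes `s3cmd` is installed and configured, and the names `upload_backup`, `S3_BUCKET`, and `S3_UPLOAD_CMD` are hypothetical.

```shell
#!/bin/bash
# Hypothetical helper (not in the commit): upload an archive with s3cmd and
# remove the local copy only if the upload succeeded.
# S3_BUCKET and S3_UPLOAD_CMD are assumed environment variables.
upload_backup() {
    local archive="$1"
    local bucket="${S3_BUCKET:-}"

    if [ -z "$bucket" ]; then
        echo "S3_BUCKET is not set; keeping $archive locally" >&2
        return 1
    fi
    if [ ! -f "$archive" ]; then
        echo "Archive not found: $archive" >&2
        return 1
    fi

    # S3_UPLOAD_CMD defaults to s3cmd; overriding it allows a dry run.
    if ${S3_UPLOAD_CMD:-s3cmd} put "$archive" "s3://$bucket/backups/"; then
        rm -- "$archive"
    else
        echo "Upload failed; keeping $archive" >&2
        return 1
    fi
}
```

At the end of the backup script this could be called as `S3_BUCKET=your-bucket upload_backup "$BACKUP_DIR/$BACKUP_NAME"`. Keeping the archive on failure means a broken upload never costs the only copy of the backup.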