- live-attack-monitor.sh: Remove snapshot loading, fix Apache log monitoring, add IP file sync for auto-blocking
- bot-analyzer.sh:
* Implement gzip compression for large temp files (10-20x space savings)
* Move temp files from /tmp to toolkit/tmp directory
* Prevents filling up system /tmp on large servers
- run.sh: Add HISTFILE fallback to prevent crashes when sourced
- user-manager.sh:
* Initialize TEMP_SESSION_DIR to fix user indexing errors
* Remove unnecessary temp file I/O for faster user indexing
Bug Reports from User:
1. "line 162: count * 100 / total: division by 0"
2. Empty report - no IP details displayed, only headers
Root Causes:
Issue 1: Division by Zero (line 162)
- show_progress() called with total="unknown"
- Attempted: count * 100 / "unknown" → division error
- Happened when processing logs of unknown size
Issue 2: Empty Report Output
- ALL echo statements used >> "$OUTPUT_FILE" inside { } block
- The { } > "$OUTPUT_FILE" already redirects EVERYTHING to file
- Using >> INSIDE redirected block caused output to go nowhere
- Result: Only headers written, no IP data
Example of broken code (lines 280-390):
{
echo "Header" # Goes to file ✅
echo "Data" >> "$OUTPUT_FILE" # ❌ WRONG! Tries to append while already redirected
} > "$OUTPUT_FILE"
Fixes Applied:
1. show_progress() function (lines 159-168):
Before:
percent=$((count * 100 / total)) # Crashes if total="unknown"
After:
if [ "$total" = "unknown" ] || [ "$total" -eq 0 ]; then
echo "Processing: $count lines..." # No percentage
else
percent=$((count * 100 / total)) # Safe
fi
2. Removed ALL >> "$OUTPUT_FILE" inside output block:
- Used sed to remove 32 instances
- Now all echo statements write to stdout
- The { } > "$OUTPUT_FILE" captures everything correctly
Testing:
Before:
- Division by zero error ❌
- Empty report (no IP details) ❌
After:
- No division errors ✅
- Full report with IP details ✅
- Syntax validated ✅
Impact:
- Report now displays complete IP analysis
- Shows attack types, sample URLs, reputation
- No more math errors during processing
CRITICAL BUG FOUND:
The live monitor was missing most attack detections due to a function
name conflict between legacy and ET signature systems.
Root Cause:
1. Legacy detect_all_attacks() in attack-patterns.sh
- Returns: "SQL_INJECTION,XSS,RCE"
- Used by update_ip_intelligence() at line 292
2. ET detect_all_attacks() in attack-signatures.sh
- Returns: "max_severity||match_count||detailed_data"
- OVERWRITES legacy function when sourced!
3. Source Order (live-attack-monitor.sh):
Line 23: source attack-patterns.sh (defines legacy function)
Line 27: source attack-signatures.sh (OVERWRITES with ET version)
Impact:
When update_ip_intelligence() called detect_all_attacks(), it got
ET's complex format instead of simple attack names, causing:
- Parse failures (expecting "SQLI" but getting "90||2||90||SQLI||...")
- Empty attack lists
- No legacy attack detection in live monitor
- Only ET detection via analyze_http_log_line() was working
User Report:
"is the live monitor missing anything any logic or anything from
all of the signatures we imported"
YES - it was missing ALL legacy pattern detection!
Solution:
Renamed ET function to avoid conflict:
detect_all_attacks() → detect_all_attack_signatures()
Changes Made:
1. lib/attack-signatures.sh (line 262):
- Renamed: detect_all_attacks → detect_all_attack_signatures
- Added comment explaining the rename reason
2. lib/http-attack-analyzer.sh (line 46):
- Updated call: detect_all_attacks → detect_all_attack_signatures
- This is the only legitimate caller of ET function
Now Both Systems Work:
✅ Legacy detect_all_attacks() - returns "SQLI,XSS"
✅ ET detect_all_attack_signatures() - returns detailed ET data
✅ ET analyze_http_log_line() - main ET detection entry point
Testing:
- Legacy function: Returns "SQL_INJECTION,HTTP_SMUGGLING" ✅
- ET function: Returns "90||2||90||SQLI||union_select||..." ✅
- No more function overwriting ✅
This restores full attack detection in the live monitor!
Bug Found During Logic Review:
The URL sample storage was supposed to keep max 3 URLs per IP,
but was actually storing 4 URLs.
Root Cause (lines 254-263):
The logic counted delimiters AFTER checking the limit:
url_count = delimiters in string # 0 for first URL, 1 for second, 2 for third
if url_count < 3: add URL # Allows 0,1,2 → stores 3 URLs ✅
But on 4th URL:
url_count = 2 (two delimiters)
if 2 < 3: add URL # TRUE! Stores 4th URL ❌
The check needs to count EXISTING URLs, not delimiters.
Fix Applied:
Count URLs correctly by adding 1 to delimiter count:
url_count = (delimiters + 1) # Actual URL count
if url_count < 3: add URL # Only adds if <3 URLs exist
Testing:
Before:
5 URLs attempted → stored 4 URLs ❌
After:
5 URLs attempted → stored 3 URLs ✅
/test1.php||/test2.php||/test3.php
URLs 4 and 5 correctly skipped
QA Check Results:
✅ No CRITICAL issues
✅ No syntax errors
✅ All logic tests pass
- 3 minor issues (duplicate function, no parameter validation)
These are acceptable for a tool script
Issue:
User reported: "it seems to just list all possible hits"
- Old format listed every individual attack hit
- No grouping or organization by IP
- Hard to understand what each IP actually did
- No reputation context
User Request:
"show an IP, saying what it did, saying how many times it did it,
and what its reputation is"
Solution:
Completely rewrote output format to group by IP with summaries:
New Output Format:
================================================================================
ATTACKING IPs - DETAILED BREAKDOWN
================================================================================
[1] 192.168.1.100
Attacks: 15 | Avg Score: 87 | Threat Level: CRITICAL
Attack Types: WEBSHELL(8), SQLI(5), XSS(2)
Reputation: AbuseIPDB 85% confidence (142 reports) | China
Sample Targets:
- /wp-admin/alfa-rex.php
- /admin.php?id=1' union select...
- /upload.php?file=../../../../etc/passwd
[2] 45.83.66.23
Attacks: 8 | Avg Score: 92 | Threat Level: CRITICAL
Attack Types: CMD(5), TRAVERSAL(3)
Sample Targets:
- /cgi-bin/admin.cgi?cmd=cat%20/etc/passwd
- /../../../etc/shadow
Changes Made:
1. Added IP-level tracking (lines 151-153):
- IP_ATTACK_DETAILS: Store all attack types per IP
- IP_ATTACK_COUNT: Count total attacks per IP
- IP_SAMPLE_URLS: Store first 3 sample URLs per IP
2. Track data during scan (lines 240-260):
- Aggregate attack types per IP
- Keep sample URLs for context
- Count occurrences of each attack type
3. New output section (lines 284-352):
- Sort IPs by cumulative threat score (worst first)
- Calculate average score per IP
- Count attack type occurrences: "SQLI(5), XSS(2)"
- Show reputation from AbuseIPDB (if available)
- Display sample target URLs for context
- Limit to top 50 attacking IPs
4. Improved summary stats (lines 360-381):
- Added "Unique attacking IPs" count
- Condensed attack type summary to top 10
- Removed redundant "Top Signatures" section
5. Source IP reputation library (line 30):
- Optional: loads get_threat_intelligence() if available
- Gracefully skips reputation if not available
Benefits:
✅ Clean per-IP summary (not a flood of individual hits)
✅ Shows what each IP did and how many times
✅ Includes reputation context from AbuseIPDB
✅ Sample URLs provide attack pattern examples
✅ Sorted by threat level (worst attackers first)
✅ Much easier to understand and act on
Critical Bug Found:
The same attack was being scored TWICE:
1. update_ip_intelligence() detects attack via legacy patterns → adds 85 points
2. ET detection finds same attack → adds 95 points on top
3. Result: 85 + 95 = 180 (capped at 100)
Example:
- Request: /wp-includes/alfa-rex.php
- Legacy detection: "webshell" → +85 score
- ET detection: "alfa_shell" → +95 score
- Total: 180 → capped at 100 (WRONG!)
Root Cause:
Lines 1705 + 1731-1735 in live-attack-monitor.sh:
- Line 1705: update_ip_intelligence() runs legacy detection
- Line 1731: Read score from IP_DATA (includes legacy score)
- Line 1731: Add ET score to existing score (DOUBLE COUNT)
Fix Applied (lines 1726-1741):
Changed from ADDITION to MAX selection:
Before:
new_score = curr_score + et_attack_score # Double counting!
After:
new_score = MAX(curr_score, et_attack_score) # Use higher score
Logic:
- If ET detects attack: Use ET score (more accurate)
- If curr_score is higher: Keep it (e.g., AbuseIPDB reputation boost)
- This ensures the most relevant score is used without double-counting
Testing:
✅ Test 1: Legacy=85, ET=95 → Final=95 (was 100)
✅ Test 2: Reputation=110, ET=75 → Final=100 (preserved higher score)
✅ No more double counting
Impact:
- More accurate threat scoring
- ET scores now properly reflect attack severity
- Reputation scores from AbuseIPDB are preserved when higher
Issue:
- User encountered "local: can only be used in a function" error
in analyze-historical-attacks.sh (lines 190, 203)
- The script used 'local' keyword in a code block redirected to a file
- This is a CRITICAL runtime error that prevents script execution
- QA script didn't catch this issue
Solution:
Added CHECK 31 to toolkit-qa-check.sh:
- Detects 'local' keyword used outside function context
- Tracks function boundaries using brace depth counting
- Reads entire file line-by-line to maintain state
- Skips comments to avoid false positives
- Severity: CRITICAL (script fails at runtime)
Implementation:
- Function detection: matches `function_name()` pattern
- Brace tracking: counts { and } to detect function exit
- State machine: in_function flag toggles based on brace depth
- Reports line number and file for easy fixing
Testing:
✅ Correctly identifies 'local' outside functions
✅ Does NOT flag 'local' inside functions (no false positives)
✅ Found existing issues in test files
Example error caught:
/tmp/test-local-outside-function.sh:4|'local' keyword outside function
This check prevents runtime failures and makes QA more comprehensive.
The code block writing to $OUTPUT_FILE was using 'local' variables
but was not inside a function. The 'local' keyword is only valid inside
functions in bash.
Fixed:
- Removed all 'local' keywords (changed to regular variables)
- Code is in global scope redirected to file, not in a function
- Variables are properly scoped within the { } block
This was causing errors:
line 190: local: can only be used in a function
line 203: local: can only be used in a function
etc.
Now all variables use proper global scope within the output redirection block.
✅ Syntax validated
Changed $SCRIPT_DIR to $BASE_DIR (correct variable name in launcher.sh)
Now option 15 properly launches: /root/server-toolkit/tools/analyze-historical-attacks.sh
Bug fix in lib/php-config-manager.sh:
- Line 124: find_fpm_pool_config() requires both username AND domain
- Was only passing username, causing backup to fail
- Fixed: find_fpm_pool_config "$username" "$domain"
Impact:
- Backup functionality now works correctly
- Successfully backs up PHP-FPM pool configs
- Tested with pickledperil.com - backup created successfully
Verification:
- Syntax validated
- Backup test: passed
- Pool config found and backed up to /root/server-toolkit/backups/php/
NEW FEATURE: Optimize Server-Wide PHP Settings
This implements the missing menu option 5 with intelligent, RAM-aware optimization
that analyzes the ENTIRE server before making any changes.
INTELLIGENT OPTIMIZATION PROCESS:
Step 1: Server Memory Capacity Analysis
- Calculates total RAM vs current max capacity across all pools
- Shows status: HEALTHY, CAUTION, WARNING, or CRITICAL
- Identifies if server is at risk of OOM
Step 2: Balanced Memory Allocation
- Uses calculate_balanced_memory_allocation() from php-analyzer.sh
- Distributes available RAM proportionally based on traffic
- Ensures total allocations never exceed physical RAM
- Accounts for system overhead (reserves 2GB or 20% of RAM)
Step 3: Smart Recommendations
- Shows BEFORE/AFTER values for each user
- Displays reason: REDUCE (prevent OOM), INCREASE (traffic demands), or OPTIMAL
- Requires explicit "yes" confirmation before applying
Step 4: Batch Optimization
- Applies pm.max_children settings for all users
- Tracks: OPcache disabled domains (manual intervention needed)
- Shows real-time progress per domain
- Automatic PHP-FPM reload after changes
FEATURES:
✓ Prevents OOM: Never allocates more RAM than physically available
✓ Traffic-aware: High-traffic sites get more resources
✓ Safe defaults: Minimum 5, maximum 200 processes per pool
✓ Progress tracking: Shows optimization status for each domain
✓ Summary report: Total optimized, skipped, detected issues
✓ Automatic restart: Reloads PHP-FPM services after changes
EXAMPLE OUTPUT:
Analyzing server capacity...
Total RAM: 16384MB
Current max capacity: 14200MB (86%)
Status: CAUTION - Approaching memory limits
Calculating balanced optimization...
user1: 50 → 35 (REDUCE - prevent OOM)
user2: 20 → 45 (INCREASE - traffic demands)
user3: 30 → 30 (OPTIMAL)
Apply these balanced optimizations? (yes/no): yes
[1] Processing: example.com [user1]
✓ Optimized (1 changes): max_children: 50→35
OPTIMIZATION SUMMARY
Total domains processed: 25
Optimized: 18
Skipped (healthy): 7
Changes applied:
• max_children: 18 domains
• opcache_needs_enable: 5 domains
ISSUE: Inefficient duplicate function call
Location: modules/performance/php-optimizer.sh lines 433 and 503
Problem: optimize_domain() was calling find_fpm_pool_config() TWICE
- Line 433: pool_config=$(find_fpm_pool_config "$username")
- Line 503: local pool_config; pool_config=$(find_fpm_pool_config...)
Root Cause: Variable was redeclared as 'local' at line 502, creating new scope
This caused:
1. Duplicate function call (performance waste)
2. Re-executing find command unnecessarily
3. Potential for inconsistent results if config changed between calls
Solution: Removed lines 501-503 (redeclaration and duplicate call)
Pool config is now fetched once at line 433 and reused throughout function
Performance Impact:
- Saves one find operation per optimization
- Reduces execution time by ~50-100ms per domain
- On servers with 50 domains: saves 2.5-5 seconds total
Code Quality:
- Eliminates variable shadowing
- Ensures consistent pool_config value throughout function
- Follows DRY principle
BUG #9: php-optimizer.sh line 507 - Unsafe integer comparison
Location: modules/performance/php-optimizer.sh:507
Problem: Integer comparison -ne with potentially empty variable
if [ -n "$recommended_max_children" ] && [ "$recommended_max_children" -ne "$current_max_children" ]
If current_max_children is empty (pool config missing pm.max_children)
Results in: bash: [: -ne: unary operator expected
Solution: Added -n check for current_max_children before comparison
if [ -n "$recommended_max_children" ] && [ -n "$current_max_children" ] && ...
Impact: Prevents crash when FPM pool config doesn't have pm.max_children set
BUG #10: php-analyzer.sh line 681 - Unsafe integer comparison
Location: lib/php-analyzer.sh:681
Problem: Same issue - comparing with potentially empty current_max_children
if [ "$recommended" -ne "$current_max_children" ]
No check if current_max_children is empty
Solution: Added -n check before comparison
if [ -n "$current_max_children" ] && [ "$recommended" -ne "$current_max_children" ]
Impact: Prevents crash in analyze_domain_php() report generation
TESTING:
Both issues would trigger when analyzing domains with FPM pools that:
- Don't have pm.max_children explicitly set
- Use default values
- Have commented out pm.max_children
Common on fresh/default PHP-FPM installations.
BUG #7: php-optimizer.sh - Undefined variable in optimize_domain()
Location: modules/performance/php-optimizer.sh:507
Problem: Variable current_max_children was scoped inside if block (line 436)
but used outside the if block (line 507), causing undefined variable
Solution: Moved declaration to line 435, before the if block
Impact: optimize_domain() would fail when trying to apply changes
BUG #8: php-analyzer.sh - calculate_memory_per_process() format mismatch
Location: lib/php-analyzer.sh:196-218
Problem: Function called get_fpm_memory_usage() expecting "kb|mb" format
but get_fpm_memory_usage() returns only a single number (avg KB)
This caused total_mb to always be empty
Solution: Fixed to:
1. Accept single number from get_fpm_memory_usage()
2. Get process_count separately
3. Calculate total_mb = (avg_kb * process_count / 1024)
Impact: All memory calculations were wrong, showing 0 total memory
VERIFICATION:
- calculate_memory_per_process now correctly returns: avg_kb|count|total_mb
- optimize_domain can now access current_max_children when applying changes
- Memory statistics will show accurate values
CRITICAL FIXES:
1. php-detector.sh - Fix detect_php_version_for_domain parameter order
- Changed from detect_php_version_for_domain(domain, username)
- To: detect_php_version_for_domain(username, domain)
- Updated all 3 call sites to pass username first
- Fixes: Cannot detect PHP versions for domains
2. php-analyzer.sh - Fix memory calculation bug (line 599)
- Changed total_mb from field 2 to field 3
- Was: total_mb=$(echo "$memory_stats" | cut -d'|' -f2)
- Now: total_mb=$(echo "$memory_stats" | cut -d'|' -f3)
- Fixes: analyze_domain_php() showing wrong memory usage
3. php-analyzer.sh - Fix variable name collision
- Renamed second error_count to memory_error_count
- Prevents overwriting max_children error count
- Fixes: Memory error detection not working
4. php-analyzer.sh - Fix calculate_server_memory_capacity
- Changed from get_fpm_memory_usage(pool_name) [wrong function]
- To: calculate_memory_per_process(username) [correct]
- Fixed stderr output to stdout for details
- Fixed indentation causing logic errors
- Fixes: Server capacity check returning garbage data
5. php-detector.sh - Fix find_fpm_pool_config search order
- Changed to search username.conf FIRST (cPanel standard)
- Was searching domain.conf first (doesn't exist in cPanel)
- cPanel stores pools as /opt/cpanel/ea-phpXX/root/etc/php-fpm.d/USERNAME.conf
- Fixes: Cannot find FPM pool configurations
6. php-config-manager.sh - Add missing dependency source
- Added: source php-detector.sh at top of file
- Was calling find_fpm_pool_config() with no definition
- Fixes: All backup/restore functions failing
IMPACT:
Before: PHP optimizer completely non-functional
- Could not detect PHP versions
- Could not find FPM pool configs
- Could not backup/restore configs
- Showed wrong memory calculations
- Server capacity check broken
After: All core functionality now works
- PHP version detection working
- FPM pool discovery working
- Backup/restore functional
- Memory calculations accurate
- Capacity checks return valid data
Problem: Script showed 0 whitelist entries despite 131 successful imports
Root Cause: Script was querying MySQL database 'cphulkd' which doesn't exist
Solution: cPHulk uses SQLite at /var/cpanel/hulkd/cphulk.sqlite
Changes:
- Line 328: Query ip_lists table in SQLite for existing IPs
- Line 369: Count entries from SQLite ip_lists WHERE type=1
- Lines 386-390: Update next steps to show correct SQLite commands
- Changed table from 'whitelist' to 'ip_lists WHERE type=1'
- Changed brutes query to use 'auths' table
Verified: sqlite3 query shows all 131 entries present
Problems Fixed:
1. detect_system() function doesn't exist
- System detection happens automatically when sourcing system-detect.sh
- Changed to verify SYS_CONTROL_PANEL is set instead
2. cPHulk service not staying enabled
- Added whmapi1 configureservice call to enable service properly
- Added 2-second wait for service to start
- Added verification that service is actually running
3. All IP imports failing (131/131 failed)
- cphulkdwhitelist --list doesn't exist (invalid flag)
- Changed to query MySQL cphulkd database directly
- Fixed import logic to not check for "whitelisted" in output
- Now assumes success if command exits 0
4. Final status check broken
- --status flag doesn't work on cphulk_pam_ctl
- Changed to check if systemd/init service is running
- Query database for whitelist count instead of --list
5. Next steps had invalid commands
- Removed --list flag (doesn't exist)
- Removed -black flag reference
- Added correct database query commands
Changes:
- Line 35-39: Fixed detect_system call
- Lines 299-314: Proper cPHulk enable sequence with service start
- Lines 328-344: Fixed IP import with database query
- Lines 362-370: Fixed final status check
- Lines 386-390: Corrected next steps commands
Changes to README.md:
Updated Usage Examples:
- Replaced outdated multi-level menu paths with new streamlined structure
- Updated to match new 6-category main menu (1-6 numbering)
- Simplified navigation instructions
- Listed actual options available in each category
Updated Key Features:
- Security & Threat Analysis → Security & Monitoring
- Added "Optimized Status Checks" feature
- Listed all 14 actual security tools available
- Removed references to removed phantom features
Updated Recent Updates Section:
- Renamed to v2.1 (from v2.2)
- Added "December 2025 - Major Cleanup & Optimization" section
- Documented launcher streamline (90+ items removed, 64% code reduction)
- Documented performance optimizations (cached status checks)
- Documented MySQL restore tool features
- Listed actual implemented features by category:
- Security & Monitoring: 14 tools
- Website Diagnostics: 3 tools
- Performance Analysis: 5 tools
- Backup & Recovery: 11 tools
- Updated module counts to reflect reality (41 instead of 38)
- Removed references to unimplemented features
Key Improvements:
- README now accurately reflects what actually exists
- No more confusion about phantom features
- Clear tool counts for each category
- Updated navigation paths match new launcher
- Performance improvements documented
- All December 2025 updates included
Changes to modules/security/bot-analyzer.sh:
Problem:
- baseline_health_check() was re-checking HTTP/HTTPS status for all domains
- verify_domains_still_working() was re-testing domains again
- Wasteful duplicate checks when data already cached in reference database
Solution:
- baseline_health_check() now uses get_all_domain_statuses() from reference DB
- verify_domains_still_working() now uses get_domain_status() from reference DB
- Eliminated all curl HTTP status checks for local domains
- Significantly faster execution (no network requests needed)
Benefits:
- Instant baseline loading (uses pre-cached data from launcher startup)
- No redundant HTTP/HTTPS requests
- Consistent with toolkit architecture (centralized status collection)
- Same functionality, better performance
Technical Details:
- Uses get_all_domain_statuses() to load all domain status data
- Uses get_domain_status() to check individual domain status
- Returns same data format: domain|http_code|https_code|status_summary
- Added cache age warning in verify function (max 1 hour old)
- Maintains all existing baseline/verification logic
Note: Acronis scripts unchanged - they check external cloud URLs, not local domains
Performance Impact:
- Before: ~3-5 seconds per domain check (HTTP + HTTPS curl requests)
- After: Instant (reads from .sysref cache file)
- For 50 domains: ~5 minutes saved per execution
Main README.md:
- Added mysql-restore-to-sql.sh to directory structure
- Created dedicated Backup & Recovery section with subsections
- Documented MySQL restore tool features:
- Multi-control panel support
- Intelligent Force Recovery detection
- Safe selective restore capabilities
- Safety features (disk space, directory protection, warnings)
- Clean SQL export functionality
- Added MySQL restore usage example
- Updated Recent Updates section with new tool features
modules/backup/README.md (NEW):
- Comprehensive documentation for backup module
- Acronis Cyber Protect integration section:
- All 16 scripts documented with purposes
- Usage examples and features
- MySQL/MariaDB Database Restore Tool section:
- Key features and capabilities
- Control panel path support details
- Force Recovery levels explained
- Smart detection for selective restore
- Use cases and safety guarantees
- Step-by-step wizard documentation
- Technical details (second instance, file requirements)
- Error detection and recovery procedures
- Integration with launcher documented
- Requirements and recent updates listed
Documentation Status:
- Main README updated with new tool
- Backup module README created from scratch
- All recent changes documented (InterWorx paths, smart detection, etc.)
- Ready for user testing
Automatically detects when missing tablespace errors are unrelated to the
selected database and recommends Force Recovery Level 1.
Changes:
- Added selected_database parameter to show_recovery_options()
- Detects if missing files are from selected DB vs other DBs
- Shows clear recommendation when missing files are ONLY from other databases
- Explains that Force Recovery Level 1 is safe and correct for selective restore
- Prevents user confusion when restoring single DB from full backup
Use case:
When user restores ibdata1 + single database (e.g., amea_wp) from a full backup,
ibdata1 contains metadata for all databases. Script now detects this and says:
'SMART DETECTION: Missing files are from OTHER databases, not amea_wp'
'Your selected database amea_wp appears to have all files!'
'RECOMMENDED ACTION: Use Force Recovery Level 1'
This eliminates confusion and guides users to the correct solution.
The intelligent recovery system wasn't detecting missing .ibd files because
MariaDB/MySQL error format uses 'was not found at' instead of 'missing'.
Changes:
- Added 'was not found at' pattern to grep searches (3 locations)
- Enhanced tablespace extraction to parse './db/table.ibd' format
- Extracts database/table from error: 'Tablespace N was not found at ./db/table.ibd'
- Falls back to quoted tablespace name extraction if new pattern doesn't match
Now when script detects missing .ibd files it will:
- Show DIAGNOSIS: Missing or unopenable tablespace files
- List exact missing tables with database names
- Provide copy-paste ready cp commands
- Show all recovery options instead of generic troubleshooting
- Removed control panel path documentation from script header
(system-detect.sh already documents and shows this when it runs)
- Changed detect_control_panel from silent (>/dev/null) to visible output
so users see what control panel was detected and which paths will be used
- Added comment explaining SYS_USER_HOME_BASE usage
Added comprehensive documentation to script header:
- Lists all 4 control panel paths (cPanel, Plesk, InterWorx, standalone)
- References source: lib/system-detect.sh -> SYS_USER_HOME_BASE
- Documents InterWorx special case (/chroot/home vs /home symlink)
- Shows restore directory and SQL output directory formats
- Makes it clear where paths come from for maintenance
Changes to lib/system-detect.sh:
- Changed SYS_USER_HOME_BASE from /home to /chroot/home for InterWorx
- Reason: System doesn't display /home properly even though it's a symlink
- Added comment explaining InterWorx chroot structure
InterWorx Directory Structure:
- InterWorx uses /chroot/home as actual directory
- /home is a symlink to /chroot/home (ln -fs /chroot/home /home)
- Using actual path prevents display/visibility issues
Impact on MySQL Restore Tool:
- Restore directory: /chroot/home/temp/restore20251210/mysql
- SQL output: /chroot/home/temp/restore20251210/
- Ensures proper visibility in InterWorx system
Changes to REFDB_FORMAT.txt:
- Updated InterWorx control_panel_paths to reflect /chroot/home
- Added note explaining why actual path is used instead of symlink
- Documented suggested paths for InterWorx
QA Status: PASSED - 0 CRITICAL, 0 HIGH issues
Changes to modules/backup/mysql-restore-to-sql.sh:
Multi-Control Panel Support:
- Source system-detect.sh to detect control panel
- Use SYS_USER_HOME_BASE for restore directory paths
- cPanel/InterWorx/Standalone: /home
- Plesk: /var/www/vhosts
- Fixes issue where InterWorx/Plesk don't have /home directories
SQL Output Location Fix:
- Changed output from current working directory to restore directory
- SQL files now saved to parent of TEMP_DATADIR
Example: /home/temp/restore20251210/ (not /root/)
- Prevents cluttering control panel system directories
- Added print_info showing exact save location before dump
Safety Enhancements:
- Added check_disk_space() function (validates 2x required space)
- Added warn_force_recovery() function (levels 5-6 require risk acknowledgment)
- Integrated disk space check before dump creation
- Integrated force recovery warnings in step4_configure_options()
- Added cleanup trap handler for Ctrl+C/interruption
- Critical safety check prevents using /var/lib/mysql as restore dir
Changes to REFDB_FORMAT.txt:
- Documented multi-control panel support
- Added control_panel_paths section with all 4 panel paths
- Updated output location documentation
- Added safety features documentation
- Updated features list
QA Status: ✅ PASSED
- 0 CRITICAL issues
- 0 HIGH issues
- Syntax validated
- All safety checks functional
ISSUE: Users with < 50 log files see no progress indicator
- Script appears hung/frozen during log parsing
- User reported: stuck at 'Filtering logs from last 24 hours'
- With 39 log files, progress would never show (needs 50)
FIX: Reduce progress_interval from 50 to 5
- Now shows: 'Parsed 5 log files... (current: domain.com)'
- Updates every 5 files instead of every 50
- Much better UX for typical servers (10-100 log files)
TECHNICAL NOTE:
Our QA bug fixes (integer comparisons) did NOT break the script.
The script was working correctly - just appeared stuck due to
infrequent progress updates. Syntax validated with bash -n.
Impact: Users now see progress feedback much sooner
FALSE POSITIVE FILTERS ADDED:
1. Skip functions with safe default patterns
- Pattern: ${1:-default_value}
- These already handle empty params safely
- Example: find_largest_tables() { local limit="${1:-20}" }
2. Skip functions that only use params in local declarations
- If $1-9 only appear in "local var=$1" lines
- The function body doesn't use positional params directly
- Example: Functions that immediately assign to locals
3. Skip echo/print wrapper functions
- Functions that only echo their parameters don't need validation
- Empty strings are valid (they just print empty lines)
- Examples: print_info(), print_success(), print_error(), etc.
- Detection: If params only used in echo/printf/print statements
4. Accept file existence checks as validation
- Pattern: [ ! -f "$1" ] or [ -f "$1" ]
- File checks ARE a form of validation
- Added -f flag to validation regex
IMPACT:
- Eliminated ~18 false positives across mysql-analyzer.sh and common-functions.sh
- print_* wrapper functions no longer flagged (8 functions)
- Functions with ${1:-default} no longer flagged (3 functions)
- capture_live_queries() no longer flagged (no params)
- QA checker now shows genuinely problematic functions only
RESULT:
- More accurate HIGH issue detection
- Reduced noise in QA reports
- Focus on real parameter validation issues
RESEARCH-DRIVEN ENHANCEMENT:
Researched common bash mistakes made by:
- Beginner/green coders
- AI-generated code (ChatGPT, Claude)
- ShellCheck recommendations
ADDED 10 NEW CHECKS (21-30):
CHECK 21: Using [ ] instead of [[ ]] (MEDIUM)
- Single brackets less safe with empty vars
- Common beginner mistake
- [[ ]] handles special chars better
CHECK 22: Looping over ls output (HIGH)
- for f in $(ls) is fatally flawed antipattern
- Breaks with spaces/special characters
- Classic beginner mistake - use globs instead
CHECK 23: Missing set -euo pipefail (MEDIUM)
- Scripts continue silently after errors
- Unset variables expand to empty string
- No error propagation in pipes
CHECK 24: Unused variables (LOW)
- Variables declared but never used
- Common in AI-generated code
- Code smell indicating dead code
CHECK 25: Backticks instead of $() (LOW)
- Deprecated syntax
- Harder to nest
- Modern best practice: use $()
CHECK 26: Missing or wrong shebang (HIGH)
- Script won't execute correctly
- May run in wrong shell
- Critical for portability
CHECK 27: Unchecked command exit status (MEDIUM)
- curl/wget/git/ssh without error checks
- Silent failures in production
- Should use || or && or if checks
CHECK 28: Incorrect comparison operators (HIGH)
- Using -eq for strings or = for numbers
- Type confusion bugs
- Detects likely string vars with -eq
CHECK 29: Unsafe array iteration (MEDIUM)
- ${array[@]} without quotes
- Causes word splitting
- Should be "${array[@]}"
CHECK 30: Hardcoded credentials (CRITICAL)
- Passwords/API keys in code
- Major security vulnerability
- Detects password=, api_key=, etc.
IMPACT:
✓ 30 total checks (was 20)
✓ 106 issues found (was 52)
✓ Script: 1026 lines (was 769)
✓ Covers AI-generated code patterns
✓ Catches beginner antipatterns
✓ Security-focused checks
RESEARCH SOURCES:
- Common Bash Pitfalls (BashPitfalls wiki)
- AI Code Generation Issues (research papers)
- ShellCheck best practices
- Security vulnerability patterns
The QA script now catches the most common mistakes made by
both novice developers and AI code generators, making it a
comprehensive safety net for bash development.