We are seeking a skilled and detail-oriented System Maintenance Engineer to oversee the day-to-day operational stability of our online car booking system. Your focus will be on maintaining uptime, monitoring logs, identifying and resolving technical issues, and ensuring the platform runs smoothly for customers booking vehicles through our website or mobile app.
You will also apply system updates and coordinate closely with technical teams to catch problems early and prevent downtime.
System Monitoring & Uptime Management
- Monitor the real-time health and availability of the car booking platform, including frontend, backend, and APIs.
- Select, deploy, and configure monitoring tools (e.g., Grafana, Datadog, or Prometheus), and set up and manage their alerts.
- Investigate slow response times, failed transactions, and booking errors.
Incident Response & Bug Fixing
- Analyze application logs and error reports to identify root causes of user-reported or system-detected issues.
- Fix bugs across the system stack (UI behavior, backend processing, database errors).
- Respond promptly to system outages and anomalies to restore functionality.
Software Updates & Maintenance
- Manage version upgrades for system libraries, services, and dependencies.
- Apply security patches and framework updates as needed.
- Maintain a version control process, including changelogs and rollback planning.
Log Analysis & Reporting
- Regularly review server and application logs to detect abnormalities or usage spikes.
- Generate performance and stability reports for internal use.
- Recommend improvements to monitoring and diagnostic practices.
Platform Integrity & Security
- Perform health checks and system audits to ensure secure and uninterrupted access to booking features.
- Help ensure data integrity for customer bookings, vehicle availability, and payment transactions.