Campfire HQ

Is Campfire HQ Down Right Now? Check if there is a current outage ongoing.

Campfire HQ is currently Operational

Last checked from Campfire HQ's official status page

Historical record of incidents for Campfire HQ

Report: "Ranking System"

Last update
resolved

We have now resolved this issue. Ranking is now operational, though many cookies may have been expired due to a change pushed by Roblox.

identified

Campfire/Hyra ranking is currently not performing as expected due to a change with how Roblox manages cookies. We are working on a solution.

Report: "Issues with bots"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

We have identified the issue and are rolling out a fix.

identified

The issue has been identified and a fix is being implemented.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We're currently investigating an issue with our bot network

Report: "Delayed or no response on website"

Last update
resolved

This incident has been resolved.

identified

This issue is caused by an upstream internet provider issue. We will provide more information as we receive it.

Report: "Degraded ranking performance"

Last update
resolved

Roblox has recovered and Campfire Ranking requests are now operational.

identified

A Roblox issue causing decreased performance on the platform is impacting performance of Campfire Ranking.

investigating

We are currently investigating this issue.

Report: "Degraded ranking performance"

Last update
resolved

Roblox has recovered and Campfire Ranking requests are now operational.

identified

We're experiencing an elevated level of API errors due to a Roblox outage. The majority of requests are still going through, however a small percentage are failing.

identified

Ranking performance is currently degraded. We are looking into this issue and will provide an update shortly.

Report: "Security vulnerability"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

A fix has been implemented and these changes are now deploying across the CFHQ network.

identified

We are working hard to roll out a fix as soon as possible for this security bug.

Report: "Member Counters"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

An issue has automatically been identified with member counters. A team is now investigating the issue.

Report: "All systems unavailable"

Last update
resolved

We're marking this as resolved, further information can be found at https://www.cloudflarestatus.com/

investigating

There is currently an outage across Campfire products and services due to an upstream internet issue.

Report: "Ranking services down"

Last update
resolved

This incident has been resolved.

investigating

We are currently investigating this issue.

Report: "Member Counters not operating as expected."

Last update
resolved

This incident has been resolved.

identified

The issue has been identified and a fix is being implemented.

investigating

We are currently investigating this issue.

monitoring

A fix has been implemented and we are monitoring the results.

investigating

We are currently investigating an issue regarding member counters not sending correct or at a reduced interval. As this product continues to grow in size, we are finding it increasingly difficult to manage the 1,728,000+ requests per day we send to Roblox. We are working closely with our DevOps Team to resolve the situation as soon as possible and we will update you as required. We apologise about the inconvenience.

Report: "Roblox.com down"

Last update
resolved

This incident has been resolved.

monitoring

The site is now resumed. We are monitoring and will update if the situation changes.

identified

Issues are still occuring. We will update you as required.

monitoring

The site now back up. Our team are continuing to monitor and will keep you updated should issueso occur again.

identified

The site is now coming back online slowly.

identified

Roblox.com is currently down.

Report: "Member Counters"

Last update
resolved

This incident has been resolved.

monitoring

We are working hard on a fix.

monitoring

A fix has been implemented and we are monitoring the results.

identified

We’re going to test a fix in production now. The only caveat of this is that there will be an increased delay between the counting - i.e. a longer amount of time will pass before we check for new members.

identified

There is currently an issue where member counters may jump back and forth between two numbers. This is caused by a problem in the Roblox group count system, and we are looking into fixes for this problem. This service continues to operate as normal.

Report: "Member counters not operating"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

A fix will be rolling out very shortly.

identified

Issue has been identified and a fix is being implemented.

investigating

Our DevOps Team are continuing to investigate the issue.

investigating

DevOps Team are now working on a solution to this problem.

investigating

All hooks and the ability to create and manage hooks has been disabled until DevOps Team becomes available. Apologies about the inconvenience.

investigating

All DevOps team are currently unavailable. No expectation of resolution until DevOps Team become available again in 5-10 hours.

investigating

We are currently investigating this issue.

Report: "Issues with caching CDN"

Last update
resolved

This incident has been resolved.

monitoring

This issue may still be present for some users. We advise clearing your cache or using Ctrl + Shift + R / ⌘ + Shift + R

monitoring

We expect this to be resolved for all users by 00:00 UTC.

monitoring

A fix has been implemented and we are monitoring the results.

Report: "Billing System Not Activating Bots"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue was identified as a type error. We’re now rolling out a patch.

investigating

We are currently investigating this issue.

Report: "Ranking issues"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been deployed. We are now monitoring the fix.

identified

Our team are working on implementing rate limiting technology. We won’t ever hard rate limit, we’ll simply slow down your requests in transit. This is to prevent us from sending too many requests to Roblox’s servers. We will publish documentation to our Developer Docs when this has been implemented. Service will resume once this has been implemented. Thank you for your patience and cooperation.

identified

We've identified the root cause as a denial of service attack on our ranking service. We are taking security precautions to resolve the issue.

investigating

We have no ETA of when this can be resolved. The issues are beyond the scope of Campfire, and are with our providers.

investigating

We are continuing to investigate this issue.

investigating

We've temporarily paused this service as we are facing issues on our backend database. We work with our team to resolve the issue.

investigating

We are continuing to investigate this issue.

investigating

We're currently investigating an issue with Campfire ranking and 429 errors. We're working closely with our DevOps Team to resolve the issues as quickly as possible.

Report: "API Timeouts"

Last update
resolved

Resolved on our end.

monitoring

We've directed all traffic to our AMS3 server.

identified

There is currently issues with our Digital Ocean NYC1 server.

Report: "Error accessing portal"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and is now rolling out over our CDN.

identified

This issue has been identified as an SSL error. A fix is being implemented.

Report: "Outage"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

We are now going to be putting our public bot and all other client bots onto a different host, as we're becoming more and more reliant on another hosting provider instead of AWS.

identified

The root cause is our AWS EC2 Instance. It has stopped unexpectedly.

investigating

We are continuing to investigate this issue.

investigating

We are continuing to investigate this issue.

investigating

We are currently investigating this issue.

Report: "Bot Offline"

Last update
resolved

This incident has been resolved.

identified

We’re still working on a fix, as this appears to be caused by a bug with prefixes. We’re going to be changing the prefixes feature to improve stability and reliability, as it has caused quite a large majority of uptime over the last month.

identified

Bot is offline and we are working on a fix.

Report: "Bot not responding"

Last update
resolved

This incident has been resolved.

identified

We are continuing to work on a fix for this issue.

identified

This issue has been identified as high CPU utilisation.

investigating

We are currently investigating this issue.

Report: "šŸš’ Community Forum"

Last update
resolved

This incident has been resolved.

monitoring

We are continuing to monitor for any further issues.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We’re currently investigating issues with all users accessing our community forum.

Report: "Typeform 500 Error"

Last update
resolved

This looks to have been resolved now!

investigating

There's currently an issue with Typeform meaning that no users can access our Typeforms. We believe this is across the whole of Typeform. We will update you with more information soon.

Report: "Downtime"

Last update
resolved

This incident has been resolved.

monitoring

We are continuing to monitor for any further issues.

monitoring

A fix has been implemented and we are now monitoring our services to ensure that everything is working. For some customers it may take additional time due to a change in DNS.

identified

We’ve identified this issue as a change in our IP address. We are now working to resolve this.

investigating

We’re continuing to have issues with Campfire. Campfire apologises and is working to resolve the issue.

investigating

We are currently investigating this issue.

Report: "Bot not responding to commands"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

We believe this was a localised issue. We’ve identified the problem and are working to a resolution.

investigating

We’re currently investigating an issue where the bot is not responding to commands.

Report: "Elevated API response time"

Last update
resolved

This incident has been resolved.

monitoring

This issue looks to have fixed itself. We’re monitoring the bot to assure quality and performance.

investigating

We’ve checked with our whitelabel bots and they’re running a-ok, therefore we’re ruling out a connection problem.

investigating

We are currently investigating an issue affecting the latency of the bot. We’ll update you when we have more information. Live updates available via Twitter and https://cmpf.ml/status

Report: "Issue with Cronjobs"

Last update
resolved

This incident has been resolved.

monitoring

We’ve now restored all clients bots. We’ll monitor this over the next 24 hours and see what we get back.

identified

This issue has been identified as an issue with one of our dependencies. We’ve restarted the server and are in the process of restoring all clients bots

investigating

We are continuing to investigate this issue.

investigating

We’re restarting our main server to see if it resolves the issue.

investigating

We are currently investigating this issue.

Report: "Elevated Response Time"

Last update
resolved

This incident has been resolved.

identified

We’re currently experiencing elevated response time across all Campfire services whilst Discord experience issues. Live updates available on https://status.discordapp.com

Report: "Major Outage"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

We believe to have identified and issue and a fix is now being implemented.

investigating

This issue appears to have been present since 02:55AM (approx) UTC. We're extremely sorry and are still looking into solutions.

investigating

We're currently investigating the issue.

Report: "Downtime to main server"

Last update
resolved

This incident has been resolved.

identified

This issue has been identified as a lack of disk drive space on the hard-drive. We're now working to resolve the problem.

investigating

We are currently investigating this issue.

Report: "Issue with API."

Last update
resolved

We believe this to now have been resolved.

identified

This issue has now been identified and we are working with our partners to resolve it.

Report: "Campfire Downtime"

Last update
resolved

Recovered and expected to be a Discord API issue.

investigating

We are currently investigating this issue.

Report: "Discord - Server Outages and Increased API Errors"

Last update
resolved

This incident has been resolved.

identified

Campfire is experiencing issues due to a Discord outage. Please see further information here: https://status.discordapp.com/

Report: "Elevated response time"

Last update
resolved

This incident has been resolved.

monitoring

A fix has been implemented and we are monitoring the results.

identified

The issue has been identified and a fix is being implemented.

investigating

We are currently investigating this issue.

Report: "Bot Downtime"

Last update
postmortem

# Postmortem: Downtime šŸ”Ø Hi all, Campfire’s Development Operations Manager, Sam here with a postmortem for the issues experienced here. ## What went wrong? We’ve been looking at our history of downtime like this and we’ve come to the conclusion that we’re simply putting a little too much load on our London server. This likely is causing the server to ā€œburstā€ \(run out of CPU credits\) and therefore is powering down our instance. We’d have no issue believing this, but the issue is it’s not actually powering down. It’s becoming entirely unresponsive. We’ve taken screenshots via the AWS EC2 manager and the instance looks perfectly normal. ## What do we 100% know? We know that the server became overloaded \(at about 99%\) CPU utilisation and then went completely silent. This therefore leads us to believe it’s a performance issue. ## What are we doing about this? We’re doing a few things. 1. We’re moving our EC2 instance to America, so it has a quicker response time with the Discord gateway \(this therefore means less CPU time and better performance\) 2. We’re investing more money into compute power to increase our cloud footprint by at least 200%. 3. We’re trying out a different process manager. We previously were using forever. We’re now trialing PM2 based on a recommendation from a third party who develops bots and services in our space. We don’t anticipate to have any more issues like this, but naturally we may have some again in the future. We are trying our best and training staff on procedures for our downtime so we can handle it in the best way possible. Many thanks for your continued support.

resolved

All looks healthy! We're closing this incident now. So sorry about the issues this caused. A post mortem may be published soon.

monitoring

We are continuing to monitor for any further issues.

monitoring

All bots have been restored and we are now monitoring the issue to ensure that things run smoothly.

identified

We're now beginning to restore all bots running on this instance.

identified

The issue has been identified as a large amount of stress on our CPU causing it to terminate our instance. We're now looking into what caused this large CPU spike. CPU Graph: https://cdn.discordapp.com/attachments/552944213768011816/693491730477088858/unknown.png

investigating

This issue appears to have been present since UTC 13:55. We're now launching an investigation with our Engineering Ops Team. We'll keep you updated as this case develops. Our main priority is to restore service.

investigating

We are currently investigating this issue.

Report: "Bot not responding"

Last update
resolved

This incident has been resolved.

investigating

We're continuing to investigate this issue and will update you as required.

investigating

We're currently investigating an issue that's making the bot not respond to commands.