Outages

Upcoming AiM System Upgrade: What You Need to Know

The Office of Information Technology (OIT) will be performing an essential upgrade to the AiM system this October. Please take note of the following details to prepare for the scheduled downtime.

Upgrade Schedule:

  • Start: Friday, October 18 at 5:00 PM
  • End: Sunday, October 20 at 9:00 PM

During this window, the AiM system, including AiM Mobile and all reports or interfaces that rely on AiM data, will be unavailable. Each department should review how this may affect its applications and workflows.

We recommend that department heads and team leads notify any employees who may be affected by this downtime to ensure minimal disruption to daily operations.

Questions or Concerns?

If you have any questions or need further assistance, please contact Jim Smith, Director of IT Operations Support and Process Development, at 205-348-5374 or via email at jhsmith7@ua.edu.

Thank you for your understanding and cooperation as we work to improve the AiM system for everyone at UA.

How OIT Resolved a Critical Incident with CrowdStrike and Enhanced Security Protocols

On July 19, 2024, at 4:09 UTC, The University of Alabama’s Office of Information Technology (OIT) faced a significant challenge when a defective CrowdStrike update caused a widespread disruption across campus systems. CrowdStrike, an industry-leading platform used by UA to protect servers from cyber threats, had released a Rapid Response Content update for Windows systems, which inadvertently caused many machines to crash and enter an endless reboot loop.

OIT personnel were alerted to server outages around midnight and swiftly mobilized a cross-functional team to investigate. Working through the early hours, OIT employees from the systems and security teams collaborated with CrowdStrike support and conducted independent research to find a solution. By 4:00 a.m., most systems had been restored. The team then worked with various campus units and resolved the remaining issues by 10:00 a.m.
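The 4:09 UTC release time and the "around midnight" alert are consistent once the UTC timestamp is converted to campus local time. The following is a minimal sketch of that conversion. It assumes Central Daylight Time is a fixed UTC-5 offset in July; the offset and the names `CDT` and `to_campus_time` are illustrative and do not come from OIT.

```python
from datetime import datetime, timedelta, timezone

# Assumption: campus local time in July is Central Daylight Time (UTC-5).
CDT = timezone(timedelta(hours=-5), name="CDT")


def to_campus_time(utc_dt: datetime) -> datetime:
    """Convert a timezone-aware UTC datetime to the assumed campus offset."""
    return utc_dt.astimezone(CDT)


# Release time of the defective content update, as stated in the text.
update_released = datetime(2024, 7, 19, 4, 9, tzinfo=timezone.utc)
local = to_campus_time(update_released)
print(local.isoformat())
```

Under that assumption, the update was released at 11:09 p.m. local time on July 18, shortly before the outages were noticed.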

What We’ve Done to Prevent Future Incidents

While OIT Security had previously set CrowdStrike to defer sensor updates for critical systems, this setting applied only to major sensor version updates, not to the nightly content updates responsible for this incident. Since the incident, CrowdStrike has introduced new controls that allow customers to defer content updates. UA has now adopted a staggered deployment strategy: updates are rolled out first to test systems, then to non-critical production systems, and finally to critical systems.
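The staggered order described above can be illustrated with a short sketch. This is not OIT's or CrowdStrike's actual tooling: the `Host` type, the ring names, and the `apply_update` and `is_healthy` callbacks are hypothetical, and the health gate between stages is an assumption about how such a rollout is typically guarded.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List

# Rollout order from the text: test systems, then non-critical production
# systems, then critical systems. The ring names themselves are hypothetical.
RING_ORDER = ["test", "non_critical", "critical"]


@dataclass(frozen=True)
class Host:
    name: str
    ring: str  # expected to be one of RING_ORDER


def staged_rollout(
    hosts: Iterable[Host],
    apply_update: Callable[[Host], None],
    is_healthy: Callable[[Host], bool],
) -> List[str]:
    """Update hosts ring by ring and stop if a ring is unhealthy afterwards.

    Returns the names of the hosts that received the update, in order.
    """
    hosts = list(hosts)
    updated: List[str] = []
    for ring in RING_ORDER:
        ring_hosts = [h for h in hosts if h.ring == ring]
        for host in ring_hosts:
            apply_update(host)
            updated.append(host.name)
        # Assumed gate: do not proceed to the next ring unless every host
        # in the ring just updated still reports healthy.
        if not all(is_healthy(h) for h in ring_hosts):
            break
    return updated
```

With a gate like this, an update that crashes the test systems never reaches non-critical or critical systems, which is the protection the staggered strategy is meant to provide.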

How CrowdStrike Has Enhanced Its Platform

In response to this incident, CrowdStrike has implemented a series of improvements, including:

  • Enhanced software testing: Using advanced testing techniques such as fault injection and stress testing to prevent similar issues.
  • Improved resilience: Strengthening error-handling mechanisms in the Falcon sensor to manage content-related errors gracefully.
  • Refined deployment strategy: Introducing a staggered rollout that begins with a small canary deployment, with increased monitoring of system performance during updates.
  • Third-party validation: Engaging independent reviewers to assess the quality of development and deployment processes.

The proactive measures taken by both OIT and CrowdStrike underscore a commitment to security, ensuring that UA’s critical systems are better protected from future risks.

Outage – CrowdStrike cybersecurity software

A global CrowdStrike outage early Friday morning is affecting the University of Alabama network. Known impacts currently include network connectivity in some areas of campus and the OnBase management platform. This outage was not caused by a security incident or cyberattack.

CrowdStrike’s cybersecurity software detects and blocks hacking threats. Like other cybersecurity products, it requires deep-level access to a computer’s operating system to scan for threats. In this case, computers running Microsoft Windows appear to have crashed because of a software code update issued by CrowdStrike interacting with the Windows system.

If you believe a UA-managed system you use is having issues, please submit a report to the IT Service Desk at ITSD@ua.edu or 205-348-5555. Updates will be provided on the OIT Service Status webpage.

Controlled Data Center Shutdowns December 22

OIT will briefly shut down both on-campus data centers for maintenance on Friday, December 22, from 7:30 a.m. to 9:30 a.m. In addition to intermittent internet outages, all services hosted on on-premises servers, such as Banner, Action Card, and myBama, will be unavailable during the maintenance period. If you have any questions or concerns, contact the IT Service Desk at 205-348-5555 or itsd@ua.edu.

Banner and Related Apps Unavailable October 27-28

OIT will update the platforms that Banner and related applications run on, starting Friday, October 27, at 7:00 a.m. These applications will remain unavailable until Saturday, October 28, at 10:00 p.m. The following services will be unavailable during the update period:

  • Banner Student Self-Service, including course registration (ability to register for new courses or drop courses); viewing student records such as transcripts, grades, and financial aid; updating personal contact information
  • Banner Employee Self-Service, including monthly leave reporting and viewing leave balances
  • myBama, DegreeWorks

If you have any questions or concerns, contact the IT Service Desk at 205-348-5555 or itsd@ua.edu.

VMware Storage Issue September 18, 2023

VMware has encountered a storage issue that is affecting some of the University’s VMware hosts. VMware runs the University’s virtual machines and servers. The issue began at 3:00 a.m., and OIT is working to resolve it. Visit status.oit.ua.edu for updates.