TippingPoint Digital Vaccine Laboratories
DID YOU KNOW... The ZDI has published over 1100 high-risk vulnerabilities since the inception of the program.

Honeypotting the Cloud

What’s the cloud good for?

One of the challenges facings DVLabs today is the ability to have complete network data from attacking and compromised hosts from non-customer networks. To solve this problem, a honeypot infrastructure with instances running in various cloud infrastructures was created. Allowing passive collection of information about suspicious hosts and comparing attacks across geographic regions in a wide-open network setting are the main reasons for choosing cloud services. The data collected can then be to add more entries to our RepDV IP reputation system and assist in the collection of live malware samples. There are many options for running a honeypot, but mwcollectd’s feature set and python module support made it the most attractive option for our purposes. The two features that most interested us were the SMB / MS-RPC emulation that provide enough interaction with the attacker to allow them to fully establish MS-RPC and SMB sessions and then send their attack and shellcode analysis on the stream through libemu.

Where are these connections coming from?

Over the last few months the honeypots have collected a large amount of data from our honeypot systems and one honeypot will be the focus of this post. This system saw a total of 36,000 TCP connection attempts from 6600 unique IPs and averaged approximately 700 connections/day. The country of origin for the source addresses encompassed a large number of countries with Russia and United States having the most connection attempts.


Destination Port Data



Conficker still going strong on port 445

The honeypot saw traffic on 99 distinct ports, but two-thirds of the connections were over port 445. Shellcode detection said there was shellcode in 21% of the streams over port 445. The majority of the streams with shellcode were attempting to spread Conficker by exploiting the MS08-067 SMB vulnerability.

The data collected from the honeypot shows that Conficker is still extremely prevalent and the data collected on filter 6545 (login required) on ThreatLinQ reinforces the data collected from the honeypot. The source countries for filter 6545 graphic above shows that the sources are not as evenly distributed as the data seen from the honeypots. This is likely due to the IPS network being significantly larger and seeing a more complete picture than the honeypot network is currently seeing. Infected Conficker hosts exhibited interesting behavior if the user-agent of the retriever was not set to a valid Internet Explorer 7 or previous user-agent for Windows XP. These hosts would send junk data at slow speeds and time the connections out. Modifying mwcollectd to use this user-agent caused a sharp increase in the number of Conficker A/B/C binaries collected.

Other Ports

Attempts at authentication bypass and open proxy scanning comprised the majority of the hits on HTTP-related ports. The authentication bypass attempts appeared to be targeting insecure installations of PHPMyAdmin. None of the RDP connection attempts appeared to originate from hosts infected with the Morto RDP worm, they were mostly simple attempts to login as Administrator with an empty username and password (filter 11659). These attempts were similar to many of the MS-SQL hits that attempted to login as sa with an empty password. This is covered by filter 1397, but is not on by default because of the potential for false positives due to organizations intentionally using this configuration.


Malware Collection



The honeypots have also collected a large number of malware samples using mwcollect’s binary retrieval modules. These modules look for URLs for standard protocols when shellcode is detected in a stream and then attempt to retrieve them. A total of 17 unique Brambul.A samples were collected, but were mostly the same binary - including one sample that matched across 11 hosts. Conficker was by far the largest malware family collected with 89 unique samples across 210 total hits with one sample being matched 24 times. Using the collected samples and packet captures from the honeypots, the filters produced to provide coverage have been verified to protect against all the variants collected.


Conclusion

The cloud, just like any computer network, can be a dangerous place if you’re not careful. If you want to run sensitive systems in the cloud consider moving to a managed service level that will by default add firewall rules to restrict port access and provide better monitoring of access.

We are currently expanding our network and integrating the data we are collecting from our honeypot network into our RepDV IP Reputation service to increase customer protection from the suspicious hosts seen attempting connections to our honeypots. We will also use this data to supplement the data currently used in our semi-annual Cyber Security Risk Report. As we expand our network to encompass more widespread geographic locations we expect to continue to receive more interesting data that will continue to increase the protection we offer our customers.



Tags:
Published On: 2011-10-19 10:22:50

Comments post a comment

No comments.
Trackback