Journal of Shanghai Jiao Tong University (English Edition) ›› 2011, Vol. 16 ›› Issue (5): 567-570. doi: 10.1007/s12204-011-1190-8
QIU Wei-dong¹ (邱卫东), JIN Ling¹ (金凌), YANG Xiao-niu² (杨小牛), YANG Hong-wa² (杨红娃)
Abstract: Packet analysis plays an important role in modern digital communication. Protocol analyzers, however, are limited because they can only process data in a known, predefined format. This paper proposes a solution for decoding raw data in an unknown format. Such data can be segmented into packets because packet headers usually contain characteristic bit sequences. The key to solving the problem is finding those characteristic sequences. We present an efficient method of bit sequence enumeration. Both the Aho-Corasick (AC) algorithm and a data mining method are used to reduce the cost of the process.
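As a rough illustration of the idea of enumerating bit sequences to find characteristic header patterns, the following minimal sketch counts every fixed-length bit window in a raw capture and ranks the most frequent ones. It is a naive sliding-window count, not the paper's method: it does not use the Aho-Corasick algorithm or any data mining step to cut the cost. The function names (`to_bits`, `count_bit_sequences`, `candidate_markers`) and the sample data are assumptions made for this example.

```python
from collections import Counter


def to_bits(data: bytes) -> str:
    """Render raw bytes as a string of '0'/'1' characters, MSB first."""
    return "".join(f"{byte:08b}" for byte in data)


def count_bit_sequences(data: bytes, length: int) -> Counter:
    """Count every bit window of the given length, at every bit offset."""
    bits = to_bits(data)
    return Counter(bits[i:i + length] for i in range(len(bits) - length + 1))


def candidate_markers(data: bytes, length: int, top: int = 3):
    """Return the most frequent bit windows as candidate header markers."""
    return count_bit_sequences(data, length).most_common(top)


if __name__ == "__main__":
    # Hypothetical capture: a 0x7E byte followed by a zero byte, four times.
    sample = b"\x7e\x00" * 4
    for sequence, count in candidate_markers(sample, 8):
        print(sequence, count)
```

High frequency alone does not identify a header marker, since runs of padding bits also repeat often. A brute-force count like this also scales poorly with the window length and the data size, which is the cost the abstract says the AC algorithm and data mining are used to reduce.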