Mining Input Grammars for Security Testing

773

13.6

Microsoft Research334 тыс

Следующее

29.09.17 – 1 9312:29

Tech Showcase: Project Malmo – Experimentation Platform for the Next Generation of AI Research

Популярные

172 дня – 22215:51

Keynote: Building Globally Equitable AI

172 дня – 1 8615:12

MatterGen: A Generative Model for Materials Design

Опубликовано 29 сентября 2017, 18:18

Knowing which part of a program processes which parts of an input can reveal the structure of the input as well as the structure of the program. In a URL "example.com/path/", for instance, the protocol “http", the host “www.example.com", and the path “path" would be handled by different functions and stored in different variables. Given a set of sample inputs, we use _dynamic tainting_ to trace the data flow of each input character, and aggregate those input fragments that would be handled by the same function into lexical and syntactical entities. The result is a _context-free grammar_ that accurately reflects valid input structure; as it draws on function and variable names, it can be as readable as textbook examples: URL ::= PROTOCOL "://" HOST "/" PATH PROTOCOL ::= “http” | “https” | … HOST ::= /[a-zA-Z0-9.]+/ ... We expect inferred grammars to considerably ease the understanding of file and input formats. Their most important use, however, will be in automatic fuzz testing, where grammars can easily be turned into producers that help to quickly cover program features. Our grammar-based LANGFUZZ fuzzer is in daily use at Mozilla and has uncovered more than 4,000 defects so far; mining grammars automatically will bring such techniques to a wide range of programs. For details on our work on grammar mining, see st.cs.uni-saarland.de/models/a...

See more on this video at microsoft.com/en-us/research/v...

Свежие видео