v-rex 🦖

A verbose implementation of a regular expression engine.

NOTE: This is a work in progess.

Why

This projects purpose is to help me with learning about formal automata.

The implementation follows a strict by-the-book approach. The regular expression gets converted to an ε-NFA, that can be converted to an NFA, which gets converted to a DFA. The DFA can actually run the matching.

Usage

Clone this repo and run

python3 main.py

Limitations

The input alphabet has to be {0, 1}, only strings consisting of 0 and 1 are supported.
Only the following operations are supported:
- Concatenation ('01')
- Union ('0+1')
- Kleene star ('0*')
Precedence orders with parenthesis is supported

Under the hood

The implementation has 4 main parts, Regex, e-NFA, NFA and DFA. The processing of the matching of a string to a regular expression happens as follows.

Regex - regex.py

Using the Shunting-Yard algorithm, the regex is converted from infix to postifx notation. This helps to eliminate having to parse parenthesis. For example the regex (0+1)*0+11 would be transformed to 01+*0.11.+.

Out of the postfix regex the ε-NFA is created using the Thomsons's construction.

ε-NFA - enfa.py

The epsilon-NFA is converted to an NFA by removing all ε transitions.

NFA - nfa.py

The non-deterministic format automaton is converted to a DFA.

DFA - dfa.py

The deterministic formal automaton can be easily executed and matched against the input string.

Name		Name	Last commit message	Last commit date
Latest commit History 75 Commits
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
dfa.py		dfa.py
dfa_test.py		dfa_test.py
enfa.py		enfa.py
enfa_test.py		enfa_test.py
integration_test.py		integration_test.py
main.py		main.py
nfa.py		nfa.py
nfa_test.py		nfa_test.py
plotter.py		plotter.py
printer.py		printer.py
regex.py		regex.py
regex_test.py		regex_test.py
requirements.txt		requirements.txt

peterhalasz/v-rex

Folders and files

Latest commit

History

Repository files navigation

v-rex 🦖

Why

Usage

Limitations

Under the hood

Regex - regex.py

ε-NFA - enfa.py

NFA - nfa.py

DFA - dfa.py

About

Topics

Resources

Stars

Watchers

Forks

Languages