We present a face recognition system that can recognize multiple people in parallel, in real time, on a single Jetson TX2. Owing to rapid progress in deep learning, the accuracy of face recognition has recently surpassed human-level performance. GPUs have become the major platform for training and running deep learning models, and the speed of NVIDIA GPUs on deep learning tasks is increasing rapidly thanks to hardware and software optimizations. Our system combines the most accurate face detection and recognition models with the fastest software stack. Paired with a 4K camera, it can recognize more than 10 people in parallel in crowded scenes, even from a range of 10 meters. The system can be deployed in low-power embedded environments such as drones.