- Compiler Technology Department
- Computer Systems Department
- Information Systems Department
- Software Engineering Department
- System integration and multi-disciplinary collaborative environments
- System Programming Department
- Theoretical Computer Science Department
- Academic council
- Dissertation council
- Verification Center of the Operating System Linux
- Center of competence in parallel and distributed computing
Apache Spark - a fast and general engine for large-scale data processing
Participating in development of Apache Spark and using it in own projects.
Our open source solutions for Spark:
- spark-openstack - Scripts to setup Spark cluster (any version) in any Openstack environment with optional useful tools.
- pu4spark - A library for Positive-Unlabeled Learning for Apache Spark MLlib (ml package). Library page on SparkPackages: https://spark-packages.org/package/ispras/pu4spark