Unstructured sources of information such as scientific publications, electronic patient files, but also patents, are available in very large numbers. The automated analysis of these unstructured knowledge sources requires substantial compute resources; however, scaling systems for information extraction must be optimized for HPC environments and, for example, harmonize with the existing middleware for the distribution of computationally intensive tasks. Fraunhofer SCAI makes complex text mining workflows executable on HPC environments and demonstrates the scientific use of high-performance computers for information extraction. The services offered concentrate on the cost-effective indexing of company archives with a focus on chemistry as well as on the development of clinical routine data for research purposes and for studies in health economics.