Machine learning-based identification of wastewater treatment plant-specific microbial indicators using 16S rRNA gene sequencing

Machine Learning


  • Cai, L., Ju, F. & Zhang, T. Tracking human sewage microbiome in a municipal wastewater treatment plant. Appl. Microbiol. Biotechnol. 98(7), 3317–3326 (2014).

    CAS 
    PubMed 

    Google Scholar 

  • Lee, S. H., Kang, H. J. & Park, H. D. Influence of influent wastewater communities on temporal variation of activated sludge communities. Water Res. 73, 132–144 (2015).

    ADS 
    CAS 
    PubMed 

    Google Scholar 

  • Hultman, J. et al. Host range of antibiotic resistance genes in wastewater treatment plant influent and effluent. FEMS Microbiol. Ecol. 94(4), fiy038 (2018).

    PubMed 
    PubMed Central 

    Google Scholar 

  • Dueholm, M. K. D. et al. MiDAS 4: A global catalogue of full-length 16S rRNA gene sequences and taxonomy for studies of bacterial communities in wastewater treatment plants. Nat. Commun. 13(1), 1908 (2022).

    ADS 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Dueholm, M. K. D. et al. MiDAS 5: Global diversity of bacteria and archaea in anaerobic digesters. Nat. Commun. 15(1), 5361 (2024).

    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Do, T. T., Delaney, S. & Walsh, F. 16S rRNA gene based bacterial community structure of wastewater treatment plant effluents. FEMS Microbiol. Lett. 366(3), fnz017 (2019).

    CAS 
    PubMed 

    Google Scholar 

  • Kalinowska, A. et al. Insights into the microbial community of treated wastewater, its year-round variability and impact on the receiver, using cultivation, microscopy and amplicon-based methods. Sci. Total Environ. 829, 154630 (2022).

    CAS 
    PubMed 

    Google Scholar 

  • Aghalari, Z., Dahms, H. U., Sillanpaa, M., Sosa-Hernandez, J. E. & Parra-Saldivar, R. Effectiveness of wastewater treatment systems in removing microbial agents: a systematic review. Glob. Health. 16(1), 13 (2020).

    Google Scholar 

  • Fairbairn, D. J. et al. Contaminants of emerging concern: Mass balance and comparison of wastewater effluent and upstream sources in a mixed-use watershed. Environ. Sci. Technol. 50(1), 36–45 (2016).

    ADS 
    CAS 
    PubMed 

    Google Scholar 

  • Yang, W. et al. Emerging contaminants in the effluent of wastewater should be regulated: Which and to what extent?. Toxics 12(5), 309 (2024).

    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Gatica, J. & Cytryn, E. Impact of treated wastewater irrigation on antibiotic resistance in the soil microbiome. Environ. Sci. Pollut. Res. Int. 20(6), 3529–3538 (2013).

    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Hladilek, M. D. et al. Microbial community structure of a freshwater system receiving wastewater effluent. Environ. Monit. Assess. 188(11), 626 (2016).

    PubMed 

    Google Scholar 

  • Deng, P. et al. Profiling of intracellular and extracellular antibiotic resistance genes in municipal wastewater treatment plant and their effluent-receiving river. Environ. Sci. Pollut. Res. Int. 30(12), 33516–33523 (2023).

    CAS 
    PubMed 

    Google Scholar 

  • Su, Z., Chen, L. & Wen, D. Impact of wastewater treatment plant effluent discharge on the antibiotic resistome in downstream aquatic environments: A mini review. Front. Environ. Sci. Eng. 18(3), 36 (2023).

    Google Scholar 

  • Xiao, Z. et al. Effects of wastewater treatment plant effluent on microbial risks of pathogens and their antibiotic resistance in the receiving river. Environ. Pollut. 345, 123461 (2024).

    CAS 
    PubMed 

    Google Scholar 

  • Shchegolkova, N. M. et al. Microbial community structure of activated sludge in treatment plants with different wastewater compositions. Front. Microbiol. 7, 90 (2016).

    PubMed 
    PubMed Central 

    Google Scholar 

  • Li, L. G., Yin, X. & Zhang, T. Tracking antibiotic resistance gene pollution from different sources using machine-learning classification. Microbiome 6(1), 93 (2018).

    PubMed 
    PubMed Central 

    Google Scholar 

  • Tiwari, A. et al. Bacterial diversity and predicted enzymatic function in a multipurpose surface water system – from wastewater effluent discharges to drinking water production. Environ. Microbiome 16(1), 11 (2021).

    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Wijaya, J. & Oh, S. Machine learning reveals the complex ecological interplay of microbiome in a full-scale membrane bioreactor wastewater treatment plant. Environ. Res. 222, 115366 (2023).

    CAS 
    PubMed 

    Google Scholar 

  • Kitajima, M., Sassi, H. P. & Torrey, J. R. Pepper mild mottle virus as a water quality indicator. NPJ Clean Water 1, 19 (2018).

    CAS 

    Google Scholar 

  • Farkas, K. et al. Critical evaluation of crAssphage as a molecular marker for human-derived wastewater contamination in the aquatic environment. Food Environ. Virol. 11(2), 113–119 (2019).

    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Liu, M. et al. Chemical fingerprinting of organic micropollutants in different industrial treated wastewater effluents and their effluent-receiving river. Sci. Total Environ. 838(Pt 4), 156399 (2022).

    CAS 
    PubMed 

    Google Scholar 

  • Lim, F. Y., Ong, S. L., Hu, J. Recent advances in the use of chemical markers for tracing wastewater contamination in Aquatic Environment. Water. 9(2), 143 (2017).

  • Liu, X., Nie, Y. & Wu, X. L. Predicting microbial community compositions in wastewater treatment plants using artificial neural networks. Microbiome 11(1), 93 (2023).

    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Kim, Y. & Oh, S. Machine-learning insights into nitrate-reducing communities in a full-scale municipal wastewater treatment plant. J. Environ. Manage 300, 113795 (2021).

    CAS 
    PubMed 

    Google Scholar 

  • Bombaywala, S., Bajaj, A. & Dafale, N. A. Meta-analysis of wastewater microbiome for antibiotic resistance profiling. J. Microbiol. Methods 223, 106953 (2024).

    CAS 
    PubMed 

    Google Scholar 

  • Wang, J. et al. Metabolomics-based estimation of activated sludge microbial composition and prediction of filamentous bulking. Water Res. 259, 121805 (2024).

    CAS 
    PubMed 

    Google Scholar 

  • Kozich JJ et al. Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform. Appl Environ Microbiol. 79(17), 5112–5120 (2013).

    ADS 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Bolyen, E. et al. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat. Biotechnol. 37(8), 852–857 (2019).

    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Bokulich, N. A. et al. Optimizing taxonomic classification of marker-gene amplicon sequences with QIIME 2’s q2-feature-classifier plugin. Microbiome 6(1), 90 (2018).

    PubMed 
    PubMed Central 

    Google Scholar 

  • Dueholm, M.K.D., Andersen, K.S., Petersen, A.-K.C., Rudkjøbing, V., Nielsen, P.H.: MiDAS 5: Global diversity of bacteria and archaea in anaerobic digesters. bioRxiv. 2023:2023.08.24.554448

  • Katoh, K., Misawa, K., Kuma, K. & Miyata, T. MAFFT: A novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucl. Acids Res. 30(14), 3059–3066 (2002).

    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Shannon, C. E. The mathematical theory of communication. MD Comput. 14(4), 306–317 (1997).

    CAS 
    PubMed 

    Google Scholar 

  • Faith, D. P. Conservation evaluation and phylogenetic diversity. Biol. Cons. 61(1), 1–10 (1992).

    MathSciNet 

    Google Scholar 

  • McKinney, W. (ed.): Data structures for statistical computing in Python. In: Proceedings of the 9th Python in Science Conference (2010)

  • Lemaìtre, G., Nogueira, F. & Aridas, C. K. Imbalanced-learn: A python toolbox to tackle the curse of imbalanced datasets in machine learning. J. Mach. Learn. Res. 18(17), 1–5 (2017).

    Google Scholar 

  • Waskom, M. L. Seaborn: Statistical data visualization. J. Open Sour. Softw. 6(60), 3021 (2021).

    ADS 

    Google Scholar 

  • Pedregosa, F. et al. Scikit-learn: Machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011).

    MathSciNet 

    Google Scholar 

  • Harris, C. R. et al. Array programming with NumPy. Nature 585(7825), 357–362 (2020).

    ADS 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Ellson, J. et al. (eds) Graphviz—Open source graph drawing tools (Springer Berlin Heidelberg, Berlin, Heidelberg, 2002).

    Google Scholar 

  • Hunter, J. D. Matplotlib: A 2D graphics environment. Comput. Sci. Eng. 9(3), 90–95 (2007).

    Google Scholar 

  • Virtanen, P. et al. SciPy 1.0: Fundamental algorithms for scientific computing in Python. Nat. Methods 17(3), 261–272 (2020).

    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Lundberg, S. M. & Lee, S. A unified approach to interpreting model predictions. Adv. Neural Inf. Process. Syst. 30, 4765–4774 (2017).

    Google Scholar 

  • Seabold, S., Perktold, J. (ed.): Statsmodels: Econometric and statistical modeling with Python. In: Proceedings of the 9th Python in Science Conference (2010)

  • Jordahl, K., Van den Bossche, J., Fleischmann, M., Wasserman, J., McBride, J., Gerard, J., et al.: geopandas/geopandas: v0.8.1. v0.8.1 ed: Zenodo (2020)

  • Gillies, S., van der Wel, C., Van den Bossche, J., Taves, M.W., Arnott, J., Ward, B.C.: Manipulation and analysis of geometric objects in the Cartesian plane. 2.0.5 ed.

  • Planet dump retrieved from https://planet.osm.org 2024.07.20 [Internet]. 2017. Available from: https://www.openstreetmap.org.

  • ChatGPT. 4 ed. San Francisco, CA: OpenAI; 2024.

  • Whon, T. W. et al. The effects of sequencing platforms on phylogenetic resolution in 16 S rRNA gene profiling of human feces. Sci. Data 5, 180068 (2018).

    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Johnson, J. S. et al. Evaluation of 16S rRNA gene sequencing for species and strain-level microbiome analysis. Nat. Commun. 10(1), 5029 (2019).

    ADS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Nierychlo, M. et al. MiDAS 3: An ecosystem-specific reference database, taxonomy and knowledge platform for activated sludge and anaerobic digesters reveals species-level microbiome composition of activated sludge. Water Res. 182, 115955 (2020).

    CAS 
    PubMed 

    Google Scholar 

  • Ziyan, Y. et al. Diversity of activated sludge microbial community structure in different wastewater treatment plants. Biol. Bull. 50(3), 329–337 (2023).

    Google Scholar 

  • Seviour, R. & Nielsen, P. H. Microbial ecology of activated sludge (IWA publishing, London, 2010).

    Google Scholar 

  • Speth, D. R., In’t Zandt, M. H., Guerrero-Cruz, S., Dutilh, B. E. & Jetten, M. S. Genome-based microbial ecology of anammox granules in a full-scale wastewater treatment system. Nat. Commun. 7, 11172 (2016).

    ADS 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Perez-Cataluna, A., Collado, L., Salgado, O., Lefinanco, V. & Figueras, M. J. A polyphasic and taxogenomic evaluation uncovers Arcobacter cryaerophilus as a species complex that embraces four genomovars. Front. Microbiol. 9, 805 (2018).

    PubMed 
    PubMed Central 

    Google Scholar 

  • Nouha, K., Kumar, R. S. & Tyagi, R. D. Heavy metals removal from wastewater using extracellular polymeric substances produced by Cloacibacterium normanense in wastewater sludge supplemented with crude glycerol and study of extracellular polymeric substances extraction by different methods. Bioresour. Technol. 212, 120–129 (2016).

    CAS 
    PubMed 

    Google Scholar 

  • Kertesz, M. A. & Kawasaki, A. Hydrocarbon-degrading Sphingomonads: Sphingomonas, Sphingobium, Novosphingobium, and Sphingopyxis. In Handbook of hydrocarbon and lipid microbiology (ed. Timmis, K. N.) 1693–1705 (Springer Berlin, Heidelberg, Berlin, Heidelberg, 2010).

    Google Scholar 

  • Zhang, L., Liu, H., Dai, J., Xu, P. & Tang, H. Unveiling degradation mechanism of PAHs by a Sphingobium strain from a microbial consortium. mLife 1(3), 287–302 (2022).

    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Lee, J., Koo, T., Yulisa, A. & Hwang, S. Magnetite as an enhancer in methanogenic degradation of volatile fatty acids under ammonia-stressed condition. J. Environ. Manage 241, 418–426 (2019).

    CAS 
    PubMed 

    Google Scholar 

  • Schwan, B. et al. Chemically stressed bacterial communities in anaerobic digesters exhibit resilience and ecological flexibility. Front. Microbiol. 11, 867 (2020).

    PubMed 
    PubMed Central 

    Google Scholar 

  • Shakeri Yekta, S. et al. Effluent solids recirculation to municipal sludge digesters enhances long-chain fatty acids degradation capacity. Biotechnol. Biofuels 14(1), 56 (2021).

    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Butler, B. J., Mccallum, K. J. & Inniss, W. E. Characterization of Aquaspirillum arcticum sp. Nov., a new psychrophilic bacterium. Syst. Appl. Microbiol. 12(3), 263–266 (1989).

    Google Scholar 

  • Tehei, M. & Zaccai, G. Adaptation to extreme environments: Macromolecular dynamics in complex systems. Biochim. Biophys. Acta 1724(3), 404–410 (2005).

    CAS 
    PubMed 

    Google Scholar 

  • Zellner, G. et al. Methanocorpusculaceae fam. nov., represented by Methanocorpusculum parvum, Methanocorpusculum sinense spec. nov. and Methanocorpusculum bavaricum spec. nov.. Arch. Microbiol. 151(5), 381–390 (1989).

    CAS 
    PubMed 

    Google Scholar 

  • Thomas, C. M., Desmond-Le Quéméner, E., Gribaldo, S. & Borrel, G. Factors shaping the abundance and diversity of the gut archaeome across the animal kingdom. Nat. Commun. 13(1), 3358 (2022).

    ADS 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Schrecengost, A., Rotterova, J., Polakova, K., Cepicka, I. & Beinart, R. A. Divergent marine anaerobic ciliates harbor closely related Methanocorpusculum endosymbionts. ISME J. 18(1), wrae125 (2024).

    PubMed 
    PubMed Central 

    Google Scholar 



  • Source link

    Leave a Reply

    Your email address will not be published. Required fields are marked *