Protein sequences of translated gene-prediction sequences of five Trichoptera species
datasetposted on 25.11.2019 by Joshua Niklas Ebner, Danilo Ritz, Stefanie von Fumetti
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
This protein sequence file (FASTA format) contains in-silico translated gene prediction sequences of the following four Trichoptera species (DOIs leading to the original publication in which these data were generated are given in parentheses after the species name): Stenopsyche tienmushanensis (10.1093/gigascience/giy143), Glossosoma conformis (10.1016/j.scitotenv.2018.03.109), Sericostoma personatum (10.1111/mec.14292) and Limnephilus lunatus (10.1093/nar/gku983). Additionally the FASTA file contains in-silico predicted protein sequences of the transcriptome of Micropterna lateralis (10.1080/01650424.2016.1200727). For the bioinformatics pipeline on how these protein sequences were generated from the original data see Material & Methods in our publication. Additionally, the FASTA files contains heat shock protein sequences downloaded from NCBI (taxonomic scope Insecta, see publication). This file was used as a homology in-silico digested search space in MaxQuant to identify proteins in samples of Crunoecia irrorata reared at three different temperatures via LC-MS/MS.