Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1421 |
Symbol | |
ID | 4809082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1740053 |
End bp | 1741033 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106844 |
Product | signal peptide peptidase SppA, 36K type |
Protein accession | YP_001037845 |
Protein GI | 125973935 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000000360798 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGA AACAGCTAAT CGGTTTAATT GTGGCAGGAG TAGTATTTGT TTTTGTTTGT TCTTCAAGTG TTTTGGTAAA CACCGTTTCA AAAAGACTCG GAACGTCTCT GAACTTAAGC GACAGCGAAA GCAGTCTTCC TCTTACTCCC TATATAGGTG TTGTAAGTGT TGAAGGCACC ATTATGGACA GCAACTCAAC CACAAGTTTC TTAAGCAACG GTTACAACCA CAAAGAAACG TTGAAGCTCA TTGAGGATAT GAAAAATTCA GCAAGCAACA AAGGCATTCT TTTGTATGTG AATTCCCCCG GCGGAGGCGT TTATGAAAGT GATGAATTGT ATTTGAAGTT GAAAGAATAC AAAGAAGAAA CCGGAAGGCC GGTCTGGACC TATATGTCAA ATCAGGCATG TTCCGGTGGC TATTATATTT CTATGGCATC CGACAAAGTA TTTTCAAACC GAAACGCATG GACCGGTTCC ATCGGCGTCA TCATCTCCCT GACAAACCTC AAAGGCTTGT ACGATAACCT TGGAATTAAA GGTATTTATA TTACAAGCGG CAGAAACAAA GCAATGGGTG CTGCCGATCT GGAATTGACA GATGAGCAGC GTGATATACT TCAAAGCCTT GTGGATGAGG CATATGAGCA ATTTGTTGAA ATTGTGGCGG AAGGCAGAAA AATGACAGTG GAAGAAGTAA AAAGAATTGC CGATGGAAGA ATTCTTTCCG CAAAACAGGC ACTCGAGTTG AACCTCATTG ATGAAATTGC CACGTATGAT GAAGTAAAAG AAGCTTTCAG CGCAGAGCTT GGAAATGTTA AAATATATAC ACCCAAAAAG AAAGACCCGT TTGGACTTAG CTCTTTGTTC AGCTATATAA ACAGCTTGAA ACCTCGCTCT GATACTGAAA TAATAGCCGA GTTGATAAAG GCTAAAGGAA ATGGGGTGCC GATGTATTAT GCAATGCCGG GACAATACTA A
|
Protein sequence | MNKKQLIGLI VAGVVFVFVC SSSVLVNTVS KRLGTSLNLS DSESSLPLTP YIGVVSVEGT IMDSNSTTSF LSNGYNHKET LKLIEDMKNS ASNKGILLYV NSPGGGVYES DELYLKLKEY KEETGRPVWT YMSNQACSGG YYISMASDKV FSNRNAWTGS IGVIISLTNL KGLYDNLGIK GIYITSGRNK AMGAADLELT DEQRDILQSL VDEAYEQFVE IVAEGRKMTV EEVKRIADGR ILSAKQALEL NLIDEIATYD EVKEAFSAEL GNVKIYTPKK KDPFGLSSLF SYINSLKPRS DTEIIAELIK AKGNGVPMYY AMPGQY
|
| |