Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1038 |
Symbol | |
ID | 4811332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1241562 |
End bp | 1242587 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640106456 |
Product | germination protease |
Protein accession | YP_001037463 |
Protein GI | 125973553 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01441] GPR endopeptidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000076763 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCATTTGG CAAAAAGGAT AAGGGAGGAT TACAGGGTGG AAAGGAATAT AAGAACGGAT CTTGTGCTGG AAGCTCATGA GCTTTTGAAG GAAAACGAGT TTAAAAACGA AAGAAGGGAA CCGCCGGGAG TCGATGTTGA AAACGACGGT ACGGAAGATA TAAGAATAAC CAGGGTGAGG GTAACGTCAC CCACCGGTGA GGCGGCTATC GGAAAGCCGA TGGGTAATTA TATCACCCTT GAGGTGCCCA GGCTCAAGGA AAACGACCAG GAATTGTACG AAGAGACTTG CAAAGCTCTT GCCAAAGAAC TGACACGTGT ATTGAATCTT AAAGACGACT CAACAATTCT GGTTATAGGA TTGGGTAACT GGAATGTCAC ACCGGACGCA TTGGGGCCGA AAGTCGTTTC AAGGCTTATG GTCACAAGGC ATTTGCTTGA GTATGTTCCT GATCAGGTTG ATGAAGGGGT AAGACCGGTG TGTGCGGTAT CTCCCGGCGT GTTGGGTATT ACCGGTATAG AGACGGGTGA GATTGTAAGA GGAATTGTTG ACAGGGTAAA ACCCGATGTT GTAATTGCGA TAGATGCTTT AGCTTCCAGA AAAATGGAAA GAGTGAATAC CACTATTCAG ATTGCGGATA CCGGAATTTC CCCGGGTTCG GGAGTCGGCA ACAAAAGAAT GGAGCTTTCC AGAGAAACTT TGGGAGTTCC GGTTATTGCA ATCGGAGTCC CGACCGTGGT GGATGCGGCA ACCATGGCAA ATGACACAAT TGATCTCGTT ATAGACAACC TTATTAGAGA AGCAAAAGAA GATTCGCATT TTTACAATAT GCTTAAAAAT ATTGACAGAA ATGAAAAATA TCAATTGATA CAAGAGGTGT TGCAGCCCTA TGTGGGCAAC CTTGTGGTAA CTCCGAAAGA AATTGACGAT GTTGTTGACA GAATTGCAAA AGTAATTGCT AACGGTCTTA ATATTGCGCT TCACCAAGGT ATTACATTAA ACGATGTCAA CCGGTATGTC CAGTAG
|
Protein sequence | MHLAKRIRED YRVERNIRTD LVLEAHELLK ENEFKNERRE PPGVDVENDG TEDIRITRVR VTSPTGEAAI GKPMGNYITL EVPRLKENDQ ELYEETCKAL AKELTRVLNL KDDSTILVIG LGNWNVTPDA LGPKVVSRLM VTRHLLEYVP DQVDEGVRPV CAVSPGVLGI TGIETGEIVR GIVDRVKPDV VIAIDALASR KMERVNTTIQ IADTGISPGS GVGNKRMELS RETLGVPVIA IGVPTVVDAA TMANDTIDLV IDNLIREAKE DSHFYNMLKN IDRNEKYQLI QEVLQPYVGN LVVTPKEIDD VVDRIAKVIA NGLNIALHQG ITLNDVNRYV Q
|
| |