Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1389 |
Symbol | |
ID | 4809050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1695905 |
End bp | 1697233 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106813 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_001037814 |
Protein GI | 125973904 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0617] tRNA nucleotidyltransferase/poly(A) polymerase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.23743 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGCA AAATAGAAAT CAATATGCCC AAAGATGTGT CATATATTAT CGATACTCTT AACAATAGAG GTTTTAAAGC TTATATAGTG GGAGGCTGCA TACGGGATGC CATTTTAGGA AAAGTTCCTG CCGACTGGGA TGTCGCCACC GATGCACAGC CTGAAGACGT AAAGCTCATC TTTGACAAAA TCGTTGAGAC CGGAATTAAA CATGGAACGG TTACTGCCGT TATAAACGGC TGTAATTATG AAATTACCAC TTTCAGAGCA CCGTCGTCAG CTAAAATTCC CACTATCAAG GATGATTTGG GGTTAAGGGA TTTTACAATA AACGCAATGG CCTATCATCC TGAAGAAGGT ATAATTGATC CATTTTTGGG CATGCAGGAC ATGGAAAAGT CCGTCATCCG CGCAGTAGGC TCCCCCGAAG ACCGCTTTCA TGAAGATCCT TTAAGAATGC TGAGAGCTGT TCGTTTAAGC TCCACTTTAG GGTTTGAAAT CGACAGGTCG GTCCTTTCGG CCATAAAAGA AAACTGCAAA CTGATAGAAA AAGTAAGTCC GGAAAGAATC CGGGATGAGC TGTCAAAAAT ATTGATTTCG GACAGGCCAA AAAATTTTCT TGTCTTGAGA GAAACAGGCC TTCTGAAATA TGTGCTTCCG GAGTTTGACA TATGTTTTGA TACCGGCCAG AACCATCCTT ATCATGTTTA CAATGTCGGA ATGCATACTT TGGAAACTGT GTCGAATATT GAAAGCAACC TTGTCCTGAG ATGGACCATG CTCTTGCACG ATATAGGAAA ACCAGTTGTC AAAACCACTG ATCAAAACGG AACAGATCAT TTTTACGGTC ATCCTGAAGA AAGCGTTAAT ATCGCGGATA AAATTATGAA AAGGCTCAGG TTTGACAACA AAACCACAAA CAAAGTGCTA AGGCTTATTA AGCATCATGA CCGGCGTATA GAACCGAACC AAAAATCAGT GCGAAAAGCT GTAAGCATCA TCGGAAAAGA CATTTTTCCA GACCTTTTAA AGGTTCAGGA AGCGGACAAA AAAGGCCAAA ATCCTCAGTA CCTGGATGAA AGGCTTAAAG TCCTTGATGA AATAAAGGAC ATCTTTTTTA ATCTGGAAAA GGAAGGACAG ATCCTAAACT TAAAAGACCT TGCATTAAAC GGAAACGACC TTCTTGCAAT GGGTTTTGAA CAGAGCCGGG AAATAGGTAT AATTCTAAGA GAACTTTACA ATATTGTTCT TGACAACCCT GAAATGAACA CAAAAGAAAA GTTGACTGAA ATTGTCGAAA ATATAAGAAA AAAAAGTTTT AAAACATAG
|
Protein sequence | MKGKIEINMP KDVSYIIDTL NNRGFKAYIV GGCIRDAILG KVPADWDVAT DAQPEDVKLI FDKIVETGIK HGTVTAVING CNYEITTFRA PSSAKIPTIK DDLGLRDFTI NAMAYHPEEG IIDPFLGMQD MEKSVIRAVG SPEDRFHEDP LRMLRAVRLS STLGFEIDRS VLSAIKENCK LIEKVSPERI RDELSKILIS DRPKNFLVLR ETGLLKYVLP EFDICFDTGQ NHPYHVYNVG MHTLETVSNI ESNLVLRWTM LLHDIGKPVV KTTDQNGTDH FYGHPEESVN IADKIMKRLR FDNKTTNKVL RLIKHHDRRI EPNQKSVRKA VSIIGKDIFP DLLKVQEADK KGQNPQYLDE RLKVLDEIKD IFFNLEKEGQ ILNLKDLALN GNDLLAMGFE QSREIGIILR ELYNIVLDNP EMNTKEKLTE IVENIRKKSF KT
|
| |