Gene Cthe_1389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1389 
Symbol 
ID4809050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1695905 
End bp1697233 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content40% 
IMG OID640106813 
Productmetal dependent phosphohydrolase 
Protein accessionYP_001037814 
Protein GI125973904 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0617] tRNA nucleotidyltransferase/poly(A) polymerase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.23743 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGGCA AAATAGAAAT CAATATGCCC AAAGATGTGT CATATATTAT CGATACTCTT 
AACAATAGAG GTTTTAAAGC TTATATAGTG GGAGGCTGCA TACGGGATGC CATTTTAGGA
AAAGTTCCTG CCGACTGGGA TGTCGCCACC GATGCACAGC CTGAAGACGT AAAGCTCATC
TTTGACAAAA TCGTTGAGAC CGGAATTAAA CATGGAACGG TTACTGCCGT TATAAACGGC
TGTAATTATG AAATTACCAC TTTCAGAGCA CCGTCGTCAG CTAAAATTCC CACTATCAAG
GATGATTTGG GGTTAAGGGA TTTTACAATA AACGCAATGG CCTATCATCC TGAAGAAGGT
ATAATTGATC CATTTTTGGG CATGCAGGAC ATGGAAAAGT CCGTCATCCG CGCAGTAGGC
TCCCCCGAAG ACCGCTTTCA TGAAGATCCT TTAAGAATGC TGAGAGCTGT TCGTTTAAGC
TCCACTTTAG GGTTTGAAAT CGACAGGTCG GTCCTTTCGG CCATAAAAGA AAACTGCAAA
CTGATAGAAA AAGTAAGTCC GGAAAGAATC CGGGATGAGC TGTCAAAAAT ATTGATTTCG
GACAGGCCAA AAAATTTTCT TGTCTTGAGA GAAACAGGCC TTCTGAAATA TGTGCTTCCG
GAGTTTGACA TATGTTTTGA TACCGGCCAG AACCATCCTT ATCATGTTTA CAATGTCGGA
ATGCATACTT TGGAAACTGT GTCGAATATT GAAAGCAACC TTGTCCTGAG ATGGACCATG
CTCTTGCACG ATATAGGAAA ACCAGTTGTC AAAACCACTG ATCAAAACGG AACAGATCAT
TTTTACGGTC ATCCTGAAGA AAGCGTTAAT ATCGCGGATA AAATTATGAA AAGGCTCAGG
TTTGACAACA AAACCACAAA CAAAGTGCTA AGGCTTATTA AGCATCATGA CCGGCGTATA
GAACCGAACC AAAAATCAGT GCGAAAAGCT GTAAGCATCA TCGGAAAAGA CATTTTTCCA
GACCTTTTAA AGGTTCAGGA AGCGGACAAA AAAGGCCAAA ATCCTCAGTA CCTGGATGAA
AGGCTTAAAG TCCTTGATGA AATAAAGGAC ATCTTTTTTA ATCTGGAAAA GGAAGGACAG
ATCCTAAACT TAAAAGACCT TGCATTAAAC GGAAACGACC TTCTTGCAAT GGGTTTTGAA
CAGAGCCGGG AAATAGGTAT AATTCTAAGA GAACTTTACA ATATTGTTCT TGACAACCCT
GAAATGAACA CAAAAGAAAA GTTGACTGAA ATTGTCGAAA ATATAAGAAA AAAAAGTTTT
AAAACATAG
 
Protein sequence
MKGKIEINMP KDVSYIIDTL NNRGFKAYIV GGCIRDAILG KVPADWDVAT DAQPEDVKLI 
FDKIVETGIK HGTVTAVING CNYEITTFRA PSSAKIPTIK DDLGLRDFTI NAMAYHPEEG
IIDPFLGMQD MEKSVIRAVG SPEDRFHEDP LRMLRAVRLS STLGFEIDRS VLSAIKENCK
LIEKVSPERI RDELSKILIS DRPKNFLVLR ETGLLKYVLP EFDICFDTGQ NHPYHVYNVG
MHTLETVSNI ESNLVLRWTM LLHDIGKPVV KTTDQNGTDH FYGHPEESVN IADKIMKRLR
FDNKTTNKVL RLIKHHDRRI EPNQKSVRKA VSIIGKDIFP DLLKVQEADK KGQNPQYLDE
RLKVLDEIKD IFFNLEKEGQ ILNLKDLALN GNDLLAMGFE QSREIGIILR ELYNIVLDNP
EMNTKEKLTE IVENIRKKSF KT