Gene Cthe_2420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2420 
Symbol 
ID4808136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2890294 
End bp2892075 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content33% 
IMG OID640107834 
ProductHD superfamily phosphohydrolase 
Protein accessionYP_001038815 
Protein GI125974905 
COG category[R] General function prediction only 
COG ID[COG1078] HD superfamily phosphohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAATG AGAAATGTAT CAGGGATCCT GTTCATAATT ATATATATTT AACCGATGTT 
GAATTTAAAC TTATAAGACA TCCTTTGTTT CAAAGATTGA GATTTATAAC TCAGAACGGT
TCGGCATATT ATACTTATCC GTCAAACAAA AATTGCCGTT TTTTACATTC TCTTGGTTCT
ATGAAACTTG GAGGAGACAT TTTCTTAAAT GCCACAGAGA ATTTGTCTGA CGGGGATGTA
AAAGAATACC TGATACAAGC TTATAAAATG TTGGATAGCA TAGCAAATAA CAATCTGACA
ATTCCTATTT TTGACATAAT TAAAGAATTT GCCTCCATGA ATGATAAAAC GTTTGATAAA
TACGGGTTAT CTTTGCATAT AGACAAATCT TCGATAGAGA ATGAAATAAA AAAAGAAGTG
TTTCAAATGA AATTTGCCAG GGCGGTTTTA TTTCAGAGCG TGAGGCTTGC GTGTATACTT
CATGACATAG GCCATTTTCC GTTTAGTCAT GCTGTGGAAA GAGCCTTTAG CCAATATTTT
GATTATTTAA CCGGGGATCA GAAAAAGGAA AACCAAATAT ATGTGAAATA TCATTCAAAA
GCAGAATATG TTGAAAAACA AATCCATGAA AGAATTGGTT TGGGAATATT GCAAAAGATT
ATTCCGTCAA GTGAAAAAGA TTTTCATAAG CTGTGCAGGC ATGTAGCAAG AATAATTCTT
ATTGGGCATT CGGAATACAG AAATATAGTG CATCCTTTAT ATACTATTAT ATCAAGTGAG
TTGGATGCTG ACAGGCTTGA CTATTCGCTA AGGGACCCCC GTTCGTCAGG TCTTGAATTG
GGAGCGTTTG ATATTGAAAG ATTGCTGAAC AATTTTACAA TATATAGAGA AAATGACAAA
TTTGAGATAT TGCCAAAGGT AAATGCGCTT TCTTCAATTG AAAGTTTTTA TCACCAGCGT
TTCCTTATAT ACAAGTATTT AATATATCAT CACAGTAAAG CCCGTATGGA TGAAATTGTA
AAGGAAATTA CTTTTTTACT GCTTGAGATT TATAACAGCA AGGAGATAAA ATACGATTCA
GTCAGGAGAG TATTGACAGA TTATAATTTC GATTTTCTTT GGGAAAAATG TGACGAAGAG
GAGTATTATT ATTGTAATGA AAATTGGTAC TTTACTATTT TGCAGGCAAT ATATATAATT
ATACAAGGCG TCAGTAACCC TGATGACAGG ACTGAGAAAC TTAAGACTTT GATTGAGACT
TTTATTTTCA GAAAAACTGA AAACATTTAC TCATTTTTTA AAAGATATGA TGCCTATTTT
GATTTTATGC AAAAAATGTA CATAAAGATA AAGGAAGCCG AAGATATAGA ATTTGGTGAT
TTTGAAAAGA AGATGAGGGG TGTCATAAAA GATTCGATTA ACAATAATGA TTTGAAAGAA
CTGAATGATA GACTTTACAA ACAGGACCGG GTTATTTGTT TGATTGCAAA AACTGAGCCC
AAAGTGATAA AATTTTTGGA AAATCAGGCA TTTCCATTCA CATCGGAGTT ACATGTTGCT
CAACAGGAAA AAAACGGTCA GAAGAAAAAG GTACCTGTAA CTGTATTTTC ACCTTATCTG
CAGAGTATGG CTCATGCATC GGAAAAAGAG CAATTCTTCA ATGTGTTTAT TATTAAAGAG
GGGATTAAAA CAGACGGCCA AAGAAGATTG TTGGAGAAGA TCAGAAAGGA ATTTATGCGC
TTTTTTGTTT CAAAATACAA GCTTTGCTTT GGAATAGAAT GA
 
Protein sequence
MANEKCIRDP VHNYIYLTDV EFKLIRHPLF QRLRFITQNG SAYYTYPSNK NCRFLHSLGS 
MKLGGDIFLN ATENLSDGDV KEYLIQAYKM LDSIANNNLT IPIFDIIKEF ASMNDKTFDK
YGLSLHIDKS SIENEIKKEV FQMKFARAVL FQSVRLACIL HDIGHFPFSH AVERAFSQYF
DYLTGDQKKE NQIYVKYHSK AEYVEKQIHE RIGLGILQKI IPSSEKDFHK LCRHVARIIL
IGHSEYRNIV HPLYTIISSE LDADRLDYSL RDPRSSGLEL GAFDIERLLN NFTIYRENDK
FEILPKVNAL SSIESFYHQR FLIYKYLIYH HSKARMDEIV KEITFLLLEI YNSKEIKYDS
VRRVLTDYNF DFLWEKCDEE EYYYCNENWY FTILQAIYII IQGVSNPDDR TEKLKTLIET
FIFRKTENIY SFFKRYDAYF DFMQKMYIKI KEAEDIEFGD FEKKMRGVIK DSINNNDLKE
LNDRLYKQDR VICLIAKTEP KVIKFLENQA FPFTSELHVA QQEKNGQKKK VPVTVFSPYL
QSMAHASEKE QFFNVFIIKE GIKTDGQRRL LEKIRKEFMR FFVSKYKLCF GIE