Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2420 |
Symbol | |
ID | 4808136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2890294 |
End bp | 2892075 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640107834 |
Product | HD superfamily phosphohydrolase |
Protein accession | YP_001038815 |
Protein GI | 125974905 |
COG category | [R] General function prediction only |
COG ID | [COG1078] HD superfamily phosphohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAATG AGAAATGTAT CAGGGATCCT GTTCATAATT ATATATATTT AACCGATGTT GAATTTAAAC TTATAAGACA TCCTTTGTTT CAAAGATTGA GATTTATAAC TCAGAACGGT TCGGCATATT ATACTTATCC GTCAAACAAA AATTGCCGTT TTTTACATTC TCTTGGTTCT ATGAAACTTG GAGGAGACAT TTTCTTAAAT GCCACAGAGA ATTTGTCTGA CGGGGATGTA AAAGAATACC TGATACAAGC TTATAAAATG TTGGATAGCA TAGCAAATAA CAATCTGACA ATTCCTATTT TTGACATAAT TAAAGAATTT GCCTCCATGA ATGATAAAAC GTTTGATAAA TACGGGTTAT CTTTGCATAT AGACAAATCT TCGATAGAGA ATGAAATAAA AAAAGAAGTG TTTCAAATGA AATTTGCCAG GGCGGTTTTA TTTCAGAGCG TGAGGCTTGC GTGTATACTT CATGACATAG GCCATTTTCC GTTTAGTCAT GCTGTGGAAA GAGCCTTTAG CCAATATTTT GATTATTTAA CCGGGGATCA GAAAAAGGAA AACCAAATAT ATGTGAAATA TCATTCAAAA GCAGAATATG TTGAAAAACA AATCCATGAA AGAATTGGTT TGGGAATATT GCAAAAGATT ATTCCGTCAA GTGAAAAAGA TTTTCATAAG CTGTGCAGGC ATGTAGCAAG AATAATTCTT ATTGGGCATT CGGAATACAG AAATATAGTG CATCCTTTAT ATACTATTAT ATCAAGTGAG TTGGATGCTG ACAGGCTTGA CTATTCGCTA AGGGACCCCC GTTCGTCAGG TCTTGAATTG GGAGCGTTTG ATATTGAAAG ATTGCTGAAC AATTTTACAA TATATAGAGA AAATGACAAA TTTGAGATAT TGCCAAAGGT AAATGCGCTT TCTTCAATTG AAAGTTTTTA TCACCAGCGT TTCCTTATAT ACAAGTATTT AATATATCAT CACAGTAAAG CCCGTATGGA TGAAATTGTA AAGGAAATTA CTTTTTTACT GCTTGAGATT TATAACAGCA AGGAGATAAA ATACGATTCA GTCAGGAGAG TATTGACAGA TTATAATTTC GATTTTCTTT GGGAAAAATG TGACGAAGAG GAGTATTATT ATTGTAATGA AAATTGGTAC TTTACTATTT TGCAGGCAAT ATATATAATT ATACAAGGCG TCAGTAACCC TGATGACAGG ACTGAGAAAC TTAAGACTTT GATTGAGACT TTTATTTTCA GAAAAACTGA AAACATTTAC TCATTTTTTA AAAGATATGA TGCCTATTTT GATTTTATGC AAAAAATGTA CATAAAGATA AAGGAAGCCG AAGATATAGA ATTTGGTGAT TTTGAAAAGA AGATGAGGGG TGTCATAAAA GATTCGATTA ACAATAATGA TTTGAAAGAA CTGAATGATA GACTTTACAA ACAGGACCGG GTTATTTGTT TGATTGCAAA AACTGAGCCC AAAGTGATAA AATTTTTGGA AAATCAGGCA TTTCCATTCA CATCGGAGTT ACATGTTGCT CAACAGGAAA AAAACGGTCA GAAGAAAAAG GTACCTGTAA CTGTATTTTC ACCTTATCTG CAGAGTATGG CTCATGCATC GGAAAAAGAG CAATTCTTCA ATGTGTTTAT TATTAAAGAG GGGATTAAAA CAGACGGCCA AAGAAGATTG TTGGAGAAGA TCAGAAAGGA ATTTATGCGC TTTTTTGTTT CAAAATACAA GCTTTGCTTT GGAATAGAAT GA
|
Protein sequence | MANEKCIRDP VHNYIYLTDV EFKLIRHPLF QRLRFITQNG SAYYTYPSNK NCRFLHSLGS MKLGGDIFLN ATENLSDGDV KEYLIQAYKM LDSIANNNLT IPIFDIIKEF ASMNDKTFDK YGLSLHIDKS SIENEIKKEV FQMKFARAVL FQSVRLACIL HDIGHFPFSH AVERAFSQYF DYLTGDQKKE NQIYVKYHSK AEYVEKQIHE RIGLGILQKI IPSSEKDFHK LCRHVARIIL IGHSEYRNIV HPLYTIISSE LDADRLDYSL RDPRSSGLEL GAFDIERLLN NFTIYRENDK FEILPKVNAL SSIESFYHQR FLIYKYLIYH HSKARMDEIV KEITFLLLEI YNSKEIKYDS VRRVLTDYNF DFLWEKCDEE EYYYCNENWY FTILQAIYII IQGVSNPDDR TEKLKTLIET FIFRKTENIY SFFKRYDAYF DFMQKMYIKI KEAEDIEFGD FEKKMRGVIK DSINNNDLKE LNDRLYKQDR VICLIAKTEP KVIKFLENQA FPFTSELHVA QQEKNGQKKK VPVTVFSPYL QSMAHASEKE QFFNVFIIKE GIKTDGQRRL LEKIRKEFMR FFVSKYKLCF GIE
|
| |