Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2141 |
Symbol | |
ID | 4811188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2544545 |
End bp | 2545732 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640107545 |
Product | metal-dependent phosphohydrolase |
Protein accession | YP_001038538 |
Protein GI | 125974628 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0232] dGTP triphosphohydrolase |
TIGRFAM ID | [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00242364 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTCTA ATGAAATTTA CAGTGAAATA ATTAAAAACA ATTTAAAAAG AGAAAAGATA TTATCAAAAT ATGCTTGTAA GAGCATCATG GGGGTTCGCC GGCATCCGGA ACGGGAGGAA ATAGAGGACA GGATTAATAT CAGACCTGCT TTTTTCCACG ATACCGACAG GATAATACAT TCTCTTGCCT ATACAAGATA TATTGACAAA ACGCAGGCAT TTTTCCTTTT CGAAAACGAC CATATTACCC ACAGGGTTCT GCATGTTCAA TTTGTTTCAA AGATAGCAAG GGTTATTGGA AGGTGTTTGA GTCTGAATGA GGATTTGATT GAGGCAATTG CGCTTGGGCA TGACTTGGGA CATGTACCAT ACGGGCATGA CGGTGAGAAT TATTTGAATG AAATCATTAA TAAAAAAGAG AACCTGTATT TTAACCATAA TGCTCAAAGC GTGAGGTTTT TGATGGAGTT GGAAAACGGA GCACGGGGAT TGAACCTGAC ATTGCAGGTA CTTGACGGTA TCCTCTGCCA CAACGGTGAA ATTCTCGAAA AGGAATATGC GCCGGAAACG AACAAAAATT GGGATAAATT TTTGGAGGAT TATGAAAAGT GCTGGACGGA AAAAGACTAC AGCAAAAAGC TGCGGCCAAT GACGCTGGAG GGCTGTCTGG TACGGATATG CGATATAATT GCATATATAG GAAGGGATAT TGAGGATGCA ATAACAGTAA AGCTTATAAA GAGGGAGGAT ATTCCGGGTG AAGTTGTAAG GGTGCTGGGA AATACCAATC GGGATATTAT TAACAACCTT GCAAAAGATA TAATAGAAAA TAGTTATAAC AGGCCGTATA TTATGTTTTC CAAAGATAAG TACAATGCTT TGAAACTGCT TCTTGATTTT AACTATAAGT ATATTTATAA AAATCCTGTA AAAATGACTG AGAATGAGAA AATAAAGAGA ATGTTCAGAG AACTGTTTGA GTTGTATTTA AAGGATTTGG AAACGGAAAA TAAAGAATCC TCCATATATG AGTGGTTTTT AGATAATATG AGTGAAGAGT ATTTGCGTAG CAACAGCAAG CCAAGAATTG TGCTTGACTA TATTGCAGGA ATGACAGACG ACTTTTTTAA CAATGAGTTT AAGAGATATG TGCTTCCGAA AAGTTATGGG TATTGCATGG AACAGTGA
|
Protein sequence | MDSNEIYSEI IKNNLKREKI LSKYACKSIM GVRRHPEREE IEDRINIRPA FFHDTDRIIH SLAYTRYIDK TQAFFLFEND HITHRVLHVQ FVSKIARVIG RCLSLNEDLI EAIALGHDLG HVPYGHDGEN YLNEIINKKE NLYFNHNAQS VRFLMELENG ARGLNLTLQV LDGILCHNGE ILEKEYAPET NKNWDKFLED YEKCWTEKDY SKKLRPMTLE GCLVRICDII AYIGRDIEDA ITVKLIKRED IPGEVVRVLG NTNRDIINNL AKDIIENSYN RPYIMFSKDK YNALKLLLDF NYKYIYKNPV KMTENEKIKR MFRELFELYL KDLETENKES SIYEWFLDNM SEEYLRSNSK PRIVLDYIAG MTDDFFNNEF KRYVLPKSYG YCMEQ
|
| |