Gene Information |
Locus tag | Athe_2262 |
Symbol | |
ID | 7407681 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2398473 |
End bp | 2399756 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 643716628 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_002574107 |
Protein GI | 222530225 |
COG category | [R] General function prediction only |
COG ID | [COG1078] HD superfamily phosphohydrolases |
TIGRFAM ID | [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
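The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon (the function names are illustrative and not part of the record):

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 2398473
end_bp = 2399756


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp: int) -> int:
    """Residues encoded by an unspliced CDS of n_bp that ends in a stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # prints "1284 427"
```

Both values agree with the Gene Length and Protein Length rows. The strand does not affect the length, only the direction in which the sequence is read.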
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000921964 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAAA AATTATATGA GTTTCGTGAC CCAGTTCACG GTTTCATTTA TGTTCGACCT TTAGAACTTA AGCTCATTGA TTCTTTTCCA TTTCAGAGAT TGCGAAACAT AAAACAGTTA GCCTTTTCAC ATTATATCTA CCATGGTGCT GAACATTCGA GGTTTGGACA TTCATTAGGA GTTATGCATC TTGTTACAAG AGCTTTTAAT ACAGTAACTG AAAAAACAAA AATCTTTGAT ATTGCTACGA AAGAGTGGTA TACTCAAATA TTGAGAATTA TAGCATTAGT TCATGATTTG GGACATGCAC CATTTTCTCA TGCTTCTGAA GAGCTTTTGC CGGATGGTTT TTCACATGAA GACTATACCC ATATGATAGT AACACAAACG GAAGTTGCTG ATTGTATTAG TGAGATTGGG GAATGGTTTA AAAAGCAATA TGGTGAAGAG TATGATATTA CACCAGAATT GATATCTTCC ATATATAAAG GAGAAAACAT AGAAAATCCT GATTTTATAT TTCTGAAGAA GTTTATGGAT AGCGAACTCG ATTGCGATAA AATGGATTAT TTATTACGAG ACTCATTATA TTGTGGAGTT AGTTATGGAA AATTTGATTT AGAAAGGCTT ATTAATACTC TCACTGTTTG GGAAAATGAA GAAGGAGTGC TTTACCTTGC TATTGAAAAA GGTGGAATGC ATGCTTTCGA AGAATTTGTT CTTGCAAGAT ATTTTATGTT TACCCAAGTT TATTTTTATA AAACAAGAAG GTTTTTAGAT AATGCTCTTT TGTATTTTTT AAAAGGAGTG CTTCCAAATG GAAAGTATCC AGAAGATATT CAAGAATTTT TGAAGTACGA TGATATTTAT GTGTTAGAAC TTATGAAACA AAATATAAAA CAAAATGAAT GGGCAGAACG AATTTTAAAA AGAAAAATAC TAAGCAAAGT CTATGAAACT CCTGTTCATG CTTCTGAAAA AGATCAACAA ATTTTCAACT TAGTTAAAAA CAACTTGGTG GAAAGGATTG GCGAAGAATA TCTTATTTTA GATTCAGCCG ATAAACTTGT ACATCAAATG CCAGTGAGGT ATGAGCTTGA TAGCGAGAAA GCAATTCCTG TAATTACTGA AAATGACAAA AAAGTGATAC CAGTTAGTGT TGCCTCTGAA GTTATAAGAA AAATGACAGA GCCTATAAAC ATAAAAAGAA TATACGTTTA CGAAGATAAG AAAGAAGAAG CAATAAAAAT TGTGAATGAG ATGATGGAAA AAATGAGTAA ATAA
|
Protein sequence | MSEKLYEFRD PVHGFIYVRP LELKLIDSFP FQRLRNIKQL AFSHYIYHGA EHSRFGHSLG VMHLVTRAFN TVTEKTKIFD IATKEWYTQI LRIIALVHDL GHAPFSHASE ELLPDGFSHE DYTHMIVTQT EVADCISEIG EWFKKQYGEE YDITPELISS IYKGENIENP DFIFLKKFMD SELDCDKMDY LLRDSLYCGV SYGKFDLERL INTLTVWENE EGVLYLAIEK GGMHAFEEFV LARYFMFTQV YFYKTRRFLD NALLYFLKGV LPNGKYPEDI QEFLKYDDIY VLELMKQNIK QNEWAERILK RKILSKVYET PVHASEKDQQ IFNLVKNNLV ERIGEEYLIL DSADKLVHQM PVRYELDSEK AIPVITENDK KVIPVSVASE VIRKMTEPIN IKRIYVYEDK KEEAIKIVNE MMEKMSK
|
| |
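The gene and protein sequences above can be checked against each other by translating the nucleotides. The sketch below is illustrative only: it assumes the space-separated 10-base blocks are purely a display convention, and it uses the standard codon assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that does not matter here). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest, order T, C, A, G).
# Translation table 11 assigns the same amino acids as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases and last 12 bases of the gene sequence above.
print(translate("ATGAGTGAAA AATTATATGA G"))  # prints "MSEKLYE"
print(translate("ATGAGTAA ATAA"))            # prints "MSK"
```

The two outputs match the first seven and last three residues of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the GC content row and the complete protein.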