Gene Athe_2262 details

Gene Information

Locus tag: Athe_2262
Symbol:
ID: 7407681
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2398473
End bp: 2399756
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 32%
IMG OID: 643716628
Product: metal dependent phosphohydrolase
Protein accession: YP_002574107
Protein GI: 222530225
COG category: [R] General function prediction only
COG ID: [COG1078] HD superfamily phosphohydrolases
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
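
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_len = cds_length(2398473, 2399756)
print(gene_len)                  # 1284
print(protein_length(gene_len))  # 427
```

Both values agree with the Gene Length and Protein Length fields of this record.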


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000921964
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGAAA AATTATATGA GTTTCGTGAC CCAGTTCACG GTTTCATTTA TGTTCGACCT 
TTAGAACTTA AGCTCATTGA TTCTTTTCCA TTTCAGAGAT TGCGAAACAT AAAACAGTTA
GCCTTTTCAC ATTATATCTA CCATGGTGCT GAACATTCGA GGTTTGGACA TTCATTAGGA
GTTATGCATC TTGTTACAAG AGCTTTTAAT ACAGTAACTG AAAAAACAAA AATCTTTGAT
ATTGCTACGA AAGAGTGGTA TACTCAAATA TTGAGAATTA TAGCATTAGT TCATGATTTG
GGACATGCAC CATTTTCTCA TGCTTCTGAA GAGCTTTTGC CGGATGGTTT TTCACATGAA
GACTATACCC ATATGATAGT AACACAAACG GAAGTTGCTG ATTGTATTAG TGAGATTGGG
GAATGGTTTA AAAAGCAATA TGGTGAAGAG TATGATATTA CACCAGAATT GATATCTTCC
ATATATAAAG GAGAAAACAT AGAAAATCCT GATTTTATAT TTCTGAAGAA GTTTATGGAT
AGCGAACTCG ATTGCGATAA AATGGATTAT TTATTACGAG ACTCATTATA TTGTGGAGTT
AGTTATGGAA AATTTGATTT AGAAAGGCTT ATTAATACTC TCACTGTTTG GGAAAATGAA
GAAGGAGTGC TTTACCTTGC TATTGAAAAA GGTGGAATGC ATGCTTTCGA AGAATTTGTT
CTTGCAAGAT ATTTTATGTT TACCCAAGTT TATTTTTATA AAACAAGAAG GTTTTTAGAT
AATGCTCTTT TGTATTTTTT AAAAGGAGTG CTTCCAAATG GAAAGTATCC AGAAGATATT
CAAGAATTTT TGAAGTACGA TGATATTTAT GTGTTAGAAC TTATGAAACA AAATATAAAA
CAAAATGAAT GGGCAGAACG AATTTTAAAA AGAAAAATAC TAAGCAAAGT CTATGAAACT
CCTGTTCATG CTTCTGAAAA AGATCAACAA ATTTTCAACT TAGTTAAAAA CAACTTGGTG
GAAAGGATTG GCGAAGAATA TCTTATTTTA GATTCAGCCG ATAAACTTGT ACATCAAATG
CCAGTGAGGT ATGAGCTTGA TAGCGAGAAA GCAATTCCTG TAATTACTGA AAATGACAAA
AAAGTGATAC CAGTTAGTGT TGCCTCTGAA GTTATAAGAA AAATGACAGA GCCTATAAAC
ATAAAAAGAA TATACGTTTA CGAAGATAAG AAAGAAGAAG CAATAAAAAT TGTGAATGAG
ATGATGGAAA AAATGAGTAA ATAA
 
Protein sequence
MSEKLYEFRD PVHGFIYVRP LELKLIDSFP FQRLRNIKQL AFSHYIYHGA EHSRFGHSLG 
VMHLVTRAFN TVTEKTKIFD IATKEWYTQI LRIIALVHDL GHAPFSHASE ELLPDGFSHE
DYTHMIVTQT EVADCISEIG EWFKKQYGEE YDITPELISS IYKGENIENP DFIFLKKFMD
SELDCDKMDY LLRDSLYCGV SYGKFDLERL INTLTVWENE EGVLYLAIEK GGMHAFEEFV
LARYFMFTQV YFYKTRRFLD NALLYFLKGV LPNGKYPEDI QEFLKYDDIY VLELMKQNIK
QNEWAERILK RKILSKVYET PVHASEKDQQ IFNLVKNNLV ERIGEEYLIL DSADKLVHQM
PVRYELDSEK AIPVITENDK KVIPVSVASE VIRKMTEPIN IKRIYVYEDK KEEAIKIVNE
MMEKMSK
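
The relationship between the gene sequence and the protein sequence, and the GC content figure, can be reproduced with a short standard-library sketch. It assumes the sequence is pasted in the space-separated 10-base blocks used above. It relies on the fact that translation table 11 assigns amino acids to codons exactly as the standard genetic code does (the tables differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11 uses the same assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 24 bases of the gene sequence above.
print(translate("ATGAGTGAAA AATTATATGA GTTTCGTGAC"))  # MSEKLYEFRD
print(translate("ATGATGGAAA AAATGAGTAA ATAA"))        # MMEKMSK
```

Passing the full gene sequence to `translate` should give the protein sequence listed above, and `gc_content` on the same input can be compared with the GC content field.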