Gene GM21_2316 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2316 
Symbol 
ID8137656 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2695004 
End bp2696137 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content60% 
IMG OID644869930 
Productdeoxyguanosinetriphosphate triphosphohydrolase-like protein 
Protein accessionYP_003022122 
Protein GI253700933 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG
[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones109 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCGGG CCGATCTGGC GGGGTACGCA GCGACCAGCG CCGGTTCCAA GGGGCGCAGG 
TACCAGGAGG AGTTTCGGGA CAACAGGCCC GCCTTCGAGA GGGACCGCGA CCGCATCATC
CACTGTGCCG CCTTCAGGCG CCTGGAGTAC AAGACCCAGG TCTTCGTCAA CCACGAAGGG
GACTACTACC GCACCCGGCT CACCCATTCC CTCGAAGTGG CGCAGATAGG GAAGGGGATA
GCGAGGAGGC TCGGGCTGAA CGAAGAGCTG ACCGAGACCC TGGCGCTCGC CCACGACCTC
GGGCACACCC CCTTCGGCCA CACCGGCGAA GAGGTGCTGA ACCGGCTCAT GGAAGGAGCG
GGCGGTTTCG AGCACAACCT GCAGTCGCTT CGGGTGGTCG ACGAATTGGA GGAGCGCTAC
CCGCACTTCA ACGGGCTGAA CCTTTCCTGG GAGGTGCGCG AAGGGATCGT CAAGCACTCC
TCGCAGTACG ACACACCCGC CTCCTCCGTC TACCAGGAGT TCCTCCCCGG CACCGTTTCC
AGCATCGAGG CGCAGTTGAT CAACTTCGCC GACGAGATCG CCTACAACAA CCACGACATC
GACGACGGGC TCAAATCCGG GTACATCAAC CTGAACCAGC TGAAGAATGT GGAGCTCTGG
AGCGAGGTGC ACGAGAGGAT ACTCGTCAAA TATCCCGGCA TCGACATCGA CCGCGGCGTC
TGCCAGACCG TGAGCGCGCT GATCGGCGTG TTGATCACCG ATTTGGTCAA CACCACCGCC
GAGAATCTGA GGACGCTCAA GATAGAGACC CAGGAGGACC TGAAGCGGGT GAATCTTCCC
GTCGTCGCCT TCAGCCCCGC CATGGCGGAT AGCAACGCCC AGTTGAAGCG CTTTTTGTTC
CAGAACCTGT ACCGGCATTA CAAGGTAGAG CGGATGAGGG TCAAGGCCGA GCGTTATCTG
GCCGAGCTTT TCGAGATGTA CATCAAGCAC CCGACGCTGT TGCCCATGAA GCACCAGGTG
AAGATGGAAC GTGACGGAAG GGAGCGGGTG ATTTGCGACT ACATAGCCGG GATGACCGAT
AGGTTTGCGC TGGACGAGTT CAAGAGGCTC TTCGAGCCGT ATGAAAGGGT TTAA
 
Protein sequence
MERADLAGYA ATSAGSKGRR YQEEFRDNRP AFERDRDRII HCAAFRRLEY KTQVFVNHEG 
DYYRTRLTHS LEVAQIGKGI ARRLGLNEEL TETLALAHDL GHTPFGHTGE EVLNRLMEGA
GGFEHNLQSL RVVDELEERY PHFNGLNLSW EVREGIVKHS SQYDTPASSV YQEFLPGTVS
SIEAQLINFA DEIAYNNHDI DDGLKSGYIN LNQLKNVELW SEVHERILVK YPGIDIDRGV
CQTVSALIGV LITDLVNTTA ENLRTLKIET QEDLKRVNLP VVAFSPAMAD SNAQLKRFLF
QNLYRHYKVE RMRVKAERYL AELFEMYIKH PTLLPMKHQV KMERDGRERV ICDYIAGMTD
RFALDEFKRL FEPYERV