Gene GM21_1457 details


Gene Information

Locus tag: GM21_1457
Symbol:
ID: 8136786
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1715486
End bp: 1716730
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 40%
IMG OID: 644869069
Product: deoxyguanosinetriphosphate triphosphohydrolase
Protein accession: YP_003021271
Protein GI: 253700082
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
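
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1715486, 1716730)
print(length)                  # 1245
print(protein_length(length))  # 414
```

Both results match the Gene Length and Protein Length values listed in the record.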


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 89
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGATAT ATAGCAAAGA GACAATAGAT TACGCTCAGG AGCTTGCTCG CGAAATTGGA 
CAGTATGATC TTCGCTTTTA TAGGTCTGCT GTGGATGAAG AAGAGCGTGC AAGAAATGAT
TATCAAAGAG ACTATGCACG CGTACTTTAC TCATCCTCGT TCAGGCGGTT GCAGGGGAAG
ATGCAGCTTC TCGGGATTGA TGATGCCCAT TTTCATAGAA ACCGACTCAC ACATAGTCTG
GAAGTAGCAC AGATTGCGAG GACGATAGCG GCAAGTCTGG GCATACCAGA TCATATTGTA
GCAGAGACTT GTTCCTTGGC CCATGATCTT GGCAACCCAC CCTTTGGCCA TCAGGGAGAG
GTCATTCTTA ATGAACTTGC AAACCCTATC GGTGGTTATG AGGGAAATGC ACAAACGTTT
AGAATTCTCA ATTTCCTGGA AAAAAAATCC CATACGTACA GAGGGCTGAA TCTCACATTC
AGGACCCTTA TGGGTGTGGT CAAGTATTTC TATAATAAAG AATCCAACTC AAAGAAGTTT
TTATACGATG ATGATTTTGC ACTTTTGGAC CAGGTGTGCT CTGCCCTCGG ATTAAAATAT
CGCCGCACAA TTGATATGCA AATAATGGAT CTTGCAGATG AAATCGCCTA TGCTGCTCAT
GATTTGGACG ATGCACTCAG CTTTAATATG CTCTCTATAG ATGAACTTAT ATATGAATTC
AAAATTAATA AAGAGTATTC AGACGCATTT GAAGTTTTGA AGGAAATAGT CAAAGAGTGT
CAGAGTTTCG GATTCAGTAG CAAAGAGTTG AGTACATCAG AGGAATACAC CTTTCTATTC
AGGAAGGAAT TGACATCAAA GATAGTGAGT AAATTGATAA ATGATATACA TGTCTCTGAT
CAGAACACAA ACAAGGAACT TTCTTTGTGC TTTAATAAAT ATGAAGGCCT TGCTGTAGGA
CTAAAAAAAC TATTATTTAA GGCAATTCTT AGAAAAACAT CTGTTCAGAT CTACGAAAAG
AAGGGTGAGA AGGTCATTAG GGGGTTGTTC CAAGTATACA GTGATCCGAC CTTTAACAAA
GGTCAAATAC TATTACCTCC TGAATACAGG CAAGGGAAAT CCGAAAGGCA GAAAAGTAGA
TATGTTATTG ACTATATTGC TGGAATGATG GACTCGTTTG CCATGCATGA GTATGTTAAA
TACTTTGGTC ATAACTCCTT GGAAAGAATA TATCGTGATG TCTGA
 
Protein sequence
MSIYSKETID YAQELAREIG QYDLRFYRSA VDEEERARND YQRDYARVLY SSSFRRLQGK 
MQLLGIDDAH FHRNRLTHSL EVAQIARTIA ASLGIPDHIV AETCSLAHDL GNPPFGHQGE
VILNELANPI GGYEGNAQTF RILNFLEKKS HTYRGLNLTF RTLMGVVKYF YNKESNSKKF
LYDDDFALLD QVCSALGLKY RRTIDMQIMD LADEIAYAAH DLDDALSFNM LSIDELIYEF
KINKEYSDAF EVLKEIVKEC QSFGFSSKEL STSEEYTFLF RKELTSKIVS KLINDIHVSD
QNTNKELSLC FNKYEGLAVG LKKLLFKAIL RKTSVQIYEK KGEKVIRGLF QVYSDPTFNK
GQILLPPEYR QGKSERQKSR YVIDYIAGMM DSFAMHEYVK YFGHNSLERI YRDV
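
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-percentage helper; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the gene sequence is used here for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "GTGTCGATAT ATAGCAAAGA GACAATAGAT TACGCTCAGG AGCTTGCTCG CGAAATTGGA "
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(seq[:3])   # GTG
```

The gene begins with GTG, which is an accepted start codon under translation table 11 and is read as methionine at the initiation position, matching the leading M of the protein sequence. Running `gc_percent` on the full cleaned gene sequence is one way to compare against the GC content field in the record.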