Gene GM21_1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1994 
Symbol 
ID8137328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2312785 
End bp2314317 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content63% 
IMG OID644869607 
ProductPpx/GppA phosphatase 
Protein accessionYP_003021804 
Protein GI253700615 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.00000000000000431495 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAAGCAGA CCAGGCTTGC CGCCATCGAC ATCGGCACCA ACTCCATCCG CAGCATAATC 
ATCGAGACCT CCGGAAACGG CAAATACAAG ATCCTTGACG ACGAGAAGGT GCTGGTGCGG
CTGGGCGAAG GGCTGCACCA AAGCGGCGCC ATCTCCCCCG CTGCCTGCAG CCGCGCGTTG
GAGGCCCTTT CCCGACAAAA GAAAATCATA GACGGCTACG GCGTCGCCTC CATCGAGGCG
GTGGCCACCA GCGCGATGCG CAAGGCGAGT AACGGCGCCG CCCTGGTGCA GGCGATCAAG
GACGCCACCG GCGTCGAGGT GGAAGTCATC AGCGGCGAGG AGGAGGCCGA ACTCGCGGCC
CTGAGCGCCG CGCACAATTT CGAGCTGGAA GGGGTCAGGC ACCTTATCTT CGACATCGGC
GGCGGGAGCA TGGAGCTGAT AGCCGCGCTC GGCTCCCATA CCGAGGAGAT GATCTCCCTG
GAACTGGGAG CGGTTTTCCT CACCGAGAGC TTCCTCAAGG GAGACCCGGT GCACCCCTCC
GAGCACGAAA AGCTGCGCAA GCACGTCCGC AAGACGCTGA AGCGGGCCTA TACCGGGGAA
CGCAGCGGCA TGCAGTGCCT GGTAGGGTCC GGCGGAACCG TCACCTCGAT CGCCGCCATG
ATCGCCGCCA CCAGGAAGGA GAAGTACGAC TCGGTGCACG GCTACGAGCT CCTCCGCTCG
GAGGTGGTGC ACCTTCTGGC GATGCTGGTC AGAAAGAACG ACAAGGAGCG GCGTACCATC
CCCGGGCTCA ACCCGGACCG ATCCGACATC ATCGTGGCCG GGGTCACCGT AATCGACGAA
CTGATGGATT TTTTCCAGGT GAACCTGCTC AAGGTGAACG AGCGGGGGAT CAGGGAAGGG
CTGATACTGA GGGGGCTGCG GCGGCAGAAC CTGCTCCCCC ACGAGAAAAG GACCCGCTCC
TGGCGCAACT CGGCGCTGGA GTTCGGCCAT TCCTGCCATT TCGACCAGGG TCACGCGGAG
CATGTGGCCA AACTGGCCCA GCAGGTGGCG AAGGCATTGG CGCCCAAGTT CAAGCTGGCC
GAACGGGAGC TGCGGCTACT GGAGGCGGCG GCGCTTTTGC ACGACGTCGG GTATTTCATC
AACTATTCCA GTCACCACAA GCACTCCTAC CACCTGATCC GCCATGCCGA CCTCTTCGGT
TTCACCCCGC GCGAACGGGA GTTGATCGCC AACGTGGCGC GCTACCACCG TAAATCTATC
CCCAAGAAAA AACACGACCA GTTCGTGCGG CTTCCGGCTG GCGACCAGTT GCTGGTTTCG
CGCCTGGGAG GGATCCTGCG GCTTTGCGAC GGGCTGGACC GGCGCCGAAA TGGAGTGGTT
AAAGAGCTTC GCTGCCGGCT TTCGCCGGAC GGCACGCTGC GCGTGACCCT GGTGGGCGAT
GAGGACATGT CGGTGGAACT CTACGGTGCG AAGGCCAAGG GAGACCTGCT GCAGGAGGCT
TTCCATCTGA AGCTTGCGCT GGAGGCGGGC TGA
 
Protein sequence
MKQTRLAAID IGTNSIRSII IETSGNGKYK ILDDEKVLVR LGEGLHQSGA ISPAACSRAL 
EALSRQKKII DGYGVASIEA VATSAMRKAS NGAALVQAIK DATGVEVEVI SGEEEAELAA
LSAAHNFELE GVRHLIFDIG GGSMELIAAL GSHTEEMISL ELGAVFLTES FLKGDPVHPS
EHEKLRKHVR KTLKRAYTGE RSGMQCLVGS GGTVTSIAAM IAATRKEKYD SVHGYELLRS
EVVHLLAMLV RKNDKERRTI PGLNPDRSDI IVAGVTVIDE LMDFFQVNLL KVNERGIREG
LILRGLRRQN LLPHEKRTRS WRNSALEFGH SCHFDQGHAE HVAKLAQQVA KALAPKFKLA
ERELRLLEAA ALLHDVGYFI NYSSHHKHSY HLIRHADLFG FTPRERELIA NVARYHRKSI
PKKKHDQFVR LPAGDQLLVS RLGGILRLCD GLDRRRNGVV KELRCRLSPD GTLRVTLVGD
EDMSVELYGA KAKGDLLQEA FHLKLALEAG