Gene GM21_2457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2457 
Symbol 
ID8137798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2871363 
End bp2872661 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content62% 
IMG OID644870067 
ProductEPSP synthase (3-phosphoshikimate 1-carboxyvinyltransferase) 
Protein accessionYP_003022258 
Protein GI253701069 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase 
TIGRFAM ID[TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones166 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACCG GAATTGTGGC AGAGGTACCC ACAGCGGTTG GTCAGATCGA TACGTCCAGT 
TATGTCGTCC ATCGCTCCAG GCTGGTAGGG ACAGTCAGTG TGAGCGGCGC CAAAAACAGC
GTGCTGCGGC TTCTGGCGGC CTCTCTTCTG ACCAGCGAGA GCATTGTGCT GAGGAACTAT
CCGGTCGGCC TGCTCGACGC CAAGGTACAC GTCCAGATGC TTGAAGTGCT GGGAAAAAAC
TGCGTCTGCG ACGGCGAGGA GATCACCATC ACGCAAGCTG CGGCTCCCCC GTCCCGTCTC
GACTGGCAGG GACGCTCCAT ACGCAACACC CTTCTCATCC TCGGGGCTTT GGTGGCCCGC
ACCGGTGAGG GGGCGGTTCC CCTCCCGGGC GGCTGCAAGT TAGGCGAGCG GAAGTACGAC
CTGCACGAGA TGTTGCTGCA GCGCTTGGGC GCCAAGGTGT GGGAGGAGGA CGGGATGCTC
TGCGCCCGCT CCACCGGACG CCTGGTCGGG ACCGATATCC ACCTGCCGAT CCGTTCCACC
GGTGCGACGG AGAACGCCAT TATCTGCGGT ACGCTGGCAA GCGGGGTCAC CAGGATCTGG
AATCCGCACA TCCGGCCGGA GATACTCGAC CTGATCCACT TGCTGCAGAG CATGGGGGCG
TCGATCAGGG TGTTCGGACA GGAGCACATA GAGGTGACCG GTGTCGAACA ACTCCACGGC
GCAAAGCATG TTGTCATCTC GGACAACATG GAGGCGATCA CCTGGCTGAT CGCATCGGTC
ATCACCGGCG GCGACATTGA GATCTTCAAC TTCCCCTATC GGGACCTGGA GGTCCCCCTC
ATTCACCTGA GGGAGAGCGG GGCGCGATTC TTCCGCGGCG ACAACAGCCT CATCGTAAGG
GGGGGGCGTT GCTACCCCGT CGACATCAGC ACCGGTCCGT ATCCGGGCAT AAACTCGGAT
ATGCAGCCGC TTTTCGCTGT TTACGGAGCG GTGGCGCAGG GGGAGACCCG CGTCATCGAC
CTCCGTTTCC CGGGACGCTA CGCCTATGCG GAGGAGCTGG CCAAGATGGG GGTCTCCTCT
GCCATCGACG GGAACCTCCT GAAAATAAGC GGAGGCAGGC CGCTCATCGG CGCGGAAGTG
CGGGCCCTTG ACCTTCGCGC AGGCATCGCC CTGACCCTGG CCGGACTGGT CGCTGACGGC
CGGACTGTGC TGCGCGAAGC ATGGCAGGTG GAGCGCGGTT ACAACAACTT CATGCACAAG
ATGCAGCAGC TTGGAGGAAA CATCTCCTAT GGCTGCTGA
 
Protein sequence
MNTGIVAEVP TAVGQIDTSS YVVHRSRLVG TVSVSGAKNS VLRLLAASLL TSESIVLRNY 
PVGLLDAKVH VQMLEVLGKN CVCDGEEITI TQAAAPPSRL DWQGRSIRNT LLILGALVAR
TGEGAVPLPG GCKLGERKYD LHEMLLQRLG AKVWEEDGML CARSTGRLVG TDIHLPIRST
GATENAIICG TLASGVTRIW NPHIRPEILD LIHLLQSMGA SIRVFGQEHI EVTGVEQLHG
AKHVVISDNM EAITWLIASV ITGGDIEIFN FPYRDLEVPL IHLRESGARF FRGDNSLIVR
GGRCYPVDIS TGPYPGINSD MQPLFAVYGA VAQGETRVID LRFPGRYAYA EELAKMGVSS
AIDGNLLKIS GGRPLIGAEV RALDLRAGIA LTLAGLVADG RTVLREAWQV ERGYNNFMHK
MQQLGGNISY GC