Gene GM21_3802 details


Gene Information

Locus tag: GM21_3802
Symbol:
ID: 8139176
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4373383
End bp: 4374648
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 64%
IMG OID: 644871421
Product: UDP-N-acetylglucosamine 1-carboxyvinyltransferase
Protein accession: YP_003023579
Protein GI: 253702390
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase
TIGRFAM ID: [TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase
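
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function name is hypothetical and used only for illustration.

```python
def gene_and_protein_length(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute a residue to the protein.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span
    codons = gene_length // 3             # whole codons in the CDS
    protein_length = codons - 1           # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above.
print(gene_and_protein_length(4373383, 4374648))
```

With the values listed for GM21_3802 this yields 1266 bp and 421 aa, matching the Gene Length and Protein Length fields.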


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 116
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAAAAAC TGGTAATTAA AGGTGGCAAC AAGCTCTCCG GAGAAGTGAC CGTTAGCGGA 
TCCAAGAACG CTGCCCTCCC CATATTCATC TCCACCATCC TGGCTCCGGG ATGCCACACC
ATCAGCAACG TTCCCTTCCT TAGGGACATC AACACCACCA TCAAGGTGCT GGAGAAGCTG
GGGGCGACGG TCGACGGCAG GGGGAACGTG GTCAAGATCG ACACCACCAA CCTGAACAGC
TTCGAGGCGA CCTACGACCT GGTGCGCACC ATGCGCGCCT CGGTCCTCGT GCTCGGCCCG
CTCCTGGCGC GCTTCGGCCA GGCCCGCGTC TCGCTTCCGG GGGGATGCGC CATAGGCGCC
CGCCCGATCA ACCTGCACCT CAAAGGGCTA GCGGCGCTGG GCGCCGAGAT CACCCTGGAG
CACGGCTACG TCGAGGCGAA GGCGAAAAAG CTCAAGGGGG CGCGCATCAA CTTCGACATC
TCCACCGTCG GCGGCACCGA GCAGCTCCTG ATGGCGGCGG CCACGGCGCA GGGGGAGACC
GTTCTGGAGA ACGCGGCGCG TGAGCCGGAG ATCGTCGATC TCGCCGAGAT CCTGATCAAG
ATGGGGGCAG ACATCGAGGG GGCCGGCACC GACACCATCC GCATCAAGGG GGTCGAGGCG
CTCACCGCCG CCGAGCACGC CGTGATGCCG GACCGCATCG AGGCCGGGAC CTTCATGATC
GCATCCGCCA TCACCGGCGG CGATATCAAG ATCAAGAACA TGCGTCTGGA CCACCTGGAC
GCACTCTCCT TCAAACTGCA GGACGCCGGC GTCGAGATCA CCAACAAGGA CAACATGGTC
CGCGTCAAAG GCCCCAAGAA GATCCGGAAC GTGAACATCA AGACGAGACC GTACCCCGGT
TTTCCGACTG ACATGCAGGC CCAGTTCATG GCGCTCATGT GCATCGCCGA GGGGGCCAGC
GTCATCTCGG AGAACATCTT CGAGAACCGC TTCATGCACG TCTCCGAGCT GCTTCGCTTC
GGCGCCGACA TCATCTGCGA GGGGAACAGC GCCACGGTGA AGGGGGTCAA GAAGCTCTCC
GGGGCTCCGG TCATGGCCAC CGACCTGCGC GCCTCCGCGT CGCTGATTCT GGCAGCCCTC
GCCGCCGACA ACACCAGCGA GATCTCCAGG ATCTACCACC TGGACCGCGG CTACGAAAGC
ATCGAGAAGA AGCTCGCCGG TCTCGGCGCC GACATAGCCC GCGTCCCGGA CGAAGAAGGC
CCCTAG
 
Protein sequence
MEKLVIKGGN KLSGEVTVSG SKNAALPIFI STILAPGCHT ISNVPFLRDI NTTIKVLEKL 
GATVDGRGNV VKIDTTNLNS FEATYDLVRT MRASVLVLGP LLARFGQARV SLPGGCAIGA
RPINLHLKGL AALGAEITLE HGYVEAKAKK LKGARINFDI STVGGTEQLL MAAATAQGET
VLENAAREPE IVDLAEILIK MGADIEGAGT DTIRIKGVEA LTAAEHAVMP DRIEAGTFMI
ASAITGGDIK IKNMRLDHLD ALSFKLQDAG VEITNKDNMV RVKGPKKIRN VNIKTRPYPG
FPTDMQAQFM ALMCIAEGAS VISENIFENR FMHVSELLRF GADIICEGNS ATVKGVKKLS
GAPVMATDLR ASASLILAAL AADNTSEISR IYHLDRGYES IEKKLAGLGA DIARVPDEEG
P
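
The protein sequence can be related back to the gene sequence with a minimal translation sketch. It assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard code, and that the alternative start codon GTG at the first position is read as methionine, which is why the protein begins with M although the gene begins with GTG. The names `CODON_TABLE` and `translate_cds` are hypothetical; this is not the pipeline used to produce the record.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring the spaces and line breaks used for grouping.

    The first codon is always emitted as 'M' (start codon), and
    translation stops at the first stop codon.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":          # stop codon ends the protein
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate_cds("GTGGAAAAAC TGGTAATTAA AGGTGGCAAC"))
```

Applied to the first three 10-base groups of the gene sequence, the sketch gives MEKLVIKGGN, the first block of the protein sequence.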