Gene GM21_1080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1080 
Symbol 
ID8136402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1266574 
End bp1267557 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content67% 
IMG OID644868691 
ProductUDP-N-acetylenolpyruvoylglucosamine reductase 
Protein accessionYP_003020899 
Protein GI253699710 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0812] UDP-N-acetylmuramate dehydrogenase 
TIGRFAM ID[TIGR00179] UDP-N-acetylenolpyruvoylglucosamine reductase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value2.21619e-24 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTCATCG ACCAGGCAAA AGAAAACCAG CAGATAGAGC TCTTGCGGGA CGTCCCCCTC 
GCCCCCTTCA CCTCGTTCAA GATCGGCGGC CCGGCCAGGT TCTTGACCAT GGCCCGGACG
CTGGAACAAC TGAAGCAGGC GCTTTCCTTC GCCAGGCGGG AAGGGATTCC CTTCCTCATC
GTCGGGGGGG GATCCAACCT GCTGGTGAGC GACCGCGGTT TCGACGGCAT CGCGATCAGG
CTGCAGCTGA AAGGGATCAA GGTCCAAGGG AACCGGGTCG AGGCGCAGGC GGGAGTCGAC
CTCATGGCGC TGGTGGAGCA TGCGGCACAC TGGGGGCTGG CGGGGATCGA GCGGCTGGCT
GGCATTCCGG GGCTCTTCGG GGGGGCGGTG CGCGGCAATG CGGGGGCCTA CGGCAGTTGC
ATCGGCGACG TGATCGAGAG GGTCTACGCG CTCCGGACGG AGACCATGGA GCTGGTCGCG
CTCACGCGGG ACGACTGCCG GTTCCAGTAC CGCGACAGCC GTTTCAAGAA GGATCACGGG
CTGGTGGTGG TGGCGGCGAG CCTGCTGCTT GAGCCGGCGG ACCCCCAGGA GATCCTGCGC
CAGGCCGAGG CGACGGTGAG GAAACGGCAA GCCCGCCGGC TGCAATGCGA CCGGAGCGCC
GGCTCTTTCT TCATGAATCC GGTGGTGCGC GACCCAGAGC TGATCCGGAG GTTCGAAACC
GAGCAGGGAA CCCACTGCAG GGACGGCAGG ATTCCCGCCG GATGGCTCAT CGACAAGGCC
AGGCTGCGCA GCCTCGCGGT GGGTGCGGCC ATGGTCAGCC CACGGCACGC CAATTACCTG
ATCAACACCG GCAACGCCAG CGCCCAGGAG GTGGTGAGGC TCGCCGAGCT GGTGAAGGAC
GAGGTGCGGG CGTCGCTGGG GGTGCAGTTG GAGGAGGAGG TGAGCTGCGT CGGCTTCACT
CAGGCTGCGC CGCTTCCCTC CTGA
 
Protein sequence
MFIDQAKENQ QIELLRDVPL APFTSFKIGG PARFLTMART LEQLKQALSF ARREGIPFLI 
VGGGSNLLVS DRGFDGIAIR LQLKGIKVQG NRVEAQAGVD LMALVEHAAH WGLAGIERLA
GIPGLFGGAV RGNAGAYGSC IGDVIERVYA LRTETMELVA LTRDDCRFQY RDSRFKKDHG
LVVVAASLLL EPADPQEILR QAEATVRKRQ ARRLQCDRSA GSFFMNPVVR DPELIRRFET
EQGTHCRDGR IPAGWLIDKA RLRSLAVGAA MVSPRHANYL INTGNASAQE VVRLAELVKD
EVRASLGVQL EEEVSCVGFT QAAPLPS