Gene GM21_0074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0074 
SymbolglmU 
ID8135373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp96452 
End bp97828 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content65% 
IMG OID644867691 
Productbifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase 
Protein accessionYP_003019919 
Protein GI253698730 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value4.6740700000000004e-33 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGTAACA AGATAGCTGC GATCGTACTG GCGGCAGGCA TGGGGACCAG GATGAAGTCC 
GACTTGGTCA AGGTGATGCA TCCGGTGGCC GGCGTGCCGA TGATCCAATG GCCCGTCGCC
GCGGCCTTTG CCTCCGGCGT GGAGCGCTGC GTGCTGGTGG TCGGCCATCA GCAGGAGAAG
GTGCGCGAAC ATTTCGCCGG CCGGGGGGAG GTGAGCTTCG CCCTGCAGTC CGAGCAACTC
GGAACCGGGC ACGCGGTACG CTGCGCCATG ACCGAGCTCG ATCCCGGCGC GGACACTGTG
CTGATTCTCT GCGGCGACAC CCCGCTTCTT GAGGCCAAAA GCCTCGCGGG AATGCTCAAG
GCGCACCGGG AGACGAAAGC CTGCCTCACC GTGATGACCG CGACCCTGGG GAATCCCTTC
GGCTACGGCA GGATCGTGAA GGACGCAACA GGAAAGGTCA CCGCCATCAC CGAGGAGAAG
GACGCGACTG AAAAGGAGCG CCTGATAGAC GAGGTGAACG CCGGGGTCTA CTGCGTCGAC
CGCGCCTTCC TGGAAAAATC CATCACCTCC ATCAAAAACG ACAACGCCCA GGGCGAGTAT
TACCTCACCG ACGTGGTGCG GCAGGCGGCG CAGCAGGGAC TCGCCTGCCT GAGCTTCAAG
GTCGCCGACC CGCGTGAGAT CAGCGGGGTC AACGACCGGG CCCAGATGGC CGAGGCGGCG
CGGGTGCTTC GGGGACGGAT CAACCGCGAA CTGATGCTCT CCGGCGTCAC CATGATCGAC
CCCGAGACGG TCTACATCGA CCGCGGCGTC CGCATCGGGC GCGACAGCGT GGTCTACCCG
GGAGCCACCA TCGAGGGGAA CACTGTGATC GGCGAGCGCT GCGTCATCGG CCAAGGGTCG
CTGATCCAGA ACTGCAGCAT AGCCGACGAC GTCGCTGTCA AGGCGGGCAG CGTGCTGGAG
GATTCCAAGG TCGGTCCCGA GGCGGCCATA GGCCCCATGG CGCATCTGCG TGCCGGGACC
GAACTCTCCG CCCACGTGAA GATCGGCAAC TTCGTCGAGA CCAAGAAGGC CTTCATGGGC
GAGGGGTCCA AGGCCTCGCA CCTCACCTAC CTGGGGGACG CCACCATCGG CCGGGACGTG
AACATCGGCT GCGGCACCAT CACCTGCAAC TACGACGGGG TGAAAAAGCA CAAGACCGTG
ATCGAGGACG GCGTCTTCGT GGGGAGCGAC GTGCAACTGG TGGCGCCGGT CACCGTGGGG
AGGAACTCCC TGATCGCGGC AGGGACCACC GTCACCAAGG ACGTCCCTGC GGACTCGCTC
GCCATCGCCC GCTCCCCCCA GGTCAACAAG GAAGGGTGGA CCCTCCGGAA AAAATAA
 
Protein sequence
MGNKIAAIVL AAGMGTRMKS DLVKVMHPVA GVPMIQWPVA AAFASGVERC VLVVGHQQEK 
VREHFAGRGE VSFALQSEQL GTGHAVRCAM TELDPGADTV LILCGDTPLL EAKSLAGMLK
AHRETKACLT VMTATLGNPF GYGRIVKDAT GKVTAITEEK DATEKERLID EVNAGVYCVD
RAFLEKSITS IKNDNAQGEY YLTDVVRQAA QQGLACLSFK VADPREISGV NDRAQMAEAA
RVLRGRINRE LMLSGVTMID PETVYIDRGV RIGRDSVVYP GATIEGNTVI GERCVIGQGS
LIQNCSIADD VAVKAGSVLE DSKVGPEAAI GPMAHLRAGT ELSAHVKIGN FVETKKAFMG
EGSKASHLTY LGDATIGRDV NIGCGTITCN YDGVKKHKTV IEDGVFVGSD VQLVAPVTVG
RNSLIAAGTT VTKDVPADSL AIARSPQVNK EGWTLRKK