Gene Avin_51920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_51920 
SymbolglmU 
ID7764029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp5294919 
End bp5296283 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content70% 
IMG OID643808008 
ProductUDP-N-acetylglucosamine pyrophosphorylase; GlmU 
Protein accessionYP_002802242 
Protein GI226947169 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCTCG ATATCGTCAT TCTCGCCGCC GGCCAGGGCA CGCGCATGCG TTCCGCCCTG 
CCGAAAGTCC TGCACCCGGT CGCCGGCAAT TCGATGCTCG GCCATGTCGT CGCCACGGCC
CGCCAACTGC AGCCGCAGGG CATCCACGTG GTGATCGGGC ACGGCGCCGA ACGGGTGCGC
GAGCGGCTGG CGGCGGACGA CCTGAACTTC GTCCTGCAGG CCGAGCAACT GGGCACCGGG
CACGCCGTGG CCCAGGCGCT GCCGGCACTG TCCGCCGAGC GGGTGCTGAT CCTCTACGGC
GACGTGCCGC TGATCGAGGC GGACACCCTG CGCCGCCTGC TGGCGCAGGT CGGCCCCGAG
CGCCTGGCCC TGCTCACCGT GGACCTGGTC GATCCCAGCG GCTACGGGCG GATCGTCCGC
GATGCCGCCG GGCGGGTGGT CGCCATCGTC GAGCACAAGG ACGCCAGCCC CGAGCAGCGC
GCCATCTGCG AGGGCAACAC CGGCATCCTC GCGGTGCCCG GCGCGCGCCT GGCCGACTGG
CTGGGGCGGC TGTCCAACGA CAATGTCCAG GGCGAGTACT ACCTCACCGA CGTGATCGCC
ATGGCGGTGG CCGACGGCCT GACGATCGCC ACCGAGCAGC CGCAGGACGC CATGGAGGTG
CAGGGCGCCA ACGACCGCCT GCAGCTCGCC CAACTGGAGC GCCACTACCA GTCGCGCGTC
GCCCGAAGGC TGATGGCCCA GGGCGTGACC CTGCGCGATC CGGCGCGATT CGACCTGCGC
GGCGAAGTCG AGGTCGGCCG CGACGTGCTG ATCGACGTCA ATGTGATCCT CGAAGGCAAG
GTGATCATCG AGGACGGCGT GGAAATCGGC CCGAACTGCA CGATCAAGGA CAGCACCCTG
CGCCGGGGCG CCCAGGTCAA GGCCAACAGC CACCTGGAAG GCGCCGAGCT GGGCGAGGGC
GCCGACTGCG GTCCCTTCGC CCGCCTGCGT CCGGGCGCGG TGCTGGGTGC CAAGGCCCAC
GTCGGCAACT TCGTCGAGCT GAAGAACGCC GTGCTGGGCG AGGGCGCCAA GGCCGGGCAC
CTGTCCTACC TGGGCGATGC CGAGATCGGC GCGCGGACCA ACATCGGCGC CGGCACCATC
ACCTGCAACT ACGACGGCGC CAACAAGTTC AGGACGGTGA TGGGCGAGGA TGTGTTCATC
GGCTCGAACA GCGCCCTGGT CGCCCCGGTC GAACTCGGCG CCGGCGCCAC CACCGGCGCC
GGCTCGGTGA TCACCGAGGA TGTGCCGGCC GGCAACCTGG CCCTCGGCCG TGGACGCCAG
CGCAATATCG AAGGCTGGCA GCGGCCCACC AAGCAAAAGA AATAG
 
Protein sequence
MSLDIVILAA GQGTRMRSAL PKVLHPVAGN SMLGHVVATA RQLQPQGIHV VIGHGAERVR 
ERLAADDLNF VLQAEQLGTG HAVAQALPAL SAERVLILYG DVPLIEADTL RRLLAQVGPE
RLALLTVDLV DPSGYGRIVR DAAGRVVAIV EHKDASPEQR AICEGNTGIL AVPGARLADW
LGRLSNDNVQ GEYYLTDVIA MAVADGLTIA TEQPQDAMEV QGANDRLQLA QLERHYQSRV
ARRLMAQGVT LRDPARFDLR GEVEVGRDVL IDVNVILEGK VIIEDGVEIG PNCTIKDSTL
RRGAQVKANS HLEGAELGEG ADCGPFARLR PGAVLGAKAH VGNFVELKNA VLGEGAKAGH
LSYLGDAEIG ARTNIGAGTI TCNYDGANKF RTVMGEDVFI GSNSALVAPV ELGAGATTGA
GSVITEDVPA GNLALGRGRQ RNIEGWQRPT KQKK