Gene BMA10229_A2041 details

Gene Information

Locus tag: BMA10229_A2041
Symbol: glmU
ID: 4793639
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei NCTC 10229
Kingdom: Bacteria
Replicon accession: NC_008836
Strand:
Start bp: 2066907
End bp: 2068592
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: UDP-N-acetylglucosamine pyrophosphorylase
Protein accession: YP_001028005
Protein GI: 124383571
COG category:
COG ID:
TIGRFAM ID:
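
The coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2066907
end_bp = 2068592
gene_length_bp = 1686
protein_length_aa = 561

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codon_count, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and does not encode a residue.
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)
print(remainder == 0)
print(expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values.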


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCGCGCA TCGCACGGCG GCCCGCACCG CCGGGGCACG CGTGTTCGCC CGCGCGCCGC 
ACGCTATCGC GCCCCGCCCG CTACATCCGT CTCGCCTCCT CCCCCGCGCC GCCCGACCTC
GCCCGTCTTC CCGACCGCTT CGGGGCCACC CGTCGACGGC CTCGCGGCCG ACTCGCACTC
CGGGCGTTCG TGCGGCTCGA ACGAACCTCG CGCCCCCGCC CGCCGGCCGC CGCCGCGCCC
GGCTCGCCCG GCTGCCCAGC CGCGCCGGAC GGGGCCCGCA TGCTAGAATG GCCAGCTTCG
AACTCTCCCT ACGAAACTGG CGCCATGAAT ATCGTGATTT TGGCGGCAGG CACCGGCAAG
CGCATGCGTT CGGCGCTGCC GAAAGTGCTT CATCCTCTGG CCGGCAGGCC CCTTCTCTCC
CACGTGATCG ATACCGCCCG CGCACTCGCG CCGTCCCGGC TCGTCGTCGT GATCGGCCAT
GGCGCCGAGC AGGTGCGCGC GGCCGTCGCC GCGCCCGACG TGCAGTTCGC GGTGCAGGAG
CAGCAGCTCG GCACCGGGCA CGCGGTGCGC CAGGCGCTGC CGCTGCTCGA CCCGTCGCAG
CCGACGCTCG TGCTGTACGG CGACGTGCCG CTCACGCGCA CGGCGACACT CAAGCGCCTC
GCCGACGCCG CGACCGACGC CCGCTACGGC GTGCTGACCG TCACGCTCGA CGATCCGACG
GGCTACGGGC GCATCGTGCG CGATCAGGCC GGGTGCGTGA CGCGCATCGT CGAGCAGAAG
GACGCGTCGC CCGACGAGTT GCGCATCGAC GAGATCAACA CGGGCATCGT CGTCGCGCCG
ACCGCGCAGC TTTCGATGTG GCTCGGCGCG CTCGGCAACG ACAACGCGCA GGGCGAGTAC
TATCTGACCG ACGTCGTCGA GCAGGCGATC GAAGCGGGCT TCGAGATCGT CACGACGCAG
CCGGACGACG AGTGGGAGAC GCTCGGCGTG AACAGCAAGG CGCAGCTCGC CGAGCTCGAG
CGCATTCATC AGCGCAACCT CGCCGACGCG CTGCTCGCCG CGGGCGTGAC GCTCGCCGAT
CCGGCGCGCA TCGACGTGCG CGGCACGCTC GCGTGCGGGC GCGACGTGTC GATCGACGTG
AATTGCGTGT TCGAAGGCGA CGTGACGCTC GCCGACGGCG TGACGATCGG CGCGAACTGC
GTGATCCGCC ACGCGGCGAT CGCCGCGGGC GCGCGCGTGG ACGCGTTCTC GCATCTCGAC
GGCGCGACGG TCGGCGCGAA CGCGGTCGTC GGCCCGTACG CGCGGCTGCG CCCGGGCGCG
GTGCTCGCCG CCGACGCGCA CGTCGGCAAC TTCGTCGAGG TGAAGAACGC GACGCTCGGC
CAAGGCTCGA AGGCGAACCA TCTGACCTAT CTCGGCGACG CGGACATCGG CGCGCGCGTG
AACGTCGGCG CGGGCACGAT CACGTGCAAC TACGACGGCG CGAACAAGTT CCGCACGGTC
ATCGAGGACG ACGTGTTCGT CGGCTCGGAC ACGCAGTTCG TCGCGCCGGT GCGCGTCGGC
CGCGGCGTGA CGGTGGCGGC GGGCACGACC GTATGGAAGG ACGTCGCCGC GGACATGCTC
GTGCTCAACG ACAAGACGCA GACCGCGAAG AGCGGCTACG TGCGCCCCGT CAAGAAGAAG
AGCTGA
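
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch of one common definition (the fraction of G and C bases among all bases). The function name `gc_content` is hypothetical, and how the record's value was rounded is not stated in the source.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (such as the spaces between 10-base blocks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence.
first_block = "GTGGCGCGCA"
print(gc_content(first_block))
```

To check the whole gene, the full sequence text can be passed in as one string; the line breaks and block spacing are stripped by the function.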
 
Protein sequence
MARIARRPAP PGHACSPARR TLSRPARYIR LASSPAPPDL ARLPDRFGAT RRRPRGRLAL 
RAFVRLERTS RPRPPAAAAP GSPGCPAAPD GARMLEWPAS NSPYETGAMN IVILAAGTGK
RMRSALPKVL HPLAGRPLLS HVIDTARALA PSRLVVVIGH GAEQVRAAVA APDVQFAVQE
QQLGTGHAVR QALPLLDPSQ PTLVLYGDVP LTRTATLKRL ADAATDARYG VLTVTLDDPT
GYGRIVRDQA GCVTRIVEQK DASPDELRID EINTGIVVAP TAQLSMWLGA LGNDNAQGEY
YLTDVVEQAI EAGFEIVTTQ PDDEWETLGV NSKAQLAELE RIHQRNLADA LLAAGVTLAD
PARIDVRGTL ACGRDVSIDV NCVFEGDVTL ADGVTIGANC VIRHAAIAAG ARVDAFSHLD
GATVGANAVV GPYARLRPGA VLAADAHVGN FVEVKNATLG QGSKANHLTY LGDADIGARV
NVGAGTITCN YDGANKFRTV IEDDVFVGSD TQFVAPVRVG RGVTVAAGTT VWKDVAADML
VLNDKTQTAK SGYVRPVKKK S
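
The relation between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino-acid assignments of translation table 11 (which match the standard code) and treats only ATG, GTG, and TTG as initiator codons read as M, which is a subset chosen for illustration. The names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only these initiator codons are handled; as a first codon they read as M.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    # An initiator codon in the first position is translated as methionine.
    if residues and dna[:3] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


# First three 10-base blocks of the gene sequence (30 bases, 10 codons).
print(translate("GTGGCGCGCA TCGCACGGCG GCCCGCACCG"))  # MARIARRPAP
```

The output matches the first block of the protein sequence, and the gene's final six bases (AGC TGA) translate to the terminal S followed by a stop codon.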