Gene BCG9842_B5262 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B5262 
SymbolglmU 
ID7186062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp50868 
End bp52247 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content38% 
IMG OID643547835 
Productbifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase 
Protein accessionYP_002443596 
Protein GI218895185 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones124 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAACA GATTTGCAGT GATTCTAGCT GCAGGTAAAG GCACACGTAT GAAGTCTAAG 
CTATACAAAG TGCTGCATCC TGTATGTGGA AAACCCATGG TACAACACGT AGTCGATCAA
GTATCTCAAT TAGGATTGCA GAAACTTGTA ACGGTTGTTG GACATGGTGC TGAAATGGTA
CAAGAACAGC TAGGAAACGT AAGTGAGTTT GCATTACAAG CAGAACAACT TGGTACAGCA
CATGCTGTAG ATCAAGCTGC AAGTGTACTT GCAAATGAAG AAGGAACAAC TTTAGTTATT
TGTGGTGATA CGCCGCTAAT AACTGCTGAA ACGATGGAAG CATTACTTCA GCAACATAAA
GAAGCAGGAG CAATGGCGAC GGTGCTAACA GCGTACATAG AAGAACCTGC TGGATATGGC
CGTATTGTTC GTAATGAGAA TGGTCATGTT GAAAAGATTG TTGAGCATAA AGATGCAAAT
GAGAAAGAAT TAGCTATTAA AGAAATCAAT ACAGGTACGT ATTGTTTTGA TAATAAGGCT
TTATTTGCTT CACTTTCTAA AGTTTCAAAT GATAACGTAC AAGGTGAATA TTACCTTCCA
GATGTTATTG AGATTTTAAA AAATGAAGGT CATATCGTAT CAGCTTATCA AACAGAGCAG
TTCGATGAAA CGTTAGGTGT TAACGACAGA GTCGCTCTAT CGCAAGCGGA AATTATTATG
AAAAACCGTA TCAACCGAAA GAACATGGTA AATGGTGTTA CAATTATTGA TCCAAGTAAC
ACTTATATTT CTGCTGATGC AATTATCGGT AGTGATACAG TTCTTCATCC AGGAACAATT
ATTGAGGGGA ACACTGTAAT TGGTTCTGAT TGTGAAATTG GACCGCATAC AGTAATTCGT
GATAGTGAAA TTGGAGATCG TACGACAATT CGTCAATCTA CTGTACATGA TAGTAAACTT
GGTACAGAAG TATCGGTTGG TCCATTTGCA CATATTCGCC CAGATTCAGT TATTGGAGAC
GAAGTACGCG TTGGAAACTT CGTGGAAATC AAAAAAACTG TCTTTGGTAA TAGAAGTAAA
GCTTCACACT TGAGTTATAT CGGGGATGCA CAAGTTGGAG AAGACGTGAA TCTTGGTTGT
GGTTCAATTA CGGTGAACTA TGACGGTAAG AATAAATTCA AAACTGTGAT TGGTAATGGG
GTATTTATTG GATGTAATTC AAACCTTGTT GCTCCAGTAA CAGTTGAAGA TGGTGCTTAT
GTGGCAGCAG GCTCTACAAT TACAGAGAAT GTTCCATCAA AAGCATTATC GGTAGCACGT
GCACGTCAAG TTAACAAAGA AGACTATGTT GATCAATTGC TGAATAAGAA AAAATCATAA
 
Protein sequence
MSNRFAVILA AGKGTRMKSK LYKVLHPVCG KPMVQHVVDQ VSQLGLQKLV TVVGHGAEMV 
QEQLGNVSEF ALQAEQLGTA HAVDQAASVL ANEEGTTLVI CGDTPLITAE TMEALLQQHK
EAGAMATVLT AYIEEPAGYG RIVRNENGHV EKIVEHKDAN EKELAIKEIN TGTYCFDNKA
LFASLSKVSN DNVQGEYYLP DVIEILKNEG HIVSAYQTEQ FDETLGVNDR VALSQAEIIM
KNRINRKNMV NGVTIIDPSN TYISADAIIG SDTVLHPGTI IEGNTVIGSD CEIGPHTVIR
DSEIGDRTTI RQSTVHDSKL GTEVSVGPFA HIRPDSVIGD EVRVGNFVEI KKTVFGNRSK
ASHLSYIGDA QVGEDVNLGC GSITVNYDGK NKFKTVIGNG VFIGCNSNLV APVTVEDGAY
VAAGSTITEN VPSKALSVAR ARQVNKEDYV DQLLNKKKS