Gene BCZK0044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK0044 
SymbolglmU 
ID3025510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp51145 
End bp52524 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content38% 
IMG OID637544204 
Productbifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase 
Protein accessionYP_081661 
Protein GI52145169 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0363259 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAACA GATTTGCAGT GATTCTAGCT GCAGGTAAAG GCACACGTAT GAAGTCTAAG 
CTATACAAAG TGCTTCATCC TGTATGTGGA AAACCTATGG TACAACATGT TGTCGATCAA
GTATCTCAAT TAGGGTTGCA GAAACTTGTA ACAGTCGTAG GACATGGTGC TGAAATGGTA
CAAGAACAGC TAGGAAACGT AAGTGAGTTT GCATTACAAG CAGAACAACT TGGTACAGCG
CATGCTGTAG ATCAAGCTGC AGGTGTACTT GCAAATGAAG AAGGAACAAC TTTAGTTATT
TGTGGTGATA CGCCGCTAAT AACTGCTGAA ACGATGGAAG CATTACTTCA GCAACATAAA
GAAGCAGGGG CAATGGCAAC GGTGTTAACA GCTTACATAG AAGAACCTGC TGGATATGGT
CGTATCGTTC GTAATGAGAA TGGTCATGTT GAAAAGATTG TTGAGCATAA GGATGCAAAT
GAGAAAGAAT TAGCTATTAA AGAAATCAAT ACAGGTACGT ATTGTTTTGA TAATAAAGCT
TTATTCGCTT CACTTTCTAA GGTTTCAAAT GATAACGTAC AAGGTGAATA TTACCTGCCA
GATGTTATTG AGATTTTAAA AAATGAAGGT CATATTGTAT CGGCTTATCA AACAGAGCAC
TTCGATGAAA CGTTAGGTGT TAACGACAGA GTCGCTCTAT CGCAAGCGGA AATTATTATG
AAAAACCGTA TCAACCGAAA GAACATGGTA AATGGTGTTA CAATTATTGA TCCAAGTAAC
ACGTATATTT CTGCTGATGC AATTATCGGT AGTGATACAG TTCTTCATCC AGGAACAATT
ATTGAGGGGA ACACTGTAAT TGGTTCTGAT TGTGAAATTG GACCGCATAC AGTAATTCGC
GATAGTGAAA TTGGAGATCG TACGACAATT CGTCAATCTA CTGTACATGA TAGTAAGCTT
GGTACAGAAG TATCGGTTGG TCCATTTGCA CATATTCGCC CAGATTCAGT TATTGGAGAT
GAAGTACGCG TTGGAAACTT CGTGGAAATC AAAAAAACTG TTTTTGGTAA TAGAAGTAAA
GCTTCACACT TAAGTTATAT CGGGGATGCA CAAATTGGAG AAGACGTGAA TCTTGGTTGT
GGTTCAATTA CGGTGAACTA TGACGGTAAG AATAAATTCA AAACTGTGAT TGGTAACGGG
GTATTTATTG GATGTAATTC AAACCTTGTT GCTCCTGTAA CAGTTGAAGA TGGTGCTTAT
GTGGCAGCAG GCTCTACAAT TACAGAGAAT GTTCCATCAA AAGCATTATC TGTAGCACGT
GCACGTCAAG TTAACAAAGA AGACTATGTT GATCAATTGC TGAATAAGAA AAAATCATAA
 
Protein sequence
MSNRFAVILA AGKGTRMKSK LYKVLHPVCG KPMVQHVVDQ VSQLGLQKLV TVVGHGAEMV 
QEQLGNVSEF ALQAEQLGTA HAVDQAAGVL ANEEGTTLVI CGDTPLITAE TMEALLQQHK
EAGAMATVLT AYIEEPAGYG RIVRNENGHV EKIVEHKDAN EKELAIKEIN TGTYCFDNKA
LFASLSKVSN DNVQGEYYLP DVIEILKNEG HIVSAYQTEH FDETLGVNDR VALSQAEIIM
KNRINRKNMV NGVTIIDPSN TYISADAIIG SDTVLHPGTI IEGNTVIGSD CEIGPHTVIR
DSEIGDRTTI RQSTVHDSKL GTEVSVGPFA HIRPDSVIGD EVRVGNFVEI KKTVFGNRSK
ASHLSYIGDA QIGEDVNLGC GSITVNYDGK NKFKTVIGNG VFIGCNSNLV APVTVEDGAY
VAAGSTITEN VPSKALSVAR ARQVNKEDYV DQLLNKKKS