Gene BCZK1888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK1888 
Symbol 
ID3024576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp2008678 
End bp2009886 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content34% 
IMG OID637546115 
Productglycosyltransferase; macrolide glycosyltransferase 
Protein accessionYP_083481 
Protein GI52143348 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID[TIGR01426] glycosyltransferase, MGT family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.213265 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAATG TACTCGTAAT AAATTTTCCT GGAGAAGGTC ATATAAATCC GACTTTAGCT 
ATTATAAGTG AGTTAATTCG GCGAGGGGAA ACAGTTGTTT CGTATTGTAT TGAAGATTTT
AGAAAGAAGA TTGAAGCAAC AGGTGCAGAA TTCCGAGAGT TTGAGAATTT TCTCTCTCAA
ATTAATATTA TGGAACGAGT AAATGAAGGT GGGAGTCCTT TGACGATGCT ATCTCATATG
ATTGAAGCAT CAGAGCGTAT TGTTACTCAA ATTGTAGAAG AAACAAAAGG AGAACAGTAC
GATTACTTAC TATACGATAA TCATTTTCCA GTAGGACGTA TCATAGCGAA TGTTTTACAA
TTACCTAGCA TTTCGTCTTG TACAACGTTT GCTTTTAATC AGTACATTAC TTTTAACGAT
GAACAAAAAT CGAGAGAAGT AGATGAAACG AATCCAGTAT ATCAATCTTG TTTAGCGGGA
ATGGAAAAAT GGAATAAGCA GTATGGAATG AAGTGTACTA GTATGTATGA TATTATGAAT
CATCCTGGTG ATATTACTAT TGTATATACT TCCAAAGAAT ATCAGCCGCG GTCAGATGTA
TTCGATGAAT CGTATAAGTT TGTCGGTCCA TCAATTGCTA CTCAAAAAGA AGTAGGTAGC
TTTCCTATTG AACATTTAAA AGATGAAAAA GTGATTTTCA TTTCTATGGG AACAGTTTTT
AATGAACAAC CTGAGCTATA TGAAAAATGT TTTGAAGCTT TTAAAGATGT AGAAGCGACA
GTCGTATTAG TTGTTGGTAA GAAGATAAAT ATAAGTCAAA TTGAAAACAT TCCGGATAAC
TTTAAGTTGT ATAATTATGT GCCACAGTTA GAAGTATTAC AGCATGCTGA TGTATTCGTG
ACACACGGTG GTATGAATAG TTCCAGTGAA GCACTATATT ACGGTGTCCC GTTAGTTGTA
ATTCCGGTAA CAGGAGATCA GCCTTTAGTT GCGAAACGAG TAAATGAAGT AGGGGCTGGA
ATAAGGCTAA ATCGTAAAGA ATTCACTTCT GAATTGTTAC GAGAGTCTGT AAAGAAAGTG
ATGAATGATG TAACGTTTAA GGAAAATAGT CGTAAAGTTG GAGAGTCACT TCGAAATGCT
GGTGGATATA AAAGGGCAGT TGATGAAATA TTTAAAATGA AAATGAATTC GTACTTGAAA
CTTAAATAA
 
Protein sequence
MANVLVINFP GEGHINPTLA IISELIRRGE TVVSYCIEDF RKKIEATGAE FREFENFLSQ 
INIMERVNEG GSPLTMLSHM IEASERIVTQ IVEETKGEQY DYLLYDNHFP VGRIIANVLQ
LPSISSCTTF AFNQYITFND EQKSREVDET NPVYQSCLAG MEKWNKQYGM KCTSMYDIMN
HPGDITIVYT SKEYQPRSDV FDESYKFVGP SIATQKEVGS FPIEHLKDEK VIFISMGTVF
NEQPELYEKC FEAFKDVEAT VVLVVGKKIN ISQIENIPDN FKLYNYVPQL EVLQHADVFV
THGGMNSSSE ALYYGVPLVV IPVTGDQPLV AKRVNEVGAG IRLNRKEFTS ELLRESVKKV
MNDVTFKENS RKVGESLRNA GGYKRAVDEI FKMKMNSYLK LK