Gene BCZK5117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK5117 
Symbol 
ID3022829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp5225975 
End bp5227075 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content34% 
IMG OID637549349 
Productglycosyltransferase group 1 family protein 
Protein accessionYP_086686 
Protein GI52145229 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000816463 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTGT TGCATATGAA TGCTGGTGCG GAAGAAGGTG GGGGGAAAAC ACATATTATT 
TCACTTTTAT CTCAATTTTC AAAGGAAGAA GTGGAATTAA TGGTATTTGA AGAAGGTGCG
ATTGCTAGAG AGGCGAGAAA CCTTGGTATT CAAGTACATG TTTTTACCCA ATCATCTCGG
TACGACCTAT CAATTCTTTC AAAAATAAAA GCATTTATTA ATGAAAATCA ATTTGACATT
GTGCATACAC ATGGCGCACG AGCAAATTTC TATCTCTCCC TCTTGAAAAA AGGTATAAAA
GCGAAATGGA TAATGACTGT CCACAGTGAT CCAACTTTGG ATTTTATGAA GAGGGGATTA
AAAGGATGGG TATTTACGAA GTTAAATTTA CGTTCTTTCA GGAAGGTAGA TTTATTCTTT
GCAATTACGG AGAACTTTAA GAGAAATATA ATAAAACTAG GTGTACCAGA AGAGAAGATT
TGTACTGTTT ATAATGGAAT TGAGTATGAT AGTAATCCGG CAAAACCTTA TGATAAGAGT
GAATTTGGCA TTGATGAAGG AATATTTACA GCCATTCAAG TAGCACGTCT TCATCCTGTT
AAAGGTCATG ATATTTTATT TGAAGCATTA CAAAAAATTA AAATTCCCAA TATAAAGGTA
CTCTTGCTTG GTGATGGTCC TATAGAAGCA GAATTAAAAG ATATGGTGAA ACAAAAGGGT
CTAGAGGATA AAGTAATGTT TCTAGGTTTT CGTACAGATT CAAAGGAATT ATATGCGTCT
GCACACATTA ATTTGTTAAC CTCTTATAGC GAAAGTTTCC CTCTCGTTTT ATTAGAAGCG
GCTAATCAAC GCTTAACATC TATTGCAACA AATGTAGGTG ATATGAAAAA GTTAATAGTT
GATGATACGT ATGGATGGAT TGTTCCGATT GGTGATGCAG ACTCGTTAGC AAATGCATTA
GAAAATGCTT ATGAAAAATG GTTGAATGGT GAATTAGAAG CGATGGGAAA TCGTTTATAT
ACTCACGCAT CTACTCACTT CTCGCTAAAG AATTTGTATG AAGATACTTA TAATGCATAT
AAAACACTTT TACTGAAATA G
 
Protein sequence
MKVLHMNAGA EEGGGKTHII SLLSQFSKEE VELMVFEEGA IAREARNLGI QVHVFTQSSR 
YDLSILSKIK AFINENQFDI VHTHGARANF YLSLLKKGIK AKWIMTVHSD PTLDFMKRGL
KGWVFTKLNL RSFRKVDLFF AITENFKRNI IKLGVPEEKI CTVYNGIEYD SNPAKPYDKS
EFGIDEGIFT AIQVARLHPV KGHDILFEAL QKIKIPNIKV LLLGDGPIEA ELKDMVKQKG
LEDKVMFLGF RTDSKELYAS AHINLLTSYS ESFPLVLLEA ANQRLTSIAT NVGDMKKLIV
DDTYGWIVPI GDADSLANAL ENAYEKWLNG ELEAMGNRLY THASTHFSLK NLYEDTYNAY
KTLLLK