Gene BCZK0689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK0689 
SymbolcelB 
ID3023537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp791543 
End bp792853 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content36% 
IMG OID637544926 
ProductPTS system, cellobiose-specific IIC component 
Protein accessionYP_082293 
Protein GI52144535 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1455] Phosphotransferase system cellobiose-specific component IIC 
TIGRFAM ID[TIGR00359] phosphotransferase system, cellobiose specific, IIC component
[TIGR00410] PTS system, lactose/cellobiose family IIC component 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGGGT TTATGAGTTT TATGGAACAA AAGATTATGC CAACAACGCA AAAGATTGCC 
GGACAGCGAC ATTTATTAGC AATTCGAAAC GGGGTTATTT CTACATTACC GTTAACGATT
GTCGGATCAT TTTTCGTTAT CTTTTTAAAC TTACCAATTG ATGCATATAT GGAATGGATT
GCACCGTTTC GCCATATTTT AGATATTCCA TTCCGATTTA CAGTAGGACT AATGGCGTTA
TACGCAGCAT TTGGGGTAGG CGCTTCGCTC GCGAATTTTT ATCAGCTCAA TCAATTAAGT
GCAGGGTTAC TATCTGTACT CGCCTTTTTA CTAGCATCGG TTGAACCAAT TCAAATTACG
AAAGCTGTAC CAGGTGTTAT TGATGCAGGT AGATATATTT CAGTAGGAAC ATTAAGTGCG
ACATCTTTAT TTGGCGCAAT TGTTACAGCT TTAATTGCAG TAGAAATTTA TCACTTTATG
ATTAAGCATA ATATCTCAAT TAAATTACCA GATAGTGTAC CACCAGCAGT TTCAAATTCA
TTTGCAGCAT TAATTCCAAC GTTAGCAGTT ATTCTTTTAT TCTGGGGCAT TCGCTACGGT
TTGAAATTTG ATGTAAATAC AACAATCACG TATTTAATCG CACCATTAAA ATCAGTATTA
GTAGGAAATA ATTTATTCGG TGGTTTATTA ACAGTATTCT TAATCGTATT CTTCTGGTCA
TTCGGTATAC ATGGCCCTGC GATTTTAGGG CCGATCATTC GTCCAATGTG GGATTCTGCA
ATTCTTGAAA ATATGGAAGT ATTCACAGCT ACAGGAAATG CACATCAATT ACCAAACTTA
TTTACAGAGC AATTTATTCA ATGGTTCGTA TGGATTGGTG GATCAGGCTC AACGTTAGCT
TTAGTCATTA TGTTTATGTT CTCTAAATCT AAGTTCCTAA AAGAATTAGG TAGATTATCA
TTTGTGCCAG GCTTATTCAA TATTAACGAA CCAATTATTT TCGGGGCACC AATTGTAATG
AACCCAATCT TAATTATTCC GTTCGTTATT ACACCGTTAG TGACAACGAC AGTATCATAT
TTCGCAGTTG TTTCAGGTAT GATCCCGATG ATGATGGCGA AACTGCCATT TACGATGTTA
GCACCAATTG CAGCGATTAT TAGTACGGAC TGGACAATTA TGGCTGGTGT ACTTGTACTT
GTTAACTTTG TTATCTCATT CGTTATTTAC TATCCATTCT TCAAAATGTA TGAGAAACAA
CAATTAGCAG GAGAGGAGAA AACAGAATGC TCGGAGCAAT TATCATCTTA A
 
Protein sequence
MNGFMSFMEQ KIMPTTQKIA GQRHLLAIRN GVISTLPLTI VGSFFVIFLN LPIDAYMEWI 
APFRHILDIP FRFTVGLMAL YAAFGVGASL ANFYQLNQLS AGLLSVLAFL LASVEPIQIT
KAVPGVIDAG RYISVGTLSA TSLFGAIVTA LIAVEIYHFM IKHNISIKLP DSVPPAVSNS
FAALIPTLAV ILLFWGIRYG LKFDVNTTIT YLIAPLKSVL VGNNLFGGLL TVFLIVFFWS
FGIHGPAILG PIIRPMWDSA ILENMEVFTA TGNAHQLPNL FTEQFIQWFV WIGGSGSTLA
LVIMFMFSKS KFLKELGRLS FVPGLFNINE PIIFGAPIVM NPILIIPFVI TPLVTTTVSY
FAVVSGMIPM MMAKLPFTML APIAAIISTD WTIMAGVLVL VNFVISFVIY YPFFKMYEKQ
QLAGEEKTEC SEQLSS