Gene BCG9842_B4490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B4490 
SymbolcelB1 
ID7182001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp779655 
End bp780965 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content37% 
IMG OID643548578 
ProductPTS system, cellobiose-specific IIC component 
Protein accessionYP_002444249 
Protein GI218895838 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1455] Phosphotransferase system cellobiose-specific component IIC 
TIGRFAM ID[TIGR00359] phosphotransferase system, cellobiose specific, IIC component
[TIGR00410] PTS system, lactose/cellobiose family IIC component 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.000000107554 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATGGGT TTATGAGTTT TATGGAACAA AAGATTATGC CAACAACGCA AAAGATTGCA 
GGACAACGAC ATTTATTGGC AATTCGAAAT GGGGTTATTT CTACATTACC GTTAACGATC
GTCGGGTCAT TTTTCGTTAT CTTTTTAAAT TTACCAATTG ATGCATATAT GGAATGGATT
GCACCGTTTC GCCATATTTT AGATATTCCA TTCCGATTTA CAGTAGGACT AATGGCCTTA
TACGCAGCGT TTGGAGTAGG GGCCTCGCTC GCGAACTTTT ATCAGCTCAA TCAATTAAGT
GCAGGGCTAC TATCTGTACT CGCCTTTTTA CTAGCATCAG TTGAACCAAT TCAAATTACG
AAAGCTGTAC CAGGTGTTAT TGATGCAGGT CGATATATTT CAGTAGGAAC ATTAAGTGCA
ACGTCTTTAT TCGGCGCAAT TGTGACAGCA TTAATTGCAG TGGAAATTTA TCATTTTATG
ATTAAGCATA ATATCTCAAT AAAATTACCA GACAGTGTAC CACCAGCAGT TTCAAATTCG
TTTGCAGCAT TAATTCCAAC ATTAGTAGTT ATTCTTTTAT TCTGGGGCAT TCGCTACGGT
TTGAAATTCG ATGTAAATAC AACAATTACA TACTTAATCG CACCATTAAA ATCAGTACTA
GTAGGAAATA ACTTATTCGG TGGTTTATTA ACAGTATTCT TAATCGTGTT CTTCTGGTCA
TTTGGTATAC ATGGACCGGC AATTTTAGGA CCAATCATTC GTCCGATGTG GGATTCCGCA
ATTCTTGAAA ATATGGAAGT GTTCACGGCT ACAGGAAATG CACATCAGTT ACCAAACTTA
TTTACAGAGC AGTTCATTCA ATGGTTCGTA TGGATTGGCG GATCTGGCTC AACGTTAGCT
TTAGTAATTA TGTTTATGTT CTCTAAATCT AAGTTCCTAA AAGAGTTAGG TAGATTATCA
TTCGTACCAG GTTTATTCAA TATTAACGAG CCAATTATTT TCGGGGCACC AATTGTAATG
AACCCAATCT TAATTATTCC GTTCGTTATT ACACCGTTAG TGACAACGAC AGTATCATAT
TTCGCAGTTG TTTCAGGTAT GATTCCACTC ATGATGGCGA AATTGCCATT TACGATGTTA
GCACCAATTG CAGCGGTGAT TAGTACGGAC TGGACAATTA TGGCTGGTGT ACTTGTACTT
GTTAACTTTG TAATCTCATT CGTTATTTAC TATCCATTCT TCAAAATGTA TGAGAAACAA
CAATTAGCAG GAGAGGAGAA AACAGAATGC TCGGAGCAAT TATCATCTTA A
 
Protein sequence
MNGFMSFMEQ KIMPTTQKIA GQRHLLAIRN GVISTLPLTI VGSFFVIFLN LPIDAYMEWI 
APFRHILDIP FRFTVGLMAL YAAFGVGASL ANFYQLNQLS AGLLSVLAFL LASVEPIQIT
KAVPGVIDAG RYISVGTLSA TSLFGAIVTA LIAVEIYHFM IKHNISIKLP DSVPPAVSNS
FAALIPTLVV ILLFWGIRYG LKFDVNTTIT YLIAPLKSVL VGNNLFGGLL TVFLIVFFWS
FGIHGPAILG PIIRPMWDSA ILENMEVFTA TGNAHQLPNL FTEQFIQWFV WIGGSGSTLA
LVIMFMFSKS KFLKELGRLS FVPGLFNINE PIIFGAPIVM NPILIIPFVI TPLVTTTVSY
FAVVSGMIPL MMAKLPFTML APIAAVISTD WTIMAGVLVL VNFVISFVIY YPFFKMYEKQ
QLAGEEKTEC SEQLSS