Gene BCG9842_B5624 details

Gene Information

Locus tag: BCG9842_B5624
Symbol: celB3
ID: 7186501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus G9842
Kingdom: Bacteria
Replicon accession: NC_011772
Strand:
Start bp: 5077404
End bp: 5078705
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 39%
IMG OID: 643553103
Product: PTS system, cellobiose-specific IIC component
Protein accession: YP_002448744
Protein GI: 218900333
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1455] Phosphotransferase system cellobiose-specific component IIC
TIGRFAM ID: [TIGR00359] phosphotransferase system, cellobiose specific, IIC component
            [TIGR00410] PTS system, lactose/cellobiose family IIC component
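
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch makes explicit. It assumes (this is not stated in the record) that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no amino acid.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(5077404, 5078705)
    print(length, protein_length(length))
```

Under these assumptions the record is self-consistent: 5078705 - 5077404 + 1 = 1302 bp, and 1302 / 3 - 1 = 433 aa.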


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.00000000000179415
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATACGAT TTTTAGAGAA GTATGTGATG CCGGTGGCAG GGAAGGTTGC AGAGCAGAGG 
CATTTACAAG CAATTCGAGA TGGAATTATT TTAACGATGC CTTTCTTAAT TATTGGATCG
TTTTTCCTCA TTATTAGTGC ACTGCCGATA CCGGGATATA ACGAGTTTAT GGCAGGTTTG
TTTGGTGAGA ATTGGCAGAG AGCTTTAGGG TATCCAGTTA GTGCAACTTT TAATATAATG
GCTTTAATAG CTGTTTTTGG AATCGCTTAC AGGCTTGGAG AGTATTATAA AGTGGATGCT
TTAGCATCCG GGGCATTGTC GCTTGTGACG TTTTTACTTG CGACTCCATT TCAAGTTGCA
TATATTATAC CAAGTACAAA AGAGAGTGTA CTTGTAGAAG GCGCTATCCC AGCTGCATTA
ATGGGAAGCC AAGGGTTATT TGTAGCAATG ATTATTGCAC TTATATCTAC TGAACTTTAT
CGGTTTATTG TACAAAAAAA GATAATTATA AGGATGCCAG AGACAGTTCC ACCAGCTGTC
ACGCGTTCAT TTGCGGCACT TGTTCCAGGT TTTATTGTTG TAACAGTTAT TTGGATTTTA
CGCTTAATTA TAGAAAATAC TTCTTTTGGC AGTATCCATA ATATTGTAGG ACAAATTTTA
CAGGAACCAC TTAGTGTACT TGGTGCTAGT CTTTGGGGCG CAATAATAGC AGTTATTCTC
GTTCATGTTC TTTGGTCGTG TGGAATTCAT GGTGCTACTA TTGTTGGTGG TGTAATGAGC
CCTGTTTGGT TGTCTTTAAT GGATCAAAAT CGAGTTGCTT TTCAAGCTGG GCAAGATGTA
CCAAATACCA TTACGGCACA GTTTTTTGAC TTATGGATTT ATATGGGCGG TTCTGGTGCA
ACGCTGGCTC TAGTTGTCGG AATGTTACTA TTTGCACGAA GTCAACAATT AAAAAGTTTA
GGGAGATTGT CAATCGCGCC TGGTATATTT AATATTAATG AGATGGTAAC TTTTGGTATG
CCAATTGTAA TGAACCCAAT TTTATTAATT CCATTTATAT TGGTTCCAGT TGTGTTAACA
ATTGTTTCTT ACTTTGCAAT GGAATGGGGA TGGGTCGCAC GTCCGAGTGG GGCGGCTGTA
CCTTGGACGA CACCTATTCT TTTCAGTGGA TATTTAGGAT CGGGTGGGAA AATTTCAGGT
GTTGTTTTAC AACTCGTCAA CTTTGCGCTT GCATTTTTCA TTTATTTACC GTTCTTAAAA
ATATGGGATA AACAAAAAGT AGCAGAAGAA AAGGGGGAGT AA
 
Protein sequence
MIRFLEKYVM PVAGKVAEQR HLQAIRDGII LTMPFLIIGS FFLIISALPI PGYNEFMAGL 
FGENWQRALG YPVSATFNIM ALIAVFGIAY RLGEYYKVDA LASGALSLVT FLLATPFQVA
YIIPSTKESV LVEGAIPAAL MGSQGLFVAM IIALISTELY RFIVQKKIII RMPETVPPAV
TRSFAALVPG FIVVTVIWIL RLIIENTSFG SIHNIVGQIL QEPLSVLGAS LWGAIIAVIL
VHVLWSCGIH GATIVGGVMS PVWLSLMDQN RVAFQAGQDV PNTITAQFFD LWIYMGGSGA
TLALVVGMLL FARSQQLKSL GRLSIAPGIF NINEMVTFGM PIVMNPILLI PFILVPVVLT
IVSYFAMEWG WVARPSGAAV PWTTPILFSG YLGSGGKISG VVLQLVNFAL AFFIYLPFLK
IWDKQKVAEE KGE
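
The sequences above are printed in blocks of ten characters, so they must be joined before any analysis. The following minimal sketch shows one way to clean such a block, compute its GC fraction, and translate it. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The amino-acid string is the standard NCBI codon assignment in T, C, A, G order, which translation table 11 shares with table 1 for internal codons. The sketch assumes an ATG start and does not handle the alternative start codons that table 11 permits.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in NCBI tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned, non-empty DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First ten-base blocks of the gene sequence listed above.
    head = clean_sequence("ATGATACGAT TTTTAGAGAA GTATGTGATG")
    print(translate(head))
```

Applied to the full gene sequence, the results of `len`, `gc_fraction`, and `translate` can be compared against the Gene Length, GC content, and Protein sequence fields reported in the record.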