Gene BCG9842_B5631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B5631 
SymbolcelF 
ID7182883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp5071476 
End bp5072801 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content38% 
IMG OID643553096 
Product6-phospho-beta-glucosidase 
Protein accessionYP_002448737 
Protein GI218900326 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00000000000180851 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTGGAA TTAAAATTGC TACAATCGGC GGTGGATCTA GTTATACACC AGAGTTAATT 
GAAGGATTTA TTAAACGTTA TGATGAGCTT CCTGTTCGTG AAATTTGGTT AGTAGATATT
GAGGCAGGAA AAGAAAAGTT AGAAATTGTT GGTAACTTAG CGAAACGTAT GGTGAAAAAA
TCTGGTTTAC CAATCGAGGT ACATTTAACA CTTGATCGCC GTGAGGCATT AAAAGATGCA
GACTTCGTAA CAACACAACT TCGCGTTGGT TTATTAGAAG CACGTGCAAA AGATGAAGCA
ATCCCATTAA AATATGATGT AATCGGTCAG GAAACGAATG GTCCTGGTGG TTTATTCAAA
GCACTGAGAA CGATTCCTGT TATTTTAGAT ATTTGTAAAG ATATGGAGGA GCTTTGTCCG
AATGCATGGC TAATTAACTT TGCGAACCCA GCTGGTATGG TAACAGAAGC TGTCCTTCGT
TATACAAATA TTCAAAGAGT AGTTGGTCTA TGTAACGTTC CAATCGGAAT CCGCATGGGT
CTTGCAAGAT TACTTGAAGT AGATGCAAGT CGTGTCCACG TTGATTTTGC AGGTTTAAAT
CATATGGTAT ACGGACTAGA TGTATACTTA GATGGCGTAA GTGTAATGGA TCGTGTGTTA
GAGCTTGTAA CAGATCCGGA AAAGCAAATT ACGATGGAAA ATATCGCAGC GCTTAACTGG
GAACCAGACT TTATTCGCGG CCTTCGTGCA ATTCCATGTC CATATCATCG TTATTACTAC
AAAACACGTG AAATGTTAGA AGAAGAAAAA GAAGCTTCTG TTGAAAAAGG TACACGTGCA
GAAGTAGTAA AACAATTAGA AAATGATTTA TTTGAGTTAT ATAAAGACCC GAATTTAGAT
ATTAAACCAC CACAATTAGA AAAACGTGGC GGCGCTTATT ATAGTGACGC AGCATGTAGC
TTAATTACGT CTATTTACAA CAATAAAGGT GATATCCAGC CTGTTAATAC ACGAAACAAC
GGAACAATTG CAAGCTTACC AGATGATTCT GCTGTTGAAG TGAACTGTAT TATTACGAAA
GAAGGTCCAA AACCAATTGC GGTCGGAGAT CTTCCAGTAC CAGTTCGCGG TTTAGTACAG
CAAATTAAAT CATTTGAGCG CACAACAATT GAAGCTGCTG TTACAGGTGA TTATCATAAA
GCGCTGCTTG CTATGACAAT TAATCCACTT GTACCATCAG ATACAGTTGC AAGACAAATT
TTAGATGAAA TGTTGGAAGC ACATAAAGAA TATCTTCCGC AGTTCTTCAA AAAGGTAGAG
AAGTAA
 
Protein sequence
MTGIKIATIG GGSSYTPELI EGFIKRYDEL PVREIWLVDI EAGKEKLEIV GNLAKRMVKK 
SGLPIEVHLT LDRREALKDA DFVTTQLRVG LLEARAKDEA IPLKYDVIGQ ETNGPGGLFK
ALRTIPVILD ICKDMEELCP NAWLINFANP AGMVTEAVLR YTNIQRVVGL CNVPIGIRMG
LARLLEVDAS RVHVDFAGLN HMVYGLDVYL DGVSVMDRVL ELVTDPEKQI TMENIAALNW
EPDFIRGLRA IPCPYHRYYY KTREMLEEEK EASVEKGTRA EVVKQLENDL FELYKDPNLD
IKPPQLEKRG GAYYSDAACS LITSIYNNKG DIQPVNTRNN GTIASLPDDS AVEVNCIITK
EGPKPIAVGD LPVPVRGLVQ QIKSFERTTI EAAVTGDYHK ALLAMTINPL VPSDTVARQI
LDEMLEAHKE YLPQFFKKVE K