Gene BCZK4901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK4901 
SymbolcelF 
ID3026885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp4991263 
End bp4992588 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content38% 
IMG OID637549134 
Product6-phospho-beta-glucosidase 
Protein accessionYP_086471 
Protein GI52140360 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGGAA TTAAAATTGC TACAATCGGC GGTGGATCTA GCTATACACC AGAGTTAATT 
GAAGGATTTA TTAAACGTTA TGATGAGCTT CCTGTTAGTG AAATTTGGTT AGTAGATATT
GAGGCAGGAA AAGAGAAGTT AGAAATCGTT GGTAACTTAG CGAAACGTAT GGTGAAAAAA
TCTGGTTTAC CAATCGAGGT ACATTTAACG CTTGATCGCC GCGAGGCATT AAAGGATGCT
GACTTCGTAA CAACACAACT TCGCGTAGGT TTATTAGAAG CACGTGCAAA AGATGAAGCA
ATCCCATTAA AATATGATGT AATCGGTCAG GAAACGAATG GTCCTGGTGG TTTATTCAAA
GCATTGAGAA CGATTCCTGT TATTTTAGAT ATTTGTAAGG ATATGGAGGA ACTTTGTCCA
AATGCATGGT TAATTAACTT TGCAAACCCA GCTGGTATGG TAACAGAAGC TGTTCTTCGT
TATACAAATA TTCAAAGAGT AGTTGGTCTA TGTAACGTTC CAATCGGAAT TCGTATGGGT
CTTGCAAAAT TACTTGAAGT AGATGCGAGT CGTGTACACG TTGACTTCGC AGGTTTAAAC
CATATGGTAT ACGGACTAGA CGTATATTTA GATGGCGTAA GTGTAATGGA TCGTGTGTTA
GAGCTTGTAA CAGATCCAGA AAAGCAAATT ACGATGGAAA ATATTGCAGC GCTTAATTGG
GAACCAGACT TTATTCGTGG CCTTCGTGCA ATTCCATGTC CATATCACCG TTACTACTAC
AAAACACGTG AAATGTTAGA AGAAGAGAAA GAAGCTTCGG TTGAAAAAGG TACACGTGCA
GAAGTAGTAA AACAATTAGA AGATGATTTA TTCGAGTTAT ATAAAGACCC GAACTTAGAT
ATTAAACCAC CACAATTAGA AAAACGTGGA GGCGCTTATT ATAGTGATGC AGCATGTAGC
TTAATTACGT CTATTTACAA TAATAAAGGT GATATCCAGC CTGTTAATAC ACGAAACAAC
GGAACAATTG CGAGCTTACC ACATGATTCT GCTGTTGAAG TGAACTGTAT TATTACGAAA
GAAGGTCCAA AACCAATTGC AGTTGGAGAT CTTCCAGTAC CAGTTCGCGG TTTAGTACAA
CAAATTAAAT CATTTGAGCG CACAACAATT GAAGCTGCTG TTACAGGAGA TTATCATAAA
GCGCTGCTTG CTATGATAAT TAATCCACTT GTACCATCAG ATAAAGTTGC AAAACAAATT
TTAGATGAAA TGTTGGAAGC GCATAAAGAA TATCTTCCGC AGTTCTTCAA AAAGGTAGAG
AAATAA
 
Protein sequence
MTGIKIATIG GGSSYTPELI EGFIKRYDEL PVSEIWLVDI EAGKEKLEIV GNLAKRMVKK 
SGLPIEVHLT LDRREALKDA DFVTTQLRVG LLEARAKDEA IPLKYDVIGQ ETNGPGGLFK
ALRTIPVILD ICKDMEELCP NAWLINFANP AGMVTEAVLR YTNIQRVVGL CNVPIGIRMG
LAKLLEVDAS RVHVDFAGLN HMVYGLDVYL DGVSVMDRVL ELVTDPEKQI TMENIAALNW
EPDFIRGLRA IPCPYHRYYY KTREMLEEEK EASVEKGTRA EVVKQLEDDL FELYKDPNLD
IKPPQLEKRG GAYYSDAACS LITSIYNNKG DIQPVNTRNN GTIASLPHDS AVEVNCIITK
EGPKPIAVGD LPVPVRGLVQ QIKSFERTTI EAAVTGDYHK ALLAMIINPL VPSDKVAKQI
LDEMLEAHKE YLPQFFKKVE K