Gene EcSMS35_1457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1457 
SymbolchbF 
ID6146904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1440598 
End bp1441950 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content46% 
IMG OID641616335 
Product6-phospho-beta-glucosidase 
Protein accessionYP_001743515 
Protein GI170680403 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.307049 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA AATTAAAAGT CGTCACTATT GGTGGCGGGA GCAGCTATAC CCCGGAGTTA 
CTGGAAGGAT TTATTAAGCG TTATCACGAA TTGCCGGTCA GCGAATTATG GCTGGTGGAT
GTCGAAGATG GTAAAGAGAA ACTGGATATT ATTTTTGATC TCTGCCAACG GATGATTGAT
AACGCTGGCG TCCCGATGAA GCTTTATAAA ACGCTGGATC GCCGCGAAGC ATTGAAAGAT
GCTGATTTCG TTACTACCCA ACTGCGCGTA GGCCAATTAC CGGCGCGCGA ACTGGATGAA
CGTATTCCAT TAAGTCATGG TTATCTTGGT CAGGAAACCA ACGGCGCGGG CGGCCTGTTT
AAAGGTCTGC GTACCATTCC GGTGATTTTT GACATCGTAA AAGATGTCGA AGAACTTTGT
CCGAATGCAT GGGTGATTAA TTTCACTAAC CCGGCGGGAA TGGTCACTGA AGCCGTTTAT
CGTCATACTG GATTTAAACG CTTTATCGGC GTGTGTAATA TTCCGATCGG CATGAAGATG
TTTATTCGCG ATGTTCTGAT GCTGAAAGAC AGCGATGATT TATCTATCGA TCTGTTCGGC
CTCAACCATA TGGTGTTCAT TAAAGATGTG CTGGTAAATG GCAAATCACG CTTTGCCGAA
TTGCTTGATG GTGTGGCATC CGGGCAGTTA AAAGCATCTG GCGTTAAAAA TATTTTCGAT
CTGCCATTTA GCGAAGGCTT AATTCGTTCT CTGAATCTGT TGCCGTGTTC TTATTTGCTT
TATTACTTCA AGCAAAAAGA GATGCTGGCT ATTGAAATGG GCGAATACTA CAAAGGCGGC
GCACGAGCGC AGGTAGTACA GAAAGTCGAG AAACAACTTT TTGAGCTGTA TAAAAACCCG
GAGTTGAAAG TTAAGCCGAA AGAACTGGAA CAGCGCGGTG GGGCTTATTA CTCTGATGCT
GCATGCGAAG TGATCAACGC TATCTACAAC GACAAGCAAG CTGAACATTA CGTTAATATC
CCGCATCATG GGCATATTGA TAATATTCCG GCAGACTGGG CGGTAGAAAT GACCTGTACG
CTGGGGCGCG ATGGCGCGAC GCCACATCCG CGCATTACGC ATTTCGATGA TAAAGTGATG
GGGCTGATTC ACACCATTAA AAGCTTCGAG ATTGCTGCCA GCAACGCCGC ACTTAGCGGA
GAGTTTAACG ATGTGTTACT GGCCCTAAAC CTTAGTCCGT TGGTGCATTC CGATCGCGAT
GCTGAGCTGC TGGCACGCGA GATGATTCTG GCGCACGAGA AATGGTTGCC AAATTTTGCC
GACTGCATCG CAGAGCTTAA AAAAGCACAT TAA
 
Protein sequence
MSQKLKVVTI GGGSSYTPEL LEGFIKRYHE LPVSELWLVD VEDGKEKLDI IFDLCQRMID 
NAGVPMKLYK TLDRREALKD ADFVTTQLRV GQLPARELDE RIPLSHGYLG QETNGAGGLF
KGLRTIPVIF DIVKDVEELC PNAWVINFTN PAGMVTEAVY RHTGFKRFIG VCNIPIGMKM
FIRDVLMLKD SDDLSIDLFG LNHMVFIKDV LVNGKSRFAE LLDGVASGQL KASGVKNIFD
LPFSEGLIRS LNLLPCSYLL YYFKQKEMLA IEMGEYYKGG ARAQVVQKVE KQLFELYKNP
ELKVKPKELE QRGGAYYSDA ACEVINAIYN DKQAEHYVNI PHHGHIDNIP ADWAVEMTCT
LGRDGATPHP RITHFDDKVM GLIHTIKSFE IAASNAALSG EFNDVLLALN LSPLVHSDRD
AELLAREMIL AHEKWLPNFA DCIAELKKAH