Gene EcE24377A_1955 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_1955 
SymbolchbF 
ID5587016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp1942770 
End bp1944122 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content47% 
IMG OID640925627 
Product6-phospho-beta-glucosidase 
Protein accessionYP_001463030 
Protein GI157156065 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGA AATTAAAAGT CGTCACTATT GGTGGCGGGA GCAGCTATAC CCCGGAGTTA 
CTGGAAGGAT TTATTAAGCG TTATCACGAA TTGCCGGTCA GCGAATTATG GCTGGTGGAT
GTCGAAGATG GTAAAGAGAA ACTGGATATC ATTTTTGAAC TCTGCCAACG GATGATTGAT
AACGCTGGCG TCCCGATGAA GCTTTATAAA ACGCTGGATC GCCGCGAAGC ATTGAAAGAT
GCTGATTTCG TTACTACCCA ACTGCGCGTA GGCCAATTAC CGGCGCGCGA ACTGGATGAA
CGTATTCCAT TAAGTCATGG TTATCTTGGT CAGGAAACCA ACGGCGCGGG CGGTCTGTTT
AAAGGTCTGC GTACCATTCC GGTGATTTTT GACATCGTAA AAGATGTCGA AGAACTTTGT
CCGAATGCAT GGGTGATTAA CTTCACTAAC CCGGCGGGAA TGGTCACTGA AGCCGTTTAT
CGTCATACCG GATTTAAACG CTTTATCGGC GTGTGTAATA TTCCGATCGG CATGAAGATG
TTTATTCGCG ATGTTCTGAT GCTGAAAGAC AGCGATGATT TATCTATCGA TCTGTTCGGC
CTCAACCATA TGGTGTTCAT TAAGGATGTG CTGGTAAATG GCAAGTCGCG CTTTGCCGAA
TTGCTTGATG GTGTGGCGTC AGGGCAGTTA AAAGCGTCCT CTGTAAAAAA TATTTTCGAT
CTGCCATTTA GTGAGGGCTT AATTCGTTCG TTAAATCTGC TGCCATGTTC TTATCTGCTG
TATTACTTCA AGCAGAAAGA GATGCTGGCT ATTGAAATGG GCGAATACTA CAAAGGCGGC
GCACGAGCAC AGGTAGTACA GAAAGTCGAG AAACAACTTT TTGAGCTGTA TAAAAATCCG
GAGTTGAAAG TTAAGCCGAA AGAACTGGAA CAGCGCGGTG GGGCTTATTA CTCTGATGCA
GCGTGCGAAG TGATCAACGC TATCTACAAC GACAAGCAAG CTGAACATTA CGTTAATATC
CCGCATCATG GGCATATTGA TAATATTCCG GCAGACTGGG CGGTAGAAAT GACCTGTAAG
CTGGGGCGCG ATGGCGCGAC GCCACATCCG CGCATTACGC ATTTCGATGA TAAAGTGATG
GGGCTGATTC ACACCATCAA AGGCTTCGAG ATTGCTGCCA GCAACGCCGC ACTTAGCGGA
GAATTTAACG ATGTGTTACT GGCGCTAAAC CTTAGTCCGT TGGTGCATTC CGATCGCGAT
GCTGAGCTGC TGGCACGCGA GATGATTCTG GCGCACGAGA AATGGCTGCC AAATTTTGCC
GACTGCATCG CAGAGCTTAA AAAAGCACAT TAA
 
Protein sequence
MSQKLKVVTI GGGSSYTPEL LEGFIKRYHE LPVSELWLVD VEDGKEKLDI IFELCQRMID 
NAGVPMKLYK TLDRREALKD ADFVTTQLRV GQLPARELDE RIPLSHGYLG QETNGAGGLF
KGLRTIPVIF DIVKDVEELC PNAWVINFTN PAGMVTEAVY RHTGFKRFIG VCNIPIGMKM
FIRDVLMLKD SDDLSIDLFG LNHMVFIKDV LVNGKSRFAE LLDGVASGQL KASSVKNIFD
LPFSEGLIRS LNLLPCSYLL YYFKQKEMLA IEMGEYYKGG ARAQVVQKVE KQLFELYKNP
ELKVKPKELE QRGGAYYSDA ACEVINAIYN DKQAEHYVNI PHHGHIDNIP ADWAVEMTCK
LGRDGATPHP RITHFDDKVM GLIHTIKGFE IAASNAALSG EFNDVLLALN LSPLVHSDRD
AELLAREMIL AHEKWLPNFA DCIAELKKAH