Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2452 |
Symbol | chbF |
ID | 6967773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2322953 |
End bp | 2324305 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 643386321 |
Product | 6-phospho-beta-glucosidase |
Protein accession | YP_002270803 |
Protein GI | 209396083 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGA AATTAAAAGT CGTCACTATT GGTGGCGGGA GCAGCTATAC CCCGGAGTTA CTGGAAGGAT TTATTAAGCG TTATCACGAA TTGCCGGTCA GCGAATTATG GCTGGTGGAT GTCGAAGGTG GCAAAGCTAA ACTGGATATT ATTTTCGATC TCTGCCAACG GATGATTGAT AACGCTGGCG TCCCGATGAA GCTTTATAAA ACGCTGGATC GCCGCGAAGC ATTGAAAGAT GCTGATTTCG TTACTACCCA ACTGCGCGTT GGCCAATTAC CGGCGCGTGA ACTGGATGAA CGTATTCCAT TAAGTCATGG TTATCTTGGT CAGGAAACCA ACGGCGCGGG CGGTCTGTTT AAAGGTCTGC GTACCATTCC GGTGATTTTT GACATCGTAA AAGATGTCGA AGAACTTTGT CCGAATGCAT GGGTGATTAA CTTCACTAAC CCGGCGGGAA TGGTCACTGA AGCCGTTTAT CGTCATACCG GATTTAAACG CTTTATCGGC GTGTGTAATA TTCCGATCGG CATGAAGATG TTTATTCGCG ATGTTCTGAT GCTGAAAGAC AGCGATGATT TATCTATCGA TCTGTTCGGC CTCAACCATA TGGTGTTCAT TAAGGATGTG CTGGTAAATG GCAAGTCGCG CTTTGCCGAA TTGCTTGATG GTGTGGCATC CGGGCAGTTA AAAGCATCTG GCGTTAAAAA TATTTTCGAT CTGCCATTTA GCGAAGGCTT AATTCGTTCT CTGAATCTGT TGCCGTGTTC TTATTTGCTT TATTACTTCA AGCAAAAAGA GATGCTGGCT ATTGAAATGG GCGAATACTA CAAAGGCGGC GCACGAGCGC AGGTAGTACA GAAAGTCGAG AAACAACTTT TTGAGCTGTA TAAAAACCCG GAGTTGAAAG TTAAGCCGAA AGAACTGGAA CAGCGCGGTG GGGCTTATTA CTCTGATGCA GCGTGCGAAG TGATCAACGC TATCTACAAC GATAAGCAAG CAGAACATTA CGTTAATATC CCGCATCATG GGCATATTGA TAATATTCCG GCAGACTGGG CGGTAGAAAT GACCTGTACG CTGGGGCGCG ATGGCGCGAC GCCACATCCG CGCATTACGC ATTTCGATGA TAAAGTGATG GGGCTGATTC ACACCATTAA AGGCTTCGAG ATTGCTGCCA GCAACGCCGC ACTTAGCGGA GAATTTAACG GTGTGTTACT GGCGCTAAAC CTTAGTCCGT TGGTGCATTC CGATCGCGAT GCTGAGCTGC TGGCACGCGA GATGATTCTG GCGCACGAGA AATGGCTGCC AAATTTTGCC GACTGCATCG CAGAGCTTAA AAAAGCACAT TAA
|
Protein sequence | MSQKLKVVTI GGGSSYTPEL LEGFIKRYHE LPVSELWLVD VEGGKAKLDI IFDLCQRMID NAGVPMKLYK TLDRREALKD ADFVTTQLRV GQLPARELDE RIPLSHGYLG QETNGAGGLF KGLRTIPVIF DIVKDVEELC PNAWVINFTN PAGMVTEAVY RHTGFKRFIG VCNIPIGMKM FIRDVLMLKD SDDLSIDLFG LNHMVFIKDV LVNGKSRFAE LLDGVASGQL KASGVKNIFD LPFSEGLIRS LNLLPCSYLL YYFKQKEMLA IEMGEYYKGG ARAQVVQKVE KQLFELYKNP ELKVKPKELE QRGGAYYSDA ACEVINAIYN DKQAEHYVNI PHHGHIDNIP ADWAVEMTCT LGRDGATPHP RITHFDDKVM GLIHTIKGFE IAASNAALSG EFNGVLLALN LSPLVHSDRD AELLAREMIL AHEKWLPNFA DCIAELKKAH
|
| |