Gene Information |
Locus tag | EcE24377A_1955 |
Symbol | chbF |
ID | 5587016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 1942770 |
End bp | 1944122 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640925627 |
Product | 6-phospho-beta-glucosidase |
Protein accession | YP_001463030 |
Protein GI | 157156065 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
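The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the values listed) and that the gene length includes the stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1942770, 1944122)
print(length)                  # 1353
print(protein_length(length))  # 450
```

Both results match the Gene Length (1353 bp) and Protein Length (450 aa) fields of this record.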
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGA AATTAAAAGT CGTCACTATT GGTGGCGGGA GCAGCTATAC CCCGGAGTTA CTGGAAGGAT TTATTAAGCG TTATCACGAA TTGCCGGTCA GCGAATTATG GCTGGTGGAT GTCGAAGATG GTAAAGAGAA ACTGGATATC ATTTTTGAAC TCTGCCAACG GATGATTGAT AACGCTGGCG TCCCGATGAA GCTTTATAAA ACGCTGGATC GCCGCGAAGC ATTGAAAGAT GCTGATTTCG TTACTACCCA ACTGCGCGTA GGCCAATTAC CGGCGCGCGA ACTGGATGAA CGTATTCCAT TAAGTCATGG TTATCTTGGT CAGGAAACCA ACGGCGCGGG CGGTCTGTTT AAAGGTCTGC GTACCATTCC GGTGATTTTT GACATCGTAA AAGATGTCGA AGAACTTTGT CCGAATGCAT GGGTGATTAA CTTCACTAAC CCGGCGGGAA TGGTCACTGA AGCCGTTTAT CGTCATACCG GATTTAAACG CTTTATCGGC GTGTGTAATA TTCCGATCGG CATGAAGATG TTTATTCGCG ATGTTCTGAT GCTGAAAGAC AGCGATGATT TATCTATCGA TCTGTTCGGC CTCAACCATA TGGTGTTCAT TAAGGATGTG CTGGTAAATG GCAAGTCGCG CTTTGCCGAA TTGCTTGATG GTGTGGCGTC AGGGCAGTTA AAAGCGTCCT CTGTAAAAAA TATTTTCGAT CTGCCATTTA GTGAGGGCTT AATTCGTTCG TTAAATCTGC TGCCATGTTC TTATCTGCTG TATTACTTCA AGCAGAAAGA GATGCTGGCT ATTGAAATGG GCGAATACTA CAAAGGCGGC GCACGAGCAC AGGTAGTACA GAAAGTCGAG AAACAACTTT TTGAGCTGTA TAAAAATCCG GAGTTGAAAG TTAAGCCGAA AGAACTGGAA CAGCGCGGTG GGGCTTATTA CTCTGATGCA GCGTGCGAAG TGATCAACGC TATCTACAAC GACAAGCAAG CTGAACATTA CGTTAATATC CCGCATCATG GGCATATTGA TAATATTCCG GCAGACTGGG CGGTAGAAAT GACCTGTAAG CTGGGGCGCG ATGGCGCGAC GCCACATCCG CGCATTACGC ATTTCGATGA TAAAGTGATG GGGCTGATTC ACACCATCAA AGGCTTCGAG ATTGCTGCCA GCAACGCCGC ACTTAGCGGA GAATTTAACG ATGTGTTACT GGCGCTAAAC CTTAGTCCGT TGGTGCATTC CGATCGCGAT GCTGAGCTGC TGGCACGCGA GATGATTCTG GCGCACGAGA AATGGCTGCC AAATTTTGCC GACTGCATCG CAGAGCTTAA AAAAGCACAT TAA
|
Protein sequence | MSQKLKVVTI GGGSSYTPEL LEGFIKRYHE LPVSELWLVD VEDGKEKLDI IFELCQRMID NAGVPMKLYK TLDRREALKD ADFVTTQLRV GQLPARELDE RIPLSHGYLG QETNGAGGLF KGLRTIPVIF DIVKDVEELC PNAWVINFTN PAGMVTEAVY RHTGFKRFIG VCNIPIGMKM FIRDVLMLKD SDDLSIDLFG LNHMVFIKDV LVNGKSRFAE LLDGVASGQL KASSVKNIFD LPFSEGLIRS LNLLPCSYLL YYFKQKEMLA IEMGEYYKGG ARAQVVQKVE KQLFELYKNP ELKVKPKELE QRGGAYYSDA ACEVINAIYN DKQAEHYVNI PHHGHIDNIP ADWAVEMTCK LGRDGATPHP RITHFDDKVM GLIHTIKGFE IAASNAALSG EFNDVLLALN LSPLVHSDRD AELLAREMIL AHEKWLPNFA DCIAELKKAH
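The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch, not the database's own method, of how the gene sequence could be cleaned, its GC fraction computed, and its codons translated. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand), and it uses the codon assignments of translation table 11, which match the standard code; alternative start codons are not handled. All function names are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = clean_sequence("ATGAGCCAGA AATTAAAAGT CGTCACTATT")
print(translate(prefix))  # MSQKLKVVTI
```

The translated prefix agrees with the first block of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.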