Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3935 |
Symbol | |
ID | 5591162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3929469 |
End bp | 3930881 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640923042 |
Product | 6-phospho-beta-glucosidase |
Protein accession | YP_001460519 |
Protein GI | 157163201 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 0.359995 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCAT TTCCAGAAAC ATTTCTTTGG GGTGGCGCAA CAGCTGCCAA TCAGGTGGAA GGTGCCTGGC AGGAAGATGG CAAAGGGATC TCGACCTCAG ATTTACAGCC TCATGGCGTA ATGGGAAAAA TGGAACCGCG CATCCTGGGG AAAGAGAATA TCAAAGATGT CGCCATCGAT TTTTATCACC GTTACTCGGA AGATATCGCG TTATTTGCCG AGATGGGCTT CACCTGTCTG CGTATTTCCA TTGCCTGGGC GCGAATTTTC CCTCAGGGCG ACGAAGTCGA ACCGAATGAA GCGGGGTTAG CGTTTTACGA TCGGCTGTTT GATGAAATGG CGCAGGCGGG GATCAAGCCG CTGGTAACGT TATCCCATTA CGAAATGCCA TATGGGCTGG TGAAAAACTA CGGCGGTTGG GCTAATCGAG CGGTCATCGA TCACTTCGAG CATTACGCCC GCACGGTCTT TACTCGCTAC CAACATAAAG TGGCGTTATG GCTGACGTTT AATGAAATCA ACATGTCGTT ACACGCGCCA TTCACGGGCG TGGGGCTGGC AGAAGAGAGT GGCGAGGCGG AAGTTTATCA GGCTATCCAC CATCAACTGG TTGCCAGTGC GCGGGCAGTT AAAGCCTGTC ATAGCCTGCT ACCCGAAGCG AAAATCGGCA ATATGCTTCT CGGTGGGCTG GTTTACCCCC TCACCTGCCA GCCACAGGAT ATGTTGCAGG CCATGGAAGA GAACCGGCGC TGGATGTTCT TTGGTGATGT TCAGGCGCGT GGCCAGTATC CCGGCTATAT GCAGCGTTTC TTCCGCGACC ACAATATCAC CATTGAGATG ACTGAAAGTG ACGCAGAAGA TTTAAAACAT ACCGTCGATT TCATCTCTTT TAGTTATTAC ATGACTGGTT GTGTTTCCCA CGACGAAAGC ATTAATAAAA ATGCGCAGGG CAACATACTG AATATGATCC CCAATCCGCA TCTGAAAAGT TCAGAGTGGG GGTGGCAAAT TGATCCGGTT GGATTACGGG TTCTGTTAAA TACGCTTTGG GATCGTTATC AAAAACCGTT ATTTATTGTC GAGAACGGAT TAGGCGCAAA AGACAGCGTT GAAGCGGATG GTTCGATACA GGACGATTAT CGAATTGCCT ATTTAAACGA TCACCTGGTA CAGGTAAATG AAGCGATTGC CGATGGTGTG TATATTATGG GGTACACCAG TTGGGGGCCA ATTGATTTAG TCAGTGCATC TCATTCACAA ATGTCTAAGC GCTACGGCTT TATTTATGTG GATCGTGATG ATAATGGCGA AGGAAGCCTC ACAAGAACAC GCAAGAAAAG CTTCGGATGG TATGCAGAGG TGATCAAGAC GCGGGGGCTG TCATTAAAAA AAATAACCAT TAAAGCACCT TAA
|
Protein sequence | MKAFPETFLW GGATAANQVE GAWQEDGKGI STSDLQPHGV MGKMEPRILG KENIKDVAID FYHRYSEDIA LFAEMGFTCL RISIAWARIF PQGDEVEPNE AGLAFYDRLF DEMAQAGIKP LVTLSHYEMP YGLVKNYGGW ANRAVIDHFE HYARTVFTRY QHKVALWLTF NEINMSLHAP FTGVGLAEES GEAEVYQAIH HQLVASARAV KACHSLLPEA KIGNMLLGGL VYPLTCQPQD MLQAMEENRR WMFFGDVQAR GQYPGYMQRF FRDHNITIEM TESDAEDLKH TVDFISFSYY MTGCVSHDES INKNAQGNIL NMIPNPHLKS SEWGWQIDPV GLRVLLNTLW DRYQKPLFIV ENGLGAKDSV EADGSIQDDY RIAYLNDHLV QVNEAIADGV YIMGYTSWGP IDLVSASHSQ MSKRYGFIYV DRDDNGEGSL TRTRKKSFGW YAEVIKTRGL SLKKITIKAP
|
| |