Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1961 |
Symbol | |
ID | 6968804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1852713 |
End bp | 1854980 |
Gene Length | 2268 bp |
Protein Length | 755 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643385887 |
Product | glycosyl hydrolase, family 65 |
Protein accession | YP_002270376 |
Protein GI | 209397081 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1554] Trehalose and maltose hydrolases (possible phosphorylases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.756125 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.752383 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGGC CAGTAACGTT ATCAGAACCC CATTTCAGCC AGCATACCCT GAACAAGTAT GCATCGCTGA TGGCGCAGGG GAACGGCTAT CTTGGGCTTC GCGCCAGCCA TGAAGAAGAT TACACGCGCC AGACGCGAGG GATGTATCTG GCGGGGCTGT ATCATCGGGC GGGAAAAGGT GAAATCAACG AACTGGTGAA CCTGCCTGAT ATCTTGGGGA TGGAGATTGC CATAAATGGT GAGGTTTTCT CGTTATCCCA CGAAGCCTGG CAGCGTGAGC TTGACTTTGC CAGTGGCGAA TTACGCCGCA ACGTTGTCTG GCGTACCAGC AACGGCGCAG GTTACACCAT CGCCAGCCGT CGCTTTGTTT CGGCAGACCA ACTGCCGCTC ATTGCGCTGG AAATCACTAT TACGCCACTG GACGCCGACG CGTCAGTGCT GATTTCAACA GGTATCGACG CCACGCAAAC CAACCACGGT CGCCAACATC TCGACGAAAC CCAGGTGCGG GTGTTTGGTC AGCATCTGAT GCAGGGGATC TACACCACCC AGGATGGACG CAGTGATGTG GCCATCAGCT GTTGCTGTAA GGTGAGCGGT GATGTGCAGC AATGCTATAC CGCCAAAGAG CGCCGTTTGC TGCAACATAC CAGTGCGCAG CTTCATGCAG GCGAGACAGT GACGTTGCAA AAACTGGTGT GGATCGACTG GCGGGATGAC AGGCAAGCCG TTTTAGACGA GTGGGGCAGC GCGTCGCTTC GCCAGCTTGA AATATGCGCG CAGCAGAGTT ACGACCAACT TCTTGCAGCA TCAACAGAAA ACTGGCGTCA ATGGTGGCAG AAACGTCGTA TCACGGTAAA TGGCGGCGAT GCGCACGATC AGCAAGCGTT AGATTATGCG CTTTATCATC TGCGCATCAT GACGCCGGCT CACGACGAGC GCAGCAGTAT TGCGGCAAAA GGCTTAACCG GCGAAGGCTA CAAAGGCCAC GTTTTCTGGG ATACAGAAGT ATTTTTGCTG CCGTTCCATC TGTTTAGCGA TCCGACGGTT GCCCGAAGTT TACTGCGTTA TCGCTGGCAC AACTTGCCAG GCGCGCAGGA GAAAGCACGG CGCAGCGGCT GGCAGGGCGC GCTATTTCCG TGGGAAAGCG CGCGCAGCGG CGAAGAAGAG ACGCCAGAAT TTGCCGCCAT TAACATTCGT ACCGGGCTGC GGCAAAAAGT GGCCTCGGCG CAGGCGGAAC ATCATCTGGT GGCCGATATC GCCTGGGCGG TTATTCAATA CTGGCAGACC ACGGGGGATG AAAGTTTCAT TGCTCATGAA GGCATGGCGC TACTTCTGGA AACTGCAAAG TTCTGGATTA GCCGCGCGGT GAGGGTTAAC GACCGTCTGG AAATTCATGA TGTTATTGGG CCAGACGAAT ATACCGAACA TGTCAATAAT AACGCCTTCA CCAGCTATAT GGCCCGCTAC AACGTTCAAC AGGCGCTGAA TATTGCCCGC CAGTTCGGCT GTAGCGACGA TGCGTTTATC CATCGCGCCG AAATGTTCCT CAAAGAGCTA TGGATGCCAG AAACGCAGCC CGATGGCGTT TTGCCGCAGG ATGATTCGTT TATGGCTAAG CCGGCGATTA ATCTGGCTAA ATACAAAGCG GCGGCGGGGA AGCAAACCAT TCTGCTGGAT TATTCACGCG CAGAAGTGAA CGAGATGCAG ATCCTCAAAC AAGCTGATGT GGTGATGCTC AATTACATGC TGCCGGAGCA GTTCTCAGCG GTATCGTGTC TTGCCAATCT GCAATTTTAT GAACCGCGCA CTATTCACGA CTCGTCATTA AGTAAAGCAA TCCACGGCAT TGTTGCCGCA CGCTGTGGCC TGCTGACCCA AAGTTATCAG TTCTGGCGCG AGGGGACTGA AATCGATCTT GGTGCTGATC CGCATAGTTG TGATGATGGT ATCCACGCTG CCGCAACTGG CGCTATCTGG CTGGGGGCGA TTCAGGGTTT TGCCGGGGTG AGCGTGCGTG ACGGTGAATT ACATCTCAAT CCGGCGTTAC CGGAGCAGTG GCAACAGTTG TCTTTCCCTC TGTTCTGGCA GGGCTGCGAA TTACAGGTCA CGCTCGACGC GCAGCGTATT GCGATTCGAA CTTCTGCGCC CGTTTCACTG CGTTTGAACG GGCAGCTTAT ATCCGTGGCT GAAGAATCTG TTTTCTGTTT GGGTGATTTT ATTTTGCCCT TCAATGGGAC CGCTACCACG CATCAGGAGG ATGAATGA
|
Protein sequence | MTRPVTLSEP HFSQHTLNKY ASLMAQGNGY LGLRASHEED YTRQTRGMYL AGLYHRAGKG EINELVNLPD ILGMEIAING EVFSLSHEAW QRELDFASGE LRRNVVWRTS NGAGYTIASR RFVSADQLPL IALEITITPL DADASVLIST GIDATQTNHG RQHLDETQVR VFGQHLMQGI YTTQDGRSDV AISCCCKVSG DVQQCYTAKE RRLLQHTSAQ LHAGETVTLQ KLVWIDWRDD RQAVLDEWGS ASLRQLEICA QQSYDQLLAA STENWRQWWQ KRRITVNGGD AHDQQALDYA LYHLRIMTPA HDERSSIAAK GLTGEGYKGH VFWDTEVFLL PFHLFSDPTV ARSLLRYRWH NLPGAQEKAR RSGWQGALFP WESARSGEEE TPEFAAINIR TGLRQKVASA QAEHHLVADI AWAVIQYWQT TGDESFIAHE GMALLLETAK FWISRAVRVN DRLEIHDVIG PDEYTEHVNN NAFTSYMARY NVQQALNIAR QFGCSDDAFI HRAEMFLKEL WMPETQPDGV LPQDDSFMAK PAINLAKYKA AAGKQTILLD YSRAEVNEMQ ILKQADVVML NYMLPEQFSA VSCLANLQFY EPRTIHDSSL SKAIHGIVAA RCGLLTQSYQ FWREGTEIDL GADPHSCDDG IHAAATGAIW LGAIQGFAGV SVRDGELHLN PALPEQWQQL SFPLFWQGCE LQVTLDAQRI AIRTSAPVSL RLNGQLISVA EESVFCLGDF ILPFNGTATT HQEDE
|
| |