Gene Information |
Locus tag | EcE24377A_1527 |
Symbol | |
ID | 5586196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 1509486 |
End bp | 1511753 |
Gene Length | 2268 bp |
Protein Length | 755 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640925218 |
Product | glycosyl hydrolase family protein |
Protein accession | YP_001462623 |
Protein GI | 157155852 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1554] Trehalose and maltose hydrolases (possible phosphorylases) |
TIGRFAM ID | |
|
|
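The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions match the listed values but are not stated in the record. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Number of amino acids, assuming the gene length includes one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(1509486, 1511753)
print(length)                  # 2268, the listed Gene Length
print(protein_length(length))  # 755, the listed Protein Length
```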
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAGGC CAGTAACGTT ATCAGAACCC CATTTCAGCC AGCATACCCT GAACAAGTAT GCATCGCTGA TGGCGCAGGG GAACGGCTAT CTTGGGCTTC GCGCCAGCCA TGAAGAAGAT TACACGAGCC AGACGCGAGG GATGTATCTG GCGGGGCTGT ATCATCGGGC GGGAAAAGGT GAAATCAACG AACTGGTGAA CCTGCCTGAT ATCTTGGGGA TGGAGATTGC CATAAATGGT GAGGTTTTCT CGTTATCCCG CGAAGCCTGG CAGCGTGAAC TTGACTTTGC CAGTGGAGAA TTACGCCGTA GCGTTGTCTG GCGTACCAGC AACGGCACAG GTTACACCAT CGCCAGCCGT CGCTTTGTTT CGGCAGACCA ACTGCCGCTC ATTGTGCTGG AAATCACTAT TACGCCACTG GACGCCGACG CGTCAGTGCT GATTTCAACA GGTATCGACG CCACGCAAAC CAACCACGGT CGCCAACATC TCGACGAAAC CCAGGTGCGG GTGTTTGGTC AGCATCTGAT GCAGGGGATC TACACCACCC AGGATGGACG CAGTGATGTG GCCATCAGCT GTTGCTGTAT GGTGAGCGGT GATGTGCAGC AATGCTATAC CGCCAAAGAG CGCCGTTTGC AGCAACATAC CAGTGCGCAG CTTCATGCAG GCGAGACAGT GACGTTGCAA AAACTGGTGT GGATCGACTG GCGGGATGAC AGGCAAGCCG TTTTAGACGA GTGGGGCAGC GCGTCGCTTC GCCAGCTTGA AATGTGCGCG CAGCAGAGTT ACGACCAACT TCTTGCAGTA TCAACAGAAA ACTGGCGTCA ATGGTGGCAG AAACGTCGTA TCACGGTAAA TGGCGGCGAA GCGCACGATC AGCAAGCGTT AGATTATGCG CTTTATCATC TGCGCATCAT GACGCCTGCC CACGACGAGC GCAGCAGCAT TGCGGCAAAA GGCTTAACCG GCGAAGGCTA CAAAGGCCAC GTTTTCTGGG ATACAGAAGT ATTTTTGTTA CCGTTTCATC TGTTTAGCGA TCCGACGGTT GCCCGAAGTT TACTGCGTTA TCGCTGGCAC AACTTGCCAG GCGCGCAGGA GAAAGCGCGA CGCAACGGCT GGCAGGGCGC GCTATTTCCG TGGGAAAGCG CGCGCAGCGG CGAAGAAGAG ACGCCGGAAT TTGCCGCCAT TAACATTCGC ACCGGGCTGC GGCAAAAAGT GGCCTCGGCG CAGGCGGAAC ATCATCTGGT GGCCGATATC GCCTGGGCGG TTATTCAATA CTGGCAGACC ACGGGGGATG AAAGTTTCAT TGCGCATGAA GGCATGGCGC TACTTCTGGA AACTGCAAAG TTCTGGATTA GCCGCGCGGT GAGGGTTAAC GACCGTCTGG AAATTCATGA TGTTATTGGG CCTGACGAAT ATACCGAACA TGTCAATAAT AACGCCTTCA CCAGCTATAT GGCGTATTAC AACGTCCAGC AGGCGCTGAG TATTGCCCGC CAGTTTGGCT GTAGCGACGA TGCGTTTATC CATCGCGCCG AAATGTTCCT TAAAGAACTG CGGCTGCCAG AAATTCAGCC CGACGGCGTT TTGCCGCAGG ATGATTCGTT TATGGCTAAG CCGGCGATTA ATCTGGCGAA ATACAAAGCG GCGGCGGGGA AGCAAACCAT TCTGCTGGAT TATTCACGCG CAGAAGTGAA CGAGATGCAA ATCCTCAAAC AAGCTGATGT GGTGATGCTC AATTACATGC TGCCGGAGCA GTTCTCAGCG GCATCGTGTC TTGCCAATCT GCAATTTTAT 
GAACCGCGCA CTATTCACGA CTCGTCATTA AGTAAAGCAA TCCACGGCAT TGTTGCCGCA CGCTGTGGCC TGCTGACCCA AAGTTATCAG TTCTGGCGCG AGGGGACTGA AATCGATCTT GGTGCTGATC CGCATAGTTG TGATGATGGT ATCCATGCTG CCGCAACTGG CGCTATCTGG CTGGGGGCGA TTCAGGGTTT TGCCGGGGTG AGCGTGCGTG ACGGTGAATT GCATCTCAAT CCGGCGTTAC CTGAGCAGTG GCAACAGTTG TCTTTCCCTC TGTTCTGGCA GGGCTGCGAA TTACAGGTCA CTCTTGACGC GCAGCGTATT GCGATTCGAA CTTCTGCGCC CGTTTCACTG CGTTTGAACG GTCAGCTTAT ATCCGTGGCT GAAGAATCTG TTTTCTGTTT GGGTGATTTT ATTTTGCCCT TCAATGGGAC CGCTACCACA CATCAGGAGG ATGAATGA
|
Protein sequence | MTRPVTLSEP HFSQHTLNKY ASLMAQGNGY LGLRASHEED YTSQTRGMYL AGLYHRAGKG EINELVNLPD ILGMEIAING EVFSLSREAW QRELDFASGE LRRSVVWRTS NGTGYTIASR RFVSADQLPL IVLEITITPL DADASVLIST GIDATQTNHG RQHLDETQVR VFGQHLMQGI YTTQDGRSDV AISCCCMVSG DVQQCYTAKE RRLQQHTSAQ LHAGETVTLQ KLVWIDWRDD RQAVLDEWGS ASLRQLEMCA QQSYDQLLAV STENWRQWWQ KRRITVNGGE AHDQQALDYA LYHLRIMTPA HDERSSIAAK GLTGEGYKGH VFWDTEVFLL PFHLFSDPTV ARSLLRYRWH NLPGAQEKAR RNGWQGALFP WESARSGEEE TPEFAAINIR TGLRQKVASA QAEHHLVADI AWAVIQYWQT TGDESFIAHE GMALLLETAK FWISRAVRVN DRLEIHDVIG PDEYTEHVNN NAFTSYMAYY NVQQALSIAR QFGCSDDAFI HRAEMFLKEL RLPEIQPDGV LPQDDSFMAK PAINLAKYKA AAGKQTILLD YSRAEVNEMQ ILKQADVVML NYMLPEQFSA ASCLANLQFY EPRTIHDSSL SKAIHGIVAA RCGLLTQSYQ FWREGTEIDL GADPHSCDDG IHAAATGAIW LGAIQGFAGV SVRDGELHLN PALPEQWQQL SFPLFWQGCE LQVTLDAQRI AIRTSAPVSL RLNGQLISVA EESVFCLGDF ILPFNGTATT HQEDE
|
| |
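The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a record might be processed follows: it strips the whitespace, computes GC content, and translates the nucleotide sequence with the codon assignments of translation table 11. The helper names are hypothetical, and the sketch ignores alternative start codons (this gene begins with ATG, so that does not matter here).

```python
from itertools import product

# NCBI codon ordering (T, C, A, G); table 11 shares these amino acid
# assignments with the standard code and differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence in this record.
fragment = clean_sequence("ATGACCAGGC CAGTAACGTT ATCAGAACCC")
print(translate(fragment))  # MTRPVTLSEP, the first block of the protein sequence
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_content` and `translate` would allow the listed GC content and Protein sequence to be cross-checked.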