Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1806 |
Symbol | |
ID | 6146197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1825698 |
End bp | 1827965 |
Gene Length | 2268 bp |
Protein Length | 755 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641616682 |
Product | glycosy hydrolase family protein |
Protein accession | YP_001743860 |
Protein GI | 170682507 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1554] Trehalose and maltose hydrolases (possible phosphorylases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.769957 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGGC CAGTAACGTT AACGGAACCC CATTTCAGCC AGCATACCCT GAACAAGTAT GCTTCGCTGA TGGCGCAGGG GAACGGCTAT CTTGGGCTTC GCGCCAGCCA TGAAGAAGAT TATACGCGCC AGACACGCGG TATGTATCTG GCGGGGCTTT ATCATCGCGC GGGGAAAAAT GACATCAACG AACTGGTGAA CCTGCCTGAC GTCATAGGGA TGGAGATTAC CTTGAATGGT GAACTTTTTG CGCTATCCCG CGAAACGTGG CAGCGCGAGC TTGACTTCGC CAGTGGAGAA TTACGTCGCA ATGTCGTCTG GTCTTCAGCC AGCGGCGCAC GTTATGCCAT CGCCAGCCGT CGCTTTGTTT CGGCAGAACA ACTGCCGCTG ATGGCGCTGG AAATCAGCAT TACGCCGCTG GACGCTGACG CGTCAGTGCT GATTTCAACA GGCATTGACG CCACGCAAAC CAACCACGGA CGACAACATC TCGACGAAAC CCAGGTGCGG GTGTTTGGTC AGCATTTGAT GCAGGGGATC TATACCACTC AGGATGGACG CAGTGATGTG GCCATCAGCT GTTGCTGCCA GGTGAGCGGT GACGTGCAGC AATGCTATAC CGCCAAAGAG CGCCGCTTGC TGCAACATAC CAGTGCACAA CTACCTGCGG GCAAAACGCT GACGCTGCAA AAACGGGTGT GGATCGACTG GCGGGACGAC AGACACGTCG CTTTAGACGA GTGGGGTAGT GCATCGCTTC GTCAGCTTGA AATGTGTGTA CAGCAGAGTT ACGACCAACT TCTTGCAGCA TCCACAGAAA ACTGGCGTCA ATGGTGGCAG AAACGTCGTA TCACGGTAAA CGGTGGCGAT GCGCACGATC AGCAAGCGTT AGATTATGCG CTTTATCACC TACGCATCAT GACGCCTGCT CACGACGAGC GCAGCAGTAT TGCGGCAAAA GGTTTGACCG GGGAAGGCTA CAAAGGCCAC GTTTTCTGGG ATACGGAAGT ATTTTTACTG CCGTTCCATC TGTTTAGCGA ACCGACGATT GCCAGAAGTT TACTGCGTTA TCGCTGGCAC AACTTGCCAG GCGCGCAGGA GAAAGCGCGA CGCAACGGCT GGCAGGGCGC GCTATTTCCG TGGGAAAGCG CGCGCAGCGG CGAAGAAGAG ACGCCGGAAT TTGCCGCCAT TAACATTCGC ACCGGGCTGC GGCAAAAAGT GGCCTCGGCG CAGGCGGAAC ATCATCTGGT CGCCGATATC GCCTGGGCGG TTGTTCAATA CTGGCAGACC ACGGGGGATG AAAGTTTCAT TGCTCATGAA GGCATGGCGC TACTTCTGGA AACTGCAAAG TTCTGGATTA GCCGCACGGT GAGGGTTAAC GACCGTCTGG AAATTCATGA TGTTATTGGG CCAGACGAAT ATACCGAACA TGTCAATAAT AACGCCTTCA CCAGCTATAT GGCGTATTAC AACGTCCAGC AGGCGCTGAA CATCGCTCGT CAATTTGGCT GTAGCGACGA TGCGTTTATC CATCGCGCCG AAATGTTCCT TAAAGAACTG CGGCTGCCAG AAATTCAGCC CGACGGCGTT TTGCCGCAGG ATGATTCGTT TATGGCGAAA CCAGCGATTA ATCTGGCGAA ATACAAAGCG GCGGCGGGGA AGCAAACCAT TCTGCTGGAT TATTCACGCG CAGAAGTGAA CGAGATGCAG ATCCTCAAAC AAGCTGATGT GGTGATGCTC AATTACATGC TGCCGGAGCA GTACTCAGCG GCATCGTGTC TTGCCAATCT GCAATTTTAT GAACCGCGCA CCATTCACGA CTCTTCACTG AGTAAAGCGA TCCACGGCAT TGTTGCCGCA CGCTGTGGCC TGTTGGCGCA AAGTTATCAG TTCTGGCGGG AGGGGACTGA AATCGATCTT GGTGCTGATC CGCATAGCTG TGATGACGGT ATCCACGCTG CCGCAACTGG CGCAATCTGG TTGGGGGCGA TTCAGGGTTT TGCCGGGGCG AGCGTGCGCA ACGGTGAATT ACATCTCAAT CCGGCGTTAC CTGAGCAGTG GCAACAGTTG TCTTTCCCTC TGTTCTGGCA GGGCTGTGAA TTACAGGTCA CTCTTGACGC GCAGCGCATT GCGATTCGGA CTTCTGCGCC AGTTTCACTG CGTTTGAACG GTCAGCTTAT TTCAGTCTCT GAAGAATCTG TTTTCTGTTT AGGGGATTTT ATTTTGCCCT TCAATGGGAC CGCTACCACG CATCAGGAGG GTGAATGA
|
Protein sequence | MTRPVTLTEP HFSQHTLNKY ASLMAQGNGY LGLRASHEED YTRQTRGMYL AGLYHRAGKN DINELVNLPD VIGMEITLNG ELFALSRETW QRELDFASGE LRRNVVWSSA SGARYAIASR RFVSAEQLPL MALEISITPL DADASVLIST GIDATQTNHG RQHLDETQVR VFGQHLMQGI YTTQDGRSDV AISCCCQVSG DVQQCYTAKE RRLLQHTSAQ LPAGKTLTLQ KRVWIDWRDD RHVALDEWGS ASLRQLEMCV QQSYDQLLAA STENWRQWWQ KRRITVNGGD AHDQQALDYA LYHLRIMTPA HDERSSIAAK GLTGEGYKGH VFWDTEVFLL PFHLFSEPTI ARSLLRYRWH NLPGAQEKAR RNGWQGALFP WESARSGEEE TPEFAAINIR TGLRQKVASA QAEHHLVADI AWAVVQYWQT TGDESFIAHE GMALLLETAK FWISRTVRVN DRLEIHDVIG PDEYTEHVNN NAFTSYMAYY NVQQALNIAR QFGCSDDAFI HRAEMFLKEL RLPEIQPDGV LPQDDSFMAK PAINLAKYKA AAGKQTILLD YSRAEVNEMQ ILKQADVVML NYMLPEQYSA ASCLANLQFY EPRTIHDSSL SKAIHGIVAA RCGLLAQSYQ FWREGTEIDL GADPHSCDDG IHAAATGAIW LGAIQGFAGA SVRNGELHLN PALPEQWQQL SFPLFWQGCE LQVTLDAQRI AIRTSAPVSL RLNGQLISVS EESVFCLGDF ILPFNGTATT HQEGE
|
| |