Gene EcSMS35_1806 details


Gene Information

Locus tag: EcSMS35_1806
Symbol:
ID: 6146197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1825698
End bp: 1827965
Gene Length: 2268 bp
Protein Length: 755 aa
Translation table: 11
GC content: 54%
IMG OID: 641616682
Product: glycosyl hydrolase family protein
Protein accession: YP_001743860
Protein GI: 170682507
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1554] Trehalose and maltose hydrolases (possible phosphorylases)
TIGRFAM ID:
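
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1825698, 1827965)
print(length)                     # 2268
print(protein_length_aa(length))  # 755
```

Both values match the Gene Length and Protein Length fields of this record.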


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.769957
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAGGC CAGTAACGTT AACGGAACCC CATTTCAGCC AGCATACCCT GAACAAGTAT 
GCTTCGCTGA TGGCGCAGGG GAACGGCTAT CTTGGGCTTC GCGCCAGCCA TGAAGAAGAT
TATACGCGCC AGACACGCGG TATGTATCTG GCGGGGCTTT ATCATCGCGC GGGGAAAAAT
GACATCAACG AACTGGTGAA CCTGCCTGAC GTCATAGGGA TGGAGATTAC CTTGAATGGT
GAACTTTTTG CGCTATCCCG CGAAACGTGG CAGCGCGAGC TTGACTTCGC CAGTGGAGAA
TTACGTCGCA ATGTCGTCTG GTCTTCAGCC AGCGGCGCAC GTTATGCCAT CGCCAGCCGT
CGCTTTGTTT CGGCAGAACA ACTGCCGCTG ATGGCGCTGG AAATCAGCAT TACGCCGCTG
GACGCTGACG CGTCAGTGCT GATTTCAACA GGCATTGACG CCACGCAAAC CAACCACGGA
CGACAACATC TCGACGAAAC CCAGGTGCGG GTGTTTGGTC AGCATTTGAT GCAGGGGATC
TATACCACTC AGGATGGACG CAGTGATGTG GCCATCAGCT GTTGCTGCCA GGTGAGCGGT
GACGTGCAGC AATGCTATAC CGCCAAAGAG CGCCGCTTGC TGCAACATAC CAGTGCACAA
CTACCTGCGG GCAAAACGCT GACGCTGCAA AAACGGGTGT GGATCGACTG GCGGGACGAC
AGACACGTCG CTTTAGACGA GTGGGGTAGT GCATCGCTTC GTCAGCTTGA AATGTGTGTA
CAGCAGAGTT ACGACCAACT TCTTGCAGCA TCCACAGAAA ACTGGCGTCA ATGGTGGCAG
AAACGTCGTA TCACGGTAAA CGGTGGCGAT GCGCACGATC AGCAAGCGTT AGATTATGCG
CTTTATCACC TACGCATCAT GACGCCTGCT CACGACGAGC GCAGCAGTAT TGCGGCAAAA
GGTTTGACCG GGGAAGGCTA CAAAGGCCAC GTTTTCTGGG ATACGGAAGT ATTTTTACTG
CCGTTCCATC TGTTTAGCGA ACCGACGATT GCCAGAAGTT TACTGCGTTA TCGCTGGCAC
AACTTGCCAG GCGCGCAGGA GAAAGCGCGA CGCAACGGCT GGCAGGGCGC GCTATTTCCG
TGGGAAAGCG CGCGCAGCGG CGAAGAAGAG ACGCCGGAAT TTGCCGCCAT TAACATTCGC
ACCGGGCTGC GGCAAAAAGT GGCCTCGGCG CAGGCGGAAC ATCATCTGGT CGCCGATATC
GCCTGGGCGG TTGTTCAATA CTGGCAGACC ACGGGGGATG AAAGTTTCAT TGCTCATGAA
GGCATGGCGC TACTTCTGGA AACTGCAAAG TTCTGGATTA GCCGCACGGT GAGGGTTAAC
GACCGTCTGG AAATTCATGA TGTTATTGGG CCAGACGAAT ATACCGAACA TGTCAATAAT
AACGCCTTCA CCAGCTATAT GGCGTATTAC AACGTCCAGC AGGCGCTGAA CATCGCTCGT
CAATTTGGCT GTAGCGACGA TGCGTTTATC CATCGCGCCG AAATGTTCCT TAAAGAACTG
CGGCTGCCAG AAATTCAGCC CGACGGCGTT TTGCCGCAGG ATGATTCGTT TATGGCGAAA
CCAGCGATTA ATCTGGCGAA ATACAAAGCG GCGGCGGGGA AGCAAACCAT TCTGCTGGAT
TATTCACGCG CAGAAGTGAA CGAGATGCAG ATCCTCAAAC AAGCTGATGT GGTGATGCTC
AATTACATGC TGCCGGAGCA GTACTCAGCG GCATCGTGTC TTGCCAATCT GCAATTTTAT
GAACCGCGCA CCATTCACGA CTCTTCACTG AGTAAAGCGA TCCACGGCAT TGTTGCCGCA
CGCTGTGGCC TGTTGGCGCA AAGTTATCAG TTCTGGCGGG AGGGGACTGA AATCGATCTT
GGTGCTGATC CGCATAGCTG TGATGACGGT ATCCACGCTG CCGCAACTGG CGCAATCTGG
TTGGGGGCGA TTCAGGGTTT TGCCGGGGCG AGCGTGCGCA ACGGTGAATT ACATCTCAAT
CCGGCGTTAC CTGAGCAGTG GCAACAGTTG TCTTTCCCTC TGTTCTGGCA GGGCTGTGAA
TTACAGGTCA CTCTTGACGC GCAGCGCATT GCGATTCGGA CTTCTGCGCC AGTTTCACTG
CGTTTGAACG GTCAGCTTAT TTCAGTCTCT GAAGAATCTG TTTTCTGTTT AGGGGATTTT
ATTTTGCCCT TCAATGGGAC CGCTACCACG CATCAGGAGG GTGAATGA
 
Protein sequence
MTRPVTLTEP HFSQHTLNKY ASLMAQGNGY LGLRASHEED YTRQTRGMYL AGLYHRAGKN 
DINELVNLPD VIGMEITLNG ELFALSRETW QRELDFASGE LRRNVVWSSA SGARYAIASR
RFVSAEQLPL MALEISITPL DADASVLIST GIDATQTNHG RQHLDETQVR VFGQHLMQGI
YTTQDGRSDV AISCCCQVSG DVQQCYTAKE RRLLQHTSAQ LPAGKTLTLQ KRVWIDWRDD
RHVALDEWGS ASLRQLEMCV QQSYDQLLAA STENWRQWWQ KRRITVNGGD AHDQQALDYA
LYHLRIMTPA HDERSSIAAK GLTGEGYKGH VFWDTEVFLL PFHLFSEPTI ARSLLRYRWH
NLPGAQEKAR RNGWQGALFP WESARSGEEE TPEFAAINIR TGLRQKVASA QAEHHLVADI
AWAVVQYWQT TGDESFIAHE GMALLLETAK FWISRTVRVN DRLEIHDVIG PDEYTEHVNN
NAFTSYMAYY NVQQALNIAR QFGCSDDAFI HRAEMFLKEL RLPEIQPDGV LPQDDSFMAK
PAINLAKYKA AAGKQTILLD YSRAEVNEMQ ILKQADVVML NYMLPEQYSA ASCLANLQFY
EPRTIHDSSL SKAIHGIVAA RCGLLAQSYQ FWREGTEIDL GADPHSCDDG IHAAATGAIW
LGAIQGFAGA SVRNGELHLN PALPEQWQQL SFPLFWQGCE LQVTLDAQRI AIRTSAPVSL
RLNGQLISVS EESVFCLGDF ILPFNGTATT HQEGE
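
The sequences above are laid out in space-separated blocks of 10 characters, six blocks per line, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned and checked against the record's fields; the helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence, used only as a small demonstration.
fragment = clean_sequence("ATGACCAGGC CAGTAACGTT \n")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.5
```

Applied to the full gene sequence, `len(clean_sequence(...))` can be compared with the Gene Length field and `gc_fraction` with the GC content field.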