Gene ECH74115_1961 details


Gene Information

Locus tag: ECH74115_1961
Symbol:
ID: 6968804
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1852713
End bp: 1854980
Gene Length: 2268 bp
Protein Length: 755 aa
Translation table: 11
GC content: 54%
IMG OID: 643385887
Product: glycosyl hydrolase, family 65
Protein accession: YP_002270376
Protein GI: 209397081
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1554] Trehalose and maltose hydrolases (possible phosphorylases)
TIGRFAM ID:
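
The coordinates and lengths listed above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon while the protein length does not. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


length = gene_length(1852713, 1854980)
print(length)                  # 2268
print(protein_length(length))  # 755
```

Both results match the Gene Length (2268 bp) and Protein Length (755 aa) fields of the record.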


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.756125
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value: 0.752383
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAGGC CAGTAACGTT ATCAGAACCC CATTTCAGCC AGCATACCCT GAACAAGTAT 
GCATCGCTGA TGGCGCAGGG GAACGGCTAT CTTGGGCTTC GCGCCAGCCA TGAAGAAGAT
TACACGCGCC AGACGCGAGG GATGTATCTG GCGGGGCTGT ATCATCGGGC GGGAAAAGGT
GAAATCAACG AACTGGTGAA CCTGCCTGAT ATCTTGGGGA TGGAGATTGC CATAAATGGT
GAGGTTTTCT CGTTATCCCA CGAAGCCTGG CAGCGTGAGC TTGACTTTGC CAGTGGCGAA
TTACGCCGCA ACGTTGTCTG GCGTACCAGC AACGGCGCAG GTTACACCAT CGCCAGCCGT
CGCTTTGTTT CGGCAGACCA ACTGCCGCTC ATTGCGCTGG AAATCACTAT TACGCCACTG
GACGCCGACG CGTCAGTGCT GATTTCAACA GGTATCGACG CCACGCAAAC CAACCACGGT
CGCCAACATC TCGACGAAAC CCAGGTGCGG GTGTTTGGTC AGCATCTGAT GCAGGGGATC
TACACCACCC AGGATGGACG CAGTGATGTG GCCATCAGCT GTTGCTGTAA GGTGAGCGGT
GATGTGCAGC AATGCTATAC CGCCAAAGAG CGCCGTTTGC TGCAACATAC CAGTGCGCAG
CTTCATGCAG GCGAGACAGT GACGTTGCAA AAACTGGTGT GGATCGACTG GCGGGATGAC
AGGCAAGCCG TTTTAGACGA GTGGGGCAGC GCGTCGCTTC GCCAGCTTGA AATATGCGCG
CAGCAGAGTT ACGACCAACT TCTTGCAGCA TCAACAGAAA ACTGGCGTCA ATGGTGGCAG
AAACGTCGTA TCACGGTAAA TGGCGGCGAT GCGCACGATC AGCAAGCGTT AGATTATGCG
CTTTATCATC TGCGCATCAT GACGCCGGCT CACGACGAGC GCAGCAGTAT TGCGGCAAAA
GGCTTAACCG GCGAAGGCTA CAAAGGCCAC GTTTTCTGGG ATACAGAAGT ATTTTTGCTG
CCGTTCCATC TGTTTAGCGA TCCGACGGTT GCCCGAAGTT TACTGCGTTA TCGCTGGCAC
AACTTGCCAG GCGCGCAGGA GAAAGCACGG CGCAGCGGCT GGCAGGGCGC GCTATTTCCG
TGGGAAAGCG CGCGCAGCGG CGAAGAAGAG ACGCCAGAAT TTGCCGCCAT TAACATTCGT
ACCGGGCTGC GGCAAAAAGT GGCCTCGGCG CAGGCGGAAC ATCATCTGGT GGCCGATATC
GCCTGGGCGG TTATTCAATA CTGGCAGACC ACGGGGGATG AAAGTTTCAT TGCTCATGAA
GGCATGGCGC TACTTCTGGA AACTGCAAAG TTCTGGATTA GCCGCGCGGT GAGGGTTAAC
GACCGTCTGG AAATTCATGA TGTTATTGGG CCAGACGAAT ATACCGAACA TGTCAATAAT
AACGCCTTCA CCAGCTATAT GGCCCGCTAC AACGTTCAAC AGGCGCTGAA TATTGCCCGC
CAGTTCGGCT GTAGCGACGA TGCGTTTATC CATCGCGCCG AAATGTTCCT CAAAGAGCTA
TGGATGCCAG AAACGCAGCC CGATGGCGTT TTGCCGCAGG ATGATTCGTT TATGGCTAAG
CCGGCGATTA ATCTGGCTAA ATACAAAGCG GCGGCGGGGA AGCAAACCAT TCTGCTGGAT
TATTCACGCG CAGAAGTGAA CGAGATGCAG ATCCTCAAAC AAGCTGATGT GGTGATGCTC
AATTACATGC TGCCGGAGCA GTTCTCAGCG GTATCGTGTC TTGCCAATCT GCAATTTTAT
GAACCGCGCA CTATTCACGA CTCGTCATTA AGTAAAGCAA TCCACGGCAT TGTTGCCGCA
CGCTGTGGCC TGCTGACCCA AAGTTATCAG TTCTGGCGCG AGGGGACTGA AATCGATCTT
GGTGCTGATC CGCATAGTTG TGATGATGGT ATCCACGCTG CCGCAACTGG CGCTATCTGG
CTGGGGGCGA TTCAGGGTTT TGCCGGGGTG AGCGTGCGTG ACGGTGAATT ACATCTCAAT
CCGGCGTTAC CGGAGCAGTG GCAACAGTTG TCTTTCCCTC TGTTCTGGCA GGGCTGCGAA
TTACAGGTCA CGCTCGACGC GCAGCGTATT GCGATTCGAA CTTCTGCGCC CGTTTCACTG
CGTTTGAACG GGCAGCTTAT ATCCGTGGCT GAAGAATCTG TTTTCTGTTT GGGTGATTTT
ATTTTGCCCT TCAATGGGAC CGCTACCACG CATCAGGAGG ATGAATGA
 
Protein sequence
MTRPVTLSEP HFSQHTLNKY ASLMAQGNGY LGLRASHEED YTRQTRGMYL AGLYHRAGKG 
EINELVNLPD ILGMEIAING EVFSLSHEAW QRELDFASGE LRRNVVWRTS NGAGYTIASR
RFVSADQLPL IALEITITPL DADASVLIST GIDATQTNHG RQHLDETQVR VFGQHLMQGI
YTTQDGRSDV AISCCCKVSG DVQQCYTAKE RRLLQHTSAQ LHAGETVTLQ KLVWIDWRDD
RQAVLDEWGS ASLRQLEICA QQSYDQLLAA STENWRQWWQ KRRITVNGGD AHDQQALDYA
LYHLRIMTPA HDERSSIAAK GLTGEGYKGH VFWDTEVFLL PFHLFSDPTV ARSLLRYRWH
NLPGAQEKAR RSGWQGALFP WESARSGEEE TPEFAAINIR TGLRQKVASA QAEHHLVADI
AWAVIQYWQT TGDESFIAHE GMALLLETAK FWISRAVRVN DRLEIHDVIG PDEYTEHVNN
NAFTSYMARY NVQQALNIAR QFGCSDDAFI HRAEMFLKEL WMPETQPDGV LPQDDSFMAK
PAINLAKYKA AAGKQTILLD YSRAEVNEMQ ILKQADVVML NYMLPEQFSA VSCLANLQFY
EPRTIHDSSL SKAIHGIVAA RCGLLTQSYQ FWREGTEIDL GADPHSCDDG IHAAATGAIW
LGAIQGFAGV SVRDGELHLN PALPEQWQQL SFPLFWQGCE LQVTLDAQRI AIRTSAPVSL
RLNGQLISVA EESVFCLGDF ILPFNGTATT HQEDE
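
The protein sequence can be reproduced from the gene sequence by reading it codon by codon. The sketch below assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted). The helper names `clean`, `gc_content`, and `translate` are hypothetical, and only the first row of the gene sequence is embedded to keep the example short.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 until the first stop codon or the end of the sequence."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "ATGACCAGGC CAGTAACGTT ATCAGAACCC CATTTCAGCC AGCATACCCT GAACAAGTAT"
print(translate(first_row))  # MTRPVTLSEPHFSQHTLNKY
```

The printed peptide matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_content` would give a value to compare with the GC content field of the record.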