Gene ECH74115_0647 details

Gene Information

Locus tag: ECH74115_0647
Symbol:
ID: 6967509
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 671403
End bp: 676253
Gene Length: 4851 bp
Protein Length: 1616 aa
Translation table: 11
GC content: 58%
IMG OID: 643384684
Product: RHS Repeat family protein
Protein accession: YP_002269197
Protein GI: 209396800
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01643] YD repeat (two copies)
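
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (the gene sequence on this page ends in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(671403, 676253)
print(length)                     # 4851
print(protein_length_aa(length))  # 1616
```

Both results match the Gene Length and Protein Length fields reported for this record.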


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGTGAAG GACCAGGCGG GCCACAGGGA GCGACCGCAG GCGGTACGCT GGCAATGCGG 
ATGCTGTCGC AGCGGGCGAT GGCCGCCAGC CAGATGAAAC GGGCAGCCAA CGACAAAGCC
ATTGCACAGA TGTTGGCATC AAAAAAGTCT GGCCCCCCCG CCGCCAGGCT GGGCGATGAA
ATTCAGCACA AGAGTTTTTT GGGGGCGCTG GCAGGGGCCG TGCTGGGGGC GATAGTGACC
ATAGCAGAAG GTTGCCTGAT TATGGCCGCC TGTGCCACCG GCCCTTATGC GCTGGTTCTG
GTGCCTGCGC TGATGTATGC CAGCTATAAG GCGAGTGATT ATGTGGAGGA GAAACAGAAC
CAGCTTGAAT CATGGATAAA CAGCTTTTGT GACACGGACG GCGCCATCAA TACCGGTTCT
AAAAATGTAA AAATTAACGG AAAGCCAGCA GCCCGTGCAG CCGTCACCCT TCCCCCTCCT
CCCCCACCTG GAGCAATACC TGAAGTCCCA CAGGGGGAAC CCTCATGGGG TGATATTGCC
ACTGACCTGC TTGAATCGGC AGCGGAAAAA GCAGTACCAC TGGCGAAGGC CTGGGGGAAC
GCTGTTATCA CCCTGACGGA CAGCAATGCC GGTTTTATGG ATCGCGTCAG CGCCGGCGCA
TCGCTTCTGT TTCCCGCCGG TCCGGTATTA ATGGAGTTTG CCACCATGGT GGGCGGGCGT
GGCGAAATCA AAAAAGATGT GGATTTCCCG GAAGCCGGTG AGGACACGGC GCTCTGCGAC
AAGGAGAACA AACCACCGAG GATAGCCCAG GGCAGCAGCA ACGTCTTTAT CAACAATCAG
CCTGCCGCGC GCAAGGGCGA CAAACTGGAG TGCAGCGCGG CAATCGTGGA AGGTTCGCCG
GACGTCTTTA TTGGGGGTGA GCAGGTCACC TATCTGGATA TCCAGCCGGA GTTCCCGCCA
TGGCAGAGAA TGATCCTGGG AGGAATAACG ATAGCCAGCT ATCTTCTGCC GCCAGCAGGA
CTGCTGGGAA AACTGGGGAA TCTGGCGAAA CTGGGCAAAC TGGGAAACCT GCTGGGGAAA
AGCGGTAAGC TGCTGGGCGC AAAGCTCGGC GCGTTGCTGG GGAAAACAGG TAATTCGTTA
AAAAGTATTG CCAATAAAGT CATCAGATGG GTAACAGATC CTGTCGATCC GGTAACCGGC
GCATACTGCG ACGAACGTAC CGACTTTACG CTGGGCCAGA CACTCCCCCT CTCCTTCACC
CGTTTCCACA GTTCTGTACT GCCGCTGCAT GGCCTGACGG GCGTGGGCTG GAGCGACTCC
TGGAGCGAAT ACGCCTGGGT GCGTGAACAG GGAAACCGGG TGGATGTCAT CAGCCTGGGA
GCCACGCTGA ACTTCGCCTT CGACGGTGAA AGTGATACGG CGGTTAACCC GTATCACGCC
CAGTACATTC TGCGCCGCCG TGATGATTAT CTGGAGCTGT TCGACAGGGA TGCACTGAGC
AGCCGCTTCT TTTATGACGC CTTTCCGGGA ATGCGTCTGC GCCACCCGGT GACTGACGAT
ACCAGCGATG ACCGCCTGGC ACACAGCCCC GCAGACCGGA TGTACATGCT GGGCGGGATG
AGCGACACCG CCAGCAACCG CATCACGTTT GAGCGCGACA CCCAGTACCG GATCACGGGT
GTCAGTCACA CCGACGGGAT CCGGCTTAAA CTGACGTACC ACGCCAGCGG CTACCTGAAA
GCCATTCACC GCACGGATAA CGGCATACAG ACGCTGGCGA CCTACGAACA GGATGCGCGG
CTGGACTACC ACCTGTTTTA TGAGTACGAC GCTGCGGACC GGATCATCCG CTGGTCCGAT
AACGACCAGA CGTGGAGCCG TTTCACCTAC GATGCACAGG GCCGGTGCGT GACCGTCACC
GGGGCGGAGG GCTATTACAA CGCCACGCTG GACTATGGTG ACGGCTGCAC CACCGTGACG
GACGGCAAGG GCATTCACCG TTATTACTAT GATCCTGACG GCAATATTCT GCGGGAAGAA
GCGCCGGACG GCAGCACCAC CACGTATGAA TGGGATGAAT TCCATCACCT GCTGGCCCGC
CACTCCCCTG CCGGGCGGGT GGAGAAGTTT GAATACAACG CCGCACACGG TCAGTTAAGC
CGTTACACGG CGGCAGACGG CGCGGATTGG CAGTACTGCT ATGATGAGCG CGGCCTGCTC
AGCAACATCA CCGCCCCTGC CGGGCAGACG TGGACGCAGC AGTGTGATGA ACGCGGCCTG
CCGGTGAGTC TGGTATCGCC ACAGGGCGAA GAGACCCGGC TGGCGTACAC CCCTCAGGGG
CTGCTGTCGG GGATATTCCG CCAGGATGAA CGGCGTCTGG GCATAGAGTA CGACCACCAC
AACTGGCCGG AAACACTCAC CGACGTGATG GGCCGCGAAC ACCACACCGA ATACAGCGGT
CACGACCTGC CGGTGAAGAT GCGCGGCCCC GGCGGTCAGT CAGTGCGGTT GCAGTGGCAG
CAGCACCATA AACTGAGTGG CCTTGAGCGG GCAGGAACCG GCGCGGAAGG ATTCCGCTAC
GACCGCCACG GCAACCTGCT GGCGTACACG GACGGTAACG GCGTTGTCTG GACAATGGAG
TACGGCCCGT TTGATTTGCC GGTGGCGCGA ACGGACGGTG AAGGCCACCG CTGGCAGTAC
CGCTACGATA AAGACACGCT GCAACTGACA GAAGTCATTA ACCCGCAGGG CGAGTCTTAT
CTTTATATTC TGGACAACTG TGGCCGGGTG ACGGAAGAAC GTGACTGGGG CGGCGTGGTC
TGTCGTTACC GTTATGACGC TGATGGCCTG TGTACCGCCA GGGTCAACGG CCTGGAGGAA
ACCATCCTCT ACAGCCGGGA TGCCGCAGGC CGCCTGGCAG AAGTCATCAC TCCGGAAGGC
AAAACGCAGT ATGCGTATGA CAAATCCGGC AGGCTGACGG GTATCTTCAG CCCGGACGGC
ACATCACAGC GCACCGGCTA TGATGAACGC GGGCGGGTGA ATGTCACCAC TCAGGGCCGA
CGGGCCATTG AATACCACTA CCCCGACGAA CACACCGTCA TCCGCTGTAT CCTGCCACCG
GAAGATGAAC GCGACAGACA CCCCGACGGA TCCCTGCTGA AAACCACATA CCGCTACAAC
GCCGCCGGAG AACTGACGGA GGTTATCCTG CCGGGGGATG AGACGCTGAC GTTCAGCCGT
GATGAGGCGG GACGTGAAGT GCTCCGGCAC AGTAACCGGG GTTTTGCCTG TGAACAGGGC
TGGAATGCAG CCGGTCAGCC TGTCAGCCAG CGCGCCGGAC TTTTCCCGGC GGAAGCCACA
TGGGGCGGAC TGCTCCCTTC ACTGCTACGG GAATACCGTT ACGACAGCGC GGGTAACGTA
TCAGGCGTCA CCAGCCGGGA AGATTACGGA CGGGAAACAC ACCGGGAGTA CCGGCTTGAC
CGGAACGGCC AGGTCACGGC GGTGACAGCC TCAGGCACCG GGCTGGGCTA TGGCGAAGGC
GACGAGACTT ATGGCTATGA CAGCTGCGGC TACCTGAAGG CGCAGTCTGC GGGCAGACAC
CGGATAAGCG GAGAGACTGA CCAGTATGCC GCAGGCCACC GGCTGAAACA GGCCGGAAAC
ACACAGTATG ACTATGACGC CGCAGGCCGG ATGGTCAGCC GCACAAAACA CCGTGACGGC
TACCGCCCAG AAACAGAGCG GTTCCGGTGG GACAGCCGGG ACCAGCTGAC CGGGTATCGC
AGCGCACAGG GGGAGCAGTG GGAATACCGC CACGACGCCA GCGGCAGACG GACGGAAAAA
CGCTGCGACC GGAAGAAAAT CCGTTTTACG TACCTGTGGG ACGGCGACAG TATTGCGGAA
ATCCGGGAAT ACCGCGATGA TAAACTGTAC AGCGTAAGGC ACCTGGTGTT TAACGGCTTT
GAGCTGATAA GCCAGCAGTT CAGCCGGGTA CGACAGCCGC ATCCGTCCGT GGCCCCGCAG
TGGGTGACGC GAACGAATCA TGCGGTGAGC GACCTGACGG GCCGCCCGCT GATGCTCTTT
AACAGTGAAG GTAAAACCGT CTGGCGGCCG GGGCAGACCA GCCTGTGGGG GCTGGCACTC
AGCCTGCCCG CAGACACAGA CTACCCGGCC CCGCGCGGGG AGCGGGACCC GGAAGCGGAC
CCCGGCCTGC TGTATGCGGG ACAGTGGCAG GATGCAGAAT CGGGGCTGTG CTATAACCGG
TTCCGGTACT ACGAGCCGGA AACCGGGATG TACCTGGTGA GTGATCCGCT GGGGTTGCAG
GGAGGGGAGC AGACTTATCG GTATGTGCCG AATCCTTGTG GGTATATCGA TCCTTTGGGG
CTGGCTATAT GTCAGTTAGC CCGCTGGACG AAATGGGGGA GTGAGCAAAG CAACATATCT
GATGTTTTGA ACTCATTAGG GAATAGAGCA CTTAAATATG CTAATGGTGA TTGGATAAAA
TCAGAGGCTG CATTCAATAA ATACATAAAC ATGATAAATA AAAGACTAGA ATTAACAGGT
AGTAAATTTA GAGTTGAGAT TCAACCAGCC ATAAAAAATG GAGAGCGAGT TCCTGCGACA
ACGAATGGAC CATTTAAAGT AAATGGTAAG TGGACATCCG GCACTCATTA TACAGGTGGT
TCCAAACGTC TAGATGCCGG TATTATTGAT ATCACATCTC CTACAAACCA ATATGGATTA
CATCCAGTTA TTGAAGGATT TGATATAACA CTTAATAAAA CAAAACCATC AGCAGTGGAT
ATATATTCAG ATGTGTTTGG TGGGATTGAT ATTAACGACT TTCGGTTATA A
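
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be stripped before any calculation. The sketch below shows one way to do that and to compute a GC fraction like the one in the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed fraction describes that line alone and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from this page.
first_line = "ATGAGTGAAG GACCAGGCGG GCCACAGGGA GCGACCGCAG GCGGTACGCT GGCAATGCGG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` would give the value to compare against the GC content field.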
 
Protein sequence
MSEGPGGPQG ATAGGTLAMR MLSQRAMAAS QMKRAANDKA IAQMLASKKS GPPAARLGDE 
IQHKSFLGAL AGAVLGAIVT IAEGCLIMAA CATGPYALVL VPALMYASYK ASDYVEEKQN
QLESWINSFC DTDGAINTGS KNVKINGKPA ARAAVTLPPP PPPGAIPEVP QGEPSWGDIA
TDLLESAAEK AVPLAKAWGN AVITLTDSNA GFMDRVSAGA SLLFPAGPVL MEFATMVGGR
GEIKKDVDFP EAGEDTALCD KENKPPRIAQ GSSNVFINNQ PAARKGDKLE CSAAIVEGSP
DVFIGGEQVT YLDIQPEFPP WQRMILGGIT IASYLLPPAG LLGKLGNLAK LGKLGNLLGK
SGKLLGAKLG ALLGKTGNSL KSIANKVIRW VTDPVDPVTG AYCDERTDFT LGQTLPLSFT
RFHSSVLPLH GLTGVGWSDS WSEYAWVREQ GNRVDVISLG ATLNFAFDGE SDTAVNPYHA
QYILRRRDDY LELFDRDALS SRFFYDAFPG MRLRHPVTDD TSDDRLAHSP ADRMYMLGGM
SDTASNRITF ERDTQYRITG VSHTDGIRLK LTYHASGYLK AIHRTDNGIQ TLATYEQDAR
LDYHLFYEYD AADRIIRWSD NDQTWSRFTY DAQGRCVTVT GAEGYYNATL DYGDGCTTVT
DGKGIHRYYY DPDGNILREE APDGSTTTYE WDEFHHLLAR HSPAGRVEKF EYNAAHGQLS
RYTAADGADW QYCYDERGLL SNITAPAGQT WTQQCDERGL PVSLVSPQGE ETRLAYTPQG
LLSGIFRQDE RRLGIEYDHH NWPETLTDVM GREHHTEYSG HDLPVKMRGP GGQSVRLQWQ
QHHKLSGLER AGTGAEGFRY DRHGNLLAYT DGNGVVWTME YGPFDLPVAR TDGEGHRWQY
RYDKDTLQLT EVINPQGESY LYILDNCGRV TEERDWGGVV CRYRYDADGL CTARVNGLEE
TILYSRDAAG RLAEVITPEG KTQYAYDKSG RLTGIFSPDG TSQRTGYDER GRVNVTTQGR
RAIEYHYPDE HTVIRCILPP EDERDRHPDG SLLKTTYRYN AAGELTEVIL PGDETLTFSR
DEAGREVLRH SNRGFACEQG WNAAGQPVSQ RAGLFPAEAT WGGLLPSLLR EYRYDSAGNV
SGVTSREDYG RETHREYRLD RNGQVTAVTA SGTGLGYGEG DETYGYDSCG YLKAQSAGRH
RISGETDQYA AGHRLKQAGN TQYDYDAAGR MVSRTKHRDG YRPETERFRW DSRDQLTGYR
SAQGEQWEYR HDASGRRTEK RCDRKKIRFT YLWDGDSIAE IREYRDDKLY SVRHLVFNGF
ELISQQFSRV RQPHPSVAPQ WVTRTNHAVS DLTGRPLMLF NSEGKTVWRP GQTSLWGLAL
SLPADTDYPA PRGERDPEAD PGLLYAGQWQ DAESGLCYNR FRYYEPETGM YLVSDPLGLQ
GGEQTYRYVP NPCGYIDPLG LAICQLARWT KWGSEQSNIS DVLNSLGNRA LKYANGDWIK
SEAAFNKYIN MINKRLELTG SKFRVEIQPA IKNGERVPAT TNGPFKVNGK WTSGTHYTGG
SKRLDAGIID ITSPTNQYGL HPVIEGFDIT LNKTKPSAVD IYSDVFGGID INDFRL
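
The protein sequence follows from the gene sequence through translation table 11. The sketch below is a minimal illustration, assuming the codon-to-residue assignments of table 11 are the same as those of the standard code (the two tables differ in which codons may act as start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical, and the function raises `KeyError` on any codon containing characters other than A, C, G, or T.

```python
BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence on this page.
print(translate("ATGAGTGAAG GACCAGGCGG GCCACAGGGA"))  # MSEGPGGPQG
```

The output matches the first 10 residues of the protein sequence listed above.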