Gene Information |
Locus tag | ECH74115_0647 |
Symbol | |
ID | 6967509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 671403 |
End bp | 676253 |
Gene Length | 4851 bp |
Protein Length | 1616 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643384684 |
Product | RHS Repeat family protein |
Protein accession | YP_002269197 |
Protein GI | 209396800 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
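The coordinates and lengths in the table above are mutually consistent, and the arithmetic can be sketched as follows. This is only an illustrative check, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, when includes_stop is
    True, that the final codon is a stop codon that is not translated.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if includes_stop else codons


# Values taken from the Gene Information table.
length = gene_length_bp(671403, 676253)
protein = expected_protein_length_aa(length)
print(length, protein)
```

With the values from this record, the computed gene length matches the listed 4851 bp and the computed protein length matches the listed 1616 aa.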
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAAG GACCAGGCGG GCCACAGGGA GCGACCGCAG GCGGTACGCT GGCAATGCGG ATGCTGTCGC AGCGGGCGAT GGCCGCCAGC CAGATGAAAC GGGCAGCCAA CGACAAAGCC ATTGCACAGA TGTTGGCATC AAAAAAGTCT GGCCCCCCCG CCGCCAGGCT GGGCGATGAA ATTCAGCACA AGAGTTTTTT GGGGGCGCTG GCAGGGGCCG TGCTGGGGGC GATAGTGACC ATAGCAGAAG GTTGCCTGAT TATGGCCGCC TGTGCCACCG GCCCTTATGC GCTGGTTCTG GTGCCTGCGC TGATGTATGC CAGCTATAAG GCGAGTGATT ATGTGGAGGA GAAACAGAAC CAGCTTGAAT CATGGATAAA CAGCTTTTGT GACACGGACG GCGCCATCAA TACCGGTTCT AAAAATGTAA AAATTAACGG AAAGCCAGCA GCCCGTGCAG CCGTCACCCT TCCCCCTCCT CCCCCACCTG GAGCAATACC TGAAGTCCCA CAGGGGGAAC CCTCATGGGG TGATATTGCC ACTGACCTGC TTGAATCGGC AGCGGAAAAA GCAGTACCAC TGGCGAAGGC CTGGGGGAAC GCTGTTATCA CCCTGACGGA CAGCAATGCC GGTTTTATGG ATCGCGTCAG CGCCGGCGCA TCGCTTCTGT TTCCCGCCGG TCCGGTATTA ATGGAGTTTG CCACCATGGT GGGCGGGCGT GGCGAAATCA AAAAAGATGT GGATTTCCCG GAAGCCGGTG AGGACACGGC GCTCTGCGAC AAGGAGAACA AACCACCGAG GATAGCCCAG GGCAGCAGCA ACGTCTTTAT CAACAATCAG CCTGCCGCGC GCAAGGGCGA CAAACTGGAG TGCAGCGCGG CAATCGTGGA AGGTTCGCCG GACGTCTTTA TTGGGGGTGA GCAGGTCACC TATCTGGATA TCCAGCCGGA GTTCCCGCCA TGGCAGAGAA TGATCCTGGG AGGAATAACG ATAGCCAGCT ATCTTCTGCC GCCAGCAGGA CTGCTGGGAA AACTGGGGAA TCTGGCGAAA CTGGGCAAAC TGGGAAACCT GCTGGGGAAA AGCGGTAAGC TGCTGGGCGC AAAGCTCGGC GCGTTGCTGG GGAAAACAGG TAATTCGTTA AAAAGTATTG CCAATAAAGT CATCAGATGG GTAACAGATC CTGTCGATCC GGTAACCGGC GCATACTGCG ACGAACGTAC CGACTTTACG CTGGGCCAGA CACTCCCCCT CTCCTTCACC CGTTTCCACA GTTCTGTACT GCCGCTGCAT GGCCTGACGG GCGTGGGCTG GAGCGACTCC TGGAGCGAAT ACGCCTGGGT GCGTGAACAG GGAAACCGGG TGGATGTCAT CAGCCTGGGA GCCACGCTGA ACTTCGCCTT CGACGGTGAA AGTGATACGG CGGTTAACCC GTATCACGCC CAGTACATTC TGCGCCGCCG TGATGATTAT CTGGAGCTGT TCGACAGGGA TGCACTGAGC AGCCGCTTCT TTTATGACGC CTTTCCGGGA ATGCGTCTGC GCCACCCGGT GACTGACGAT ACCAGCGATG ACCGCCTGGC ACACAGCCCC GCAGACCGGA TGTACATGCT GGGCGGGATG AGCGACACCG CCAGCAACCG CATCACGTTT GAGCGCGACA CCCAGTACCG GATCACGGGT GTCAGTCACA CCGACGGGAT CCGGCTTAAA CTGACGTACC ACGCCAGCGG CTACCTGAAA GCCATTCACC GCACGGATAA CGGCATACAG ACGCTGGCGA CCTACGAACA GGATGCGCGG 
CTGGACTACC ACCTGTTTTA TGAGTACGAC GCTGCGGACC GGATCATCCG CTGGTCCGAT AACGACCAGA CGTGGAGCCG TTTCACCTAC GATGCACAGG GCCGGTGCGT GACCGTCACC GGGGCGGAGG GCTATTACAA CGCCACGCTG GACTATGGTG ACGGCTGCAC CACCGTGACG GACGGCAAGG GCATTCACCG TTATTACTAT GATCCTGACG GCAATATTCT GCGGGAAGAA GCGCCGGACG GCAGCACCAC CACGTATGAA TGGGATGAAT TCCATCACCT GCTGGCCCGC CACTCCCCTG CCGGGCGGGT GGAGAAGTTT GAATACAACG CCGCACACGG TCAGTTAAGC CGTTACACGG CGGCAGACGG CGCGGATTGG CAGTACTGCT ATGATGAGCG CGGCCTGCTC AGCAACATCA CCGCCCCTGC CGGGCAGACG TGGACGCAGC AGTGTGATGA ACGCGGCCTG CCGGTGAGTC TGGTATCGCC ACAGGGCGAA GAGACCCGGC TGGCGTACAC CCCTCAGGGG CTGCTGTCGG GGATATTCCG CCAGGATGAA CGGCGTCTGG GCATAGAGTA CGACCACCAC AACTGGCCGG AAACACTCAC CGACGTGATG GGCCGCGAAC ACCACACCGA ATACAGCGGT CACGACCTGC CGGTGAAGAT GCGCGGCCCC GGCGGTCAGT CAGTGCGGTT GCAGTGGCAG CAGCACCATA AACTGAGTGG CCTTGAGCGG GCAGGAACCG GCGCGGAAGG ATTCCGCTAC GACCGCCACG GCAACCTGCT GGCGTACACG GACGGTAACG GCGTTGTCTG GACAATGGAG TACGGCCCGT TTGATTTGCC GGTGGCGCGA ACGGACGGTG AAGGCCACCG CTGGCAGTAC CGCTACGATA AAGACACGCT GCAACTGACA GAAGTCATTA ACCCGCAGGG CGAGTCTTAT CTTTATATTC TGGACAACTG TGGCCGGGTG ACGGAAGAAC GTGACTGGGG CGGCGTGGTC TGTCGTTACC GTTATGACGC TGATGGCCTG TGTACCGCCA GGGTCAACGG CCTGGAGGAA ACCATCCTCT ACAGCCGGGA TGCCGCAGGC CGCCTGGCAG AAGTCATCAC TCCGGAAGGC AAAACGCAGT ATGCGTATGA CAAATCCGGC AGGCTGACGG GTATCTTCAG CCCGGACGGC ACATCACAGC GCACCGGCTA TGATGAACGC GGGCGGGTGA ATGTCACCAC TCAGGGCCGA CGGGCCATTG AATACCACTA CCCCGACGAA CACACCGTCA TCCGCTGTAT CCTGCCACCG GAAGATGAAC GCGACAGACA CCCCGACGGA TCCCTGCTGA AAACCACATA CCGCTACAAC GCCGCCGGAG AACTGACGGA GGTTATCCTG CCGGGGGATG AGACGCTGAC GTTCAGCCGT GATGAGGCGG GACGTGAAGT GCTCCGGCAC AGTAACCGGG GTTTTGCCTG TGAACAGGGC TGGAATGCAG CCGGTCAGCC TGTCAGCCAG CGCGCCGGAC TTTTCCCGGC GGAAGCCACA TGGGGCGGAC TGCTCCCTTC ACTGCTACGG GAATACCGTT ACGACAGCGC GGGTAACGTA TCAGGCGTCA CCAGCCGGGA AGATTACGGA CGGGAAACAC ACCGGGAGTA CCGGCTTGAC CGGAACGGCC AGGTCACGGC GGTGACAGCC TCAGGCACCG GGCTGGGCTA TGGCGAAGGC GACGAGACTT ATGGCTATGA CAGCTGCGGC TACCTGAAGG CGCAGTCTGC GGGCAGACAC CGGATAAGCG 
GAGAGACTGA CCAGTATGCC GCAGGCCACC GGCTGAAACA GGCCGGAAAC ACACAGTATG ACTATGACGC CGCAGGCCGG ATGGTCAGCC GCACAAAACA CCGTGACGGC TACCGCCCAG AAACAGAGCG GTTCCGGTGG GACAGCCGGG ACCAGCTGAC CGGGTATCGC AGCGCACAGG GGGAGCAGTG GGAATACCGC CACGACGCCA GCGGCAGACG GACGGAAAAA CGCTGCGACC GGAAGAAAAT CCGTTTTACG TACCTGTGGG ACGGCGACAG TATTGCGGAA ATCCGGGAAT ACCGCGATGA TAAACTGTAC AGCGTAAGGC ACCTGGTGTT TAACGGCTTT GAGCTGATAA GCCAGCAGTT CAGCCGGGTA CGACAGCCGC ATCCGTCCGT GGCCCCGCAG TGGGTGACGC GAACGAATCA TGCGGTGAGC GACCTGACGG GCCGCCCGCT GATGCTCTTT AACAGTGAAG GTAAAACCGT CTGGCGGCCG GGGCAGACCA GCCTGTGGGG GCTGGCACTC AGCCTGCCCG CAGACACAGA CTACCCGGCC CCGCGCGGGG AGCGGGACCC GGAAGCGGAC CCCGGCCTGC TGTATGCGGG ACAGTGGCAG GATGCAGAAT CGGGGCTGTG CTATAACCGG TTCCGGTACT ACGAGCCGGA AACCGGGATG TACCTGGTGA GTGATCCGCT GGGGTTGCAG GGAGGGGAGC AGACTTATCG GTATGTGCCG AATCCTTGTG GGTATATCGA TCCTTTGGGG CTGGCTATAT GTCAGTTAGC CCGCTGGACG AAATGGGGGA GTGAGCAAAG CAACATATCT GATGTTTTGA ACTCATTAGG GAATAGAGCA CTTAAATATG CTAATGGTGA TTGGATAAAA TCAGAGGCTG CATTCAATAA ATACATAAAC ATGATAAATA AAAGACTAGA ATTAACAGGT AGTAAATTTA GAGTTGAGAT TCAACCAGCC ATAAAAAATG GAGAGCGAGT TCCTGCGACA ACGAATGGAC CATTTAAAGT AAATGGTAAG TGGACATCCG GCACTCATTA TACAGGTGGT TCCAAACGTC TAGATGCCGG TATTATTGAT ATCACATCTC CTACAAACCA ATATGGATTA CATCCAGTTA TTGAAGGATT TGATATAACA CTTAATAAAA CAAAACCATC AGCAGTGGAT ATATATTCAG ATGTGTTTGG TGGGATTGAT ATTAACGACT TTCGGTTATA A
|
Protein sequence | MSEGPGGPQG ATAGGTLAMR MLSQRAMAAS QMKRAANDKA IAQMLASKKS GPPAARLGDE IQHKSFLGAL AGAVLGAIVT IAEGCLIMAA CATGPYALVL VPALMYASYK ASDYVEEKQN QLESWINSFC DTDGAINTGS KNVKINGKPA ARAAVTLPPP PPPGAIPEVP QGEPSWGDIA TDLLESAAEK AVPLAKAWGN AVITLTDSNA GFMDRVSAGA SLLFPAGPVL MEFATMVGGR GEIKKDVDFP EAGEDTALCD KENKPPRIAQ GSSNVFINNQ PAARKGDKLE CSAAIVEGSP DVFIGGEQVT YLDIQPEFPP WQRMILGGIT IASYLLPPAG LLGKLGNLAK LGKLGNLLGK SGKLLGAKLG ALLGKTGNSL KSIANKVIRW VTDPVDPVTG AYCDERTDFT LGQTLPLSFT RFHSSVLPLH GLTGVGWSDS WSEYAWVREQ GNRVDVISLG ATLNFAFDGE SDTAVNPYHA QYILRRRDDY LELFDRDALS SRFFYDAFPG MRLRHPVTDD TSDDRLAHSP ADRMYMLGGM SDTASNRITF ERDTQYRITG VSHTDGIRLK LTYHASGYLK AIHRTDNGIQ TLATYEQDAR LDYHLFYEYD AADRIIRWSD NDQTWSRFTY DAQGRCVTVT GAEGYYNATL DYGDGCTTVT DGKGIHRYYY DPDGNILREE APDGSTTTYE WDEFHHLLAR HSPAGRVEKF EYNAAHGQLS RYTAADGADW QYCYDERGLL SNITAPAGQT WTQQCDERGL PVSLVSPQGE ETRLAYTPQG LLSGIFRQDE RRLGIEYDHH NWPETLTDVM GREHHTEYSG HDLPVKMRGP GGQSVRLQWQ QHHKLSGLER AGTGAEGFRY DRHGNLLAYT DGNGVVWTME YGPFDLPVAR TDGEGHRWQY RYDKDTLQLT EVINPQGESY LYILDNCGRV TEERDWGGVV CRYRYDADGL CTARVNGLEE TILYSRDAAG RLAEVITPEG KTQYAYDKSG RLTGIFSPDG TSQRTGYDER GRVNVTTQGR RAIEYHYPDE HTVIRCILPP EDERDRHPDG SLLKTTYRYN AAGELTEVIL PGDETLTFSR DEAGREVLRH SNRGFACEQG WNAAGQPVSQ RAGLFPAEAT WGGLLPSLLR EYRYDSAGNV SGVTSREDYG RETHREYRLD RNGQVTAVTA SGTGLGYGEG DETYGYDSCG YLKAQSAGRH RISGETDQYA AGHRLKQAGN TQYDYDAAGR MVSRTKHRDG YRPETERFRW DSRDQLTGYR SAQGEQWEYR HDASGRRTEK RCDRKKIRFT YLWDGDSIAE IREYRDDKLY SVRHLVFNGF ELISQQFSRV RQPHPSVAPQ WVTRTNHAVS DLTGRPLMLF NSEGKTVWRP GQTSLWGLAL SLPADTDYPA PRGERDPEAD PGLLYAGQWQ DAESGLCYNR FRYYEPETGM YLVSDPLGLQ GGEQTYRYVP NPCGYIDPLG LAICQLARWT KWGSEQSNIS DVLNSLGNRA LKYANGDWIK SEAAFNKYIN MINKRLELTG SKFRVEIQPA IKNGERVPAT TNGPFKVNGK WTSGTHYTGG SKRLDAGIID ITSPTNQYGL HPVIEGFDIT LNKTKPSAVD IYSDVFGGID INDFRL