Gene Information |
Locus tag | ECH74115_0258 |
Symbol | |
ID | 6967418 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 276352 |
End bp | 278112 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643384328 |
Product | RHS Repeat family protein |
Protein accession | YP_002268844 |
Protein GI | 209400909 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
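The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions (an assumption that is consistent with the listed gene length) and that the final codon of the CDS is a stop codon, which is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 276352
end_bp = 278112
protein_length_aa = 586

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
# Assumption: the last codon is the stop codon and encodes no residue.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print(gene_length_bp, remainder, coding_codons)  # 1761 0 586
```

Under these assumptions the computed values agree with the listed Gene Length (1761 bp) and Protein Length (586 aa).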
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCCTGGC TGCACCGTGA GACGGCCCGC AGCTTCGGCG GGGCGGGCAG TACAGCGGGG TACGAACAGG TCACGGCGTA CACCCTCACA GGGCAGCTAC AGAGCAGGCA CCTGAACCTC CCGCAGCTTG ACTGTGACTA CGACTGGAAC GACAACGGAC AGCTAATCCG CATCAGCGGC CCGCAGGAGA GCCGGGAGTA CCGTTACAGC GACACGGGAA GGCTGACGGG CGTCCACACC ACGGCAGCGA ACCTGGATAT CGATATCCCG TATGCAACGG ACCCGGCAGG AAACCGGCTG CCGGACCCGG AACTGCATCC GGACAGCACG CTCACGGCGT GGCCGGATAA CCGCATCGCG GAAGATGCGC ACTATGTCTA CCGCTACGAT GAATACGGCA GGCTGACGGA GAAGACGGAC CGCATTCCGG AAGGGGTTAT CCGGATGCAC GACGAGCGGA CCCACCACTA CCATTACGAC AACCAGCACC GCCTGGTGTT CTACACGCGG ATACAATACG GCGAGCCGCT GGTCGAGAGC CGCTACCTCT ATGACCCGCT GGGCCGCCGG ACGGGGAAAC GGGTGTGGCG GCGGGAGCGT GACCTGACGG GGTGGATGTC GCTGTCGCGT AAACCGGAGG TGACCTGGTA CGGGTGGGAC GGCGACAGGC TGACGACGGT ACAGACCGGC ACCACACGTA TCCAGACGGT ATACCGGCCG GGGAGCTTCA CACCGCTCAT CCGCATCGAA ACGGAGAACG GCGAGCGGGA GAAAGCGCAG CGCCGCAGCC TGGCGGAGAA ACTCCAGCAG GAAGGGAGTG AGGACGGTCA CGGTGTGGTG TTCCCGGCAG AACTGGTGAG GATGCTGGAC AGGCTGGAGG AAGAAATCCG GGCAGACCGC GTGAGCAGTG AAAGCCGGGC GTGGCTTGCG CAGTGCGGAC TGACGGTGGA GCAACTGGAA AAACAGGTGG AGCCGGAATA CACGCCGGCG CGCACGCTGC ATCTGTACCA CTGTGACCAC CGGGGACTGC CGCTGGCGCT TATCAGCGAA GACGGCAATA CGGCGTGGAG CGCGGAATAT GATGAATGGG GCAACCAGCT TAATGAGGAG AACCCGCATC ACCTGCACCA GCCGTACCGT CTGCCGGGCC AGCAGCATGA TGAGGAGTCG GGGCTGTACT ATAACCGTCA CCGGTACTAC GATCCGTTGC AGGGGCGGTA TATCACCCCG GACCCGATTG GGTTGAGAGG TGGATGGAAT ATGTATCAGT ATCCGTTGAA TCCCATACAA GTGATAGACC CAATGGGGTT AGATGCGATT GAGAATATGA CATCAGGTGG ACTAATTTAT GCCGTATCTG GTGTACCTGG ATTGATTGTT GCAAACAGCA TTACTAACAG TGCTTACCAG TTCGGTTATG ATATGGATGC TATTGTTGGC GGAGCTCATA ATGGGGCCGC CGATGCAATG AGATATTGTT ACTTGATGTG TCGAATGACT AAGACATTTG GATCAACAAT AGCTGACGTG ATAGGTAAAA ATCATGAGGC GGCTGGGGAT AGACAAGGTC AGCCAGCTAA AGAAAGAATC ATGGATCTTA AAAATAACAC TGTCGGTATT GCTTGTGGCG ATTTTTCTGC CAAATGTAGC GATGCATGTA TTGAAAAATA TAACATTGGG CAACTCTTCG GGTTAGATGG TATAAAAGCA GATAATCCAA TAAAAGCAAA GCAAGGGAGT TCAGATGCTT CAAATTATTA G
|
Protein sequence | MPWLHRETAR SFGGAGSTAG YEQVTAYTLT GQLQSRHLNL PQLDCDYDWN DNGQLIRISG PQESREYRYS DTGRLTGVHT TAANLDIDIP YATDPAGNRL PDPELHPDST LTAWPDNRIA EDAHYVYRYD EYGRLTEKTD RIPEGVIRMH DERTHHYHYD NQHRLVFYTR IQYGEPLVES RYLYDPLGRR TGKRVWRRER DLTGWMSLSR KPEVTWYGWD GDRLTTVQTG TTRIQTVYRP GSFTPLIRIE TENGEREKAQ RRSLAEKLQQ EGSEDGHGVV FPAELVRMLD RLEEEIRADR VSSESRAWLA QCGLTVEQLE KQVEPEYTPA RTLHLYHCDH RGLPLALISE DGNTAWSAEY DEWGNQLNEE NPHHLHQPYR LPGQQHDEES GLYYNRHRYY DPLQGRYITP DPIGLRGGWN MYQYPLNPIQ VIDPMGLDAI ENMTSGGLIY AVSGVPGLIV ANSITNSAYQ FGYDMDAIVG GAHNGAADAM RYCYLMCRMT KTFGSTIADV IGKNHEAAGD RQGQPAKERI MDLKNNTVGI ACGDFSAKCS DACIEKYNIG QLFGLDGIKA DNPIKAKQGS SDASNY
|
| |
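The relationship between the gene sequence, the protein sequence, and the Translation table field (11) can be illustrated with a short, dependency-free sketch. It is not the pipeline used to build this record. The helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical, and the code assumes that the first codon is a valid initiator, which under translation table 11 is read as methionine even when it is TTG, as it is here. Codon assignments for elongation in table 11 are the same as in the standard code.

```python
# Codon table in NCBI order (T, C, A, G at each position).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace that separates the 10-base blocks; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq):
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is an initiator and is emitted as 'M',
    regardless of which amino acid it would encode internally.
    """
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        amino_acid = CODON_TABLE[seq[index:index + 3]]
        if amino_acid == "*":
            break
        protein.append("M" if index == 0 else amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate_cds("TTGCCCTGGC TGCACCGTGA GACGGCCCGC"))  # MPWLHRETAR
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.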