Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0252 |
Symbol | |
ID | 6970670 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 267063 |
End bp | 271295 |
Gene Length | 4233 bp |
Protein Length | 1410 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643384322 |
Product | RHS Repeat family protein |
Protein accession | YP_002268838 |
Protein GI | 209400273 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.661469 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAGGAA AACCGGCGGC GCGGCAGGGT GACATGACCC GCAAGGGACT GGATATTGTG CAGGGTTCAG CAGGGGTGCT GATAGGTGCG CCGACGGGCG TGGCCTGCTC GGTGTGTCCG GGAGGGATTA CCTATGCTAA CCCGGTGAAC CCGGTGCTGG GTGCGAAGGT GCTGCCGGGC GAGACGGACC TTGCGCTGCC CGGCCCGCTG CCGTTTATTC TTTCCCGCGC CTACAGCAGC TACCGGACCA GAACACCCGC GCCGGTGGGG GTGTTTGGTC CCGGCTGGAA AGCGCCCTTC GATATCCGCT TACAGATACG CGATGAAGGC CTGATACTCA ACGACAACGG CGGCAGGAGC ATTCACTTTG AGCCGCTGTT TCCCGGCGAG ATAAGCTACA GCCGCAGCGA GTCGTTCTGG CTGGCGCGGG GCGGGGTGGC GGAGCAGCAC AGTTCGCAGC CGCTAAGCGC GCTCTGGCAG GTGCTGCCGG AAGATGTTCG CCTGAGTCCG CATATGTACC TGGCGACAAA CAGCCTTCAG GGGCCGTGGT GGATACTCAA CTGGCCGGAG CGGGTGCCGG GGGCGGACGA GGTGCTGCCG CCGGAGCCGC CCGCATACCG GGTGCTGACG GGCGTGGTGG ATGGCTTCGG GCGGACGCTG GCCTTTCACC GGGCGGCGGA GGGTGATGTG GCGGGCGCGG TGACGGGGGT GACGGACGGC GCGGGCCGCC GTTTTCACCT GGTGCTGACC ACACAGGCAC AGCGGGCGGA AGTATTCCGT AAACAGCGCG CCACGTCTTT ATCTTCCCCT GCCGGTCCCC GTTCTGCTTC CTCTTCTCTG GTTTTCCCCG ACACGCTGCC CGCCGGTACA GAATACGGTG CCGATAACGG TATCCGGCTG GAGGCGGTAT GGCTGACACA CGACCCGGCA TACCCGGATG AACTGCCCGC CGCGCCGCTG GCGCGCTACA CGTACACGGC CAGCGGAGAA CTGCGGGCGG TGTATGACCG CAGCGGGACG CAGGTGCGCG GGTTTGCTTA TGATGCGGAG CACGCCGGGC GGATGGTGGC GCACCATTAT GCGGGTAGGC CGGAGAGCCG CTACCGGTAT GATGATACCG GCCGGGTGAC GGAGCTGGTC AACCCGGAGG GGCTGGACTA CCGCTTTGAG TACGGGCAGG ACCGTGTGAC CATCACGGAC AGCCTGAACC GGCGGGAGGT GCTGTACACG GAAGGCGAGG GTGGCCTGAA ACGTGTTGTG AAGAAGGAAC ATGCGGACGG GAGCATCACC CGCAGCGAGT ATGATGAGGC GGGGAGGCTG AAGGCACAGA CGGATGCGGC GGGACGGCGG ACGGAGTACA GCCTGCATAT GGCGTCGGGT GCGGTGACAG CGGTGACGGG GCCGGACGGC AGGACGGTGC GGTATGGCTA TAACAGCCAG CGGCAGGTGA CGTCAGTGAC GTACCCGGAC GGGCTGCGCA GCAGCCGGGA GTATGATGAG AAGGGAAGGC TGGCGGCGGA GACCTCGCGC AGCGGAGAGA CGACGCGGTA CAGCTATGAT GACCCGGCGA GTGAGCTGCC GACAGGGATA CAGGACGCGA CGGGCAGTAC AAAACAGATG GCATGGAGCC GTTACGGTCA GCTGCTGACC TTTACGGACT GCTCGGGGTA CACGACGCGG TATGAGTATG ACCGGTACGG TCAACAAATC GCCGTTCACC GGGAAGAAGG CATCAGCACT TACAGCAGTT ATAACCCGCG TGGCCAACTG GTCAGTCAGA AGGATGCGCA GGGGCGTGAA ACCCGTTATG AGTACAGCGC CGCAGGCGAC CTCACCGCTA TCGTTGCCCC GGACGGCAGC CGCAGTGAGA TACAGTATGA TGCGTGGGGA AAGGCCGTCA GCACCACGCA GGGCGGTCTG ACGCGCAGCA TGGGGTATGA CGCTGCCGGG CGCATCACCG TGCTGACCAA CGAGAACGGC AGCCAGTCCA CGTTCCGGTA TGACCCGGTG GACAGGCTGA CTGAACAGCG CGGTTTTGAC GGCCGGACGC AACGTTACCA CTATGACCTG ACCGGAAAAC TCACGCAGAG TGAAGACGAG GGGCTTGTCA CCCTCTGGCA CTACGATGCG TCGGACCGCA TCACGCACCG GACGGTGAAC GGCGACCCGG CAGAGCAGTG GCAGTATGAT GAGCACGGGT GGCTAACCAC CCTCAGCCAT ACCAGTGAAG GCCACCGGGT GTCGGTCCAC TACGGCTATG ACGATAAAGG CCGCCTGACG GGTGAACGGC AGACGGTGGA GAACCCGGAG ACGGGGGAGA TGCTGTGGGA GCATGAGACG GGGCACGCGT ACAGCGAACA GGGGCTGGCG ACCCGTCAGG AGCCGGACGG TCTGCCGCCG GTAGAGTGGC TGACGTATGG CAGCGGTTAT CTTGCGGGGA TGAAGCTGGG CGGAACGCCA CTGGTCGAGT ACATGCGGGA CCGGCTGCAC CGTGAGACGG CCCGCAGCTT CGGCGGGGAG GCATATGAAC TTGCCACCGC CTGGAATACC AGCGGCCAGC TCCGGAGCAG GCACCTGAAC CTTCCGCAGC TTGACCGTGA CTACGACTGG AACGACAACG GACAGCTAAT CCGCATCAGC GGCCCGCAGG AGAGCCGGGA GTACCGTTAC AGTGACACGG GAAGGCTGAC GGGCGTCCAC ACCACGGCAG CGAACCTGGA TATCGATATC CCGTATGCAA CGGACCCGGC AGGAAACCGG CTGCCGGACC CGGAACTGCA TCCGGACAGC ACGCTCACGG CGTGGCCGGA TAACCGCATC GCGGAAGATG CGCACTATAT CTATCGCTAC GATGAATACG GCAGGCTGGC GGAGAAGACG GACCGTATCC CGGAAGGGGT TATCCGGATG CACGACGAGC GCACCCACCA CTATCACTAC GACAGCCAGC ACCGCCTGGT GTTCCACACG CGGATACAGC ACGGCGAACC ACAGGTGGAG AGCCGGTACC TCTATGACCC GCTGGGCCGC CGGACGGGAA AACGGGTGTG GCGGCGGGAG CGTGACCTGA CGGGGTGGAT GTCGCTGTCG CGTAAACCGG AGGAGACCTG GTACGGGTGG GACGGTGACA GGCTGACCAC TGTACAGACC CAACAGACAA GAATCCAGAC GGTATACCAG CCGGGAAGCT TCACGCCGCT CCTGAGAATC GAAACAGAGA ATGGTGAACA GGCGAAGGCG CGGCACCGTA GCCTGGCGGA GGTGTTGCAG GAGGACACGG GTGTGACGCT ACCGGCGGAG CTGGCGGTGA TGCTGGGGAG GCTGGAGCGG GAACTGCGGC AGGGCAGCGT GAGTGAAGAA AGCCAGCAGT GGCTTGCGCA GTGCGGGCTG ACGGCGGAAC AGATGGCCGC GCAGCTGGAG GCGGAATATA TCCCGGAGCG GAAACTTCAT CTTTACCACT GCGACCACCG GGGACTGCCG CAGGCGCTCA TCAGCCCGGA AGGGGAAACG GCGTGGCAGG GGGAGTATGA CGAGTGGGGA AACCTGCTGG GCGAAACCAG CGCGCAGCAC CTGCAACAGC CGTACCGTCT GCCGGGACAG CAGTATGATG AGGAGTCGGG GCTGTATTAC AACCGTCACC GGTACTATGA CCCGCTACAG GGGAGGTATA TCACCCAGGA TCCAATAGAT ATAAAAGGAG GATGGAATTT ATATTCTTAT GCGCTTAATC CGGTAAGTTG GATTGACCCA TTAGGATTGA CGCAGTGCGA TTCCGAAGGG TGTAATAATG ATATATTATT TACAGGAGGA AGTGGTCCCG ATAATAAAAT ACTTAATGAA TTAGGTCCGA GAGATGGCAT TGACGGTTTG GGTTCACAAA ACATGAAAAT GTATAGTGGG CTATTAGGGG GCGATATTTT AAAACCGGGG ATTCTGGGCG GTTTGACGCT TGGTAGTGTG ACACAACGAC CATCGCGAAC GGCTGAGGAG GAGGTGCAAG CTCAAATAGA ATATTTAGCA TATAAGAAGC GTTGTGAGCA AAAACCTGAT AGTAATTTAA GTCGTTGTGC TGCCGCGATC TTTCAAAAGA GCAGAAAAGA AGATTGTTTA AGAATGAGAC AAGAATGGGA TGATAAATGG TGGCCTGGCA AACATGCGGA TGAAATAAAA AATGTGAAAG CATCGTTGAA AAACACACTT ATTGATGTTA AAAAAAAGAT GCATGCCAAC TAG
|
Protein sequence | MGGKPAARQG DMTRKGLDIV QGSAGVLIGA PTGVACSVCP GGITYANPVN PVLGAKVLPG ETDLALPGPL PFILSRAYSS YRTRTPAPVG VFGPGWKAPF DIRLQIRDEG LILNDNGGRS IHFEPLFPGE ISYSRSESFW LARGGVAEQH SSQPLSALWQ VLPEDVRLSP HMYLATNSLQ GPWWILNWPE RVPGADEVLP PEPPAYRVLT GVVDGFGRTL AFHRAAEGDV AGAVTGVTDG AGRRFHLVLT TQAQRAEVFR KQRATSLSSP AGPRSASSSL VFPDTLPAGT EYGADNGIRL EAVWLTHDPA YPDELPAAPL ARYTYTASGE LRAVYDRSGT QVRGFAYDAE HAGRMVAHHY AGRPESRYRY DDTGRVTELV NPEGLDYRFE YGQDRVTITD SLNRREVLYT EGEGGLKRVV KKEHADGSIT RSEYDEAGRL KAQTDAAGRR TEYSLHMASG AVTAVTGPDG RTVRYGYNSQ RQVTSVTYPD GLRSSREYDE KGRLAAETSR SGETTRYSYD DPASELPTGI QDATGSTKQM AWSRYGQLLT FTDCSGYTTR YEYDRYGQQI AVHREEGIST YSSYNPRGQL VSQKDAQGRE TRYEYSAAGD LTAIVAPDGS RSEIQYDAWG KAVSTTQGGL TRSMGYDAAG RITVLTNENG SQSTFRYDPV DRLTEQRGFD GRTQRYHYDL TGKLTQSEDE GLVTLWHYDA SDRITHRTVN GDPAEQWQYD EHGWLTTLSH TSEGHRVSVH YGYDDKGRLT GERQTVENPE TGEMLWEHET GHAYSEQGLA TRQEPDGLPP VEWLTYGSGY LAGMKLGGTP LVEYMRDRLH RETARSFGGE AYELATAWNT SGQLRSRHLN LPQLDRDYDW NDNGQLIRIS GPQESREYRY SDTGRLTGVH TTAANLDIDI PYATDPAGNR LPDPELHPDS TLTAWPDNRI AEDAHYIYRY DEYGRLAEKT DRIPEGVIRM HDERTHHYHY DSQHRLVFHT RIQHGEPQVE SRYLYDPLGR RTGKRVWRRE RDLTGWMSLS RKPEETWYGW DGDRLTTVQT QQTRIQTVYQ PGSFTPLLRI ETENGEQAKA RHRSLAEVLQ EDTGVTLPAE LAVMLGRLER ELRQGSVSEE SQQWLAQCGL TAEQMAAQLE AEYIPERKLH LYHCDHRGLP QALISPEGET AWQGEYDEWG NLLGETSAQH LQQPYRLPGQ QYDEESGLYY NRHRYYDPLQ GRYITQDPID IKGGWNLYSY ALNPVSWIDP LGLTQCDSEG CNNDILFTGG SGPDNKILNE LGPRDGIDGL GSQNMKMYSG LLGGDILKPG ILGGLTLGSV TQRPSRTAEE EVQAQIEYLA YKKRCEQKPD SNLSRCAAAI FQKSRKEDCL RMRQEWDDKW WPGKHADEIK NVKASLKNTL IDVKKKMHAN
|
| |