Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0601 |
Symbol | |
ID | 6969303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 620324 |
End bp | 624520 |
Gene Length | 4197 bp |
Protein Length | 1398 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643384643 |
Product | RHS Repeat family protein |
Protein accession | YP_002269157 |
Protein GI | 209400573 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGAA AACCAGCGGC GCGTCAGGGA GATATGACTC AGTATGGCGG TCCCATTGTC CAGGGTTCGG CAGGTGTAAG AATTGGCGCG CCAACTGGCG TGGCGTGCTC GGTGTGCCCG GGCGGGATGA CTTCGGGCAA CCCGGTAAAT CCGCTGCTGG GGGCGAAGGT GCTGCCCGGC GAGACGGACC TTGCGCTGCC CGGCCCGCTG CCGTTCATTC TCTCCCGCAC CTACAGCAGC TACCGGACCC GGACGCCTGC GCCGGTGGGG ATTTTTGGCC CCGGCTGGAA AGCGCCTTCT GATATCCGCT TACAGCTACG CGATGATGCA CTGGTACTCA ATGACAACGG CGGGCGGAGC ATTCACTTTG AGCCGCTGCT GCCGGGGGAG GCGGTGTACA GCCGCAGCGA GTCAATGTGG CTGGTGCGCG GTGGTAAGGC AGCGCAGCCG GACGGCCACA CGCTGGCGCG GCTGTGGGGG GCGCTGCCGC CGGATATCCG GTTAAGCCCG CATCTTTACC TGGCGACCAA CAGCGCACAG GGGCCGTGGT GGATACTGGG GTGGTCAGAG CGGGTGCCGG GTGCTGAGGA CGTACTGCCA GCGCCGCTGC CGCCGTACCG GGTGCTTACC GGGATGGCGG ACCGCTTCGG GCGGACGCTG ACGTACAGGC GTGAGGCCGC CGGTGACCTG GCCGGGGAAA TCACCGGCGT GACGGACGGT GCCGGGCGGG AGTTCCGTCT GGTGCTGACC ACGCAGGCGC AGCGGGCGGA AGAGGCCCGT AAACAGCACA CCGCTTCTTT ATCTTCCCCT GACACCCCCC GCCCTCTTTC AGACTCAGCG TTCCCCGACA CACTGCCCGG TACCGAATAC GGTCCCGACA GAGGTATCCG CCTTTCGGCG GTGTGGCTGA CGCACGACCC GGCATACCCG GAGAGCCTGC CCGCTGCGCC ACTGGTGCGG TACACGTATA CGGAAGCCGG TGAACTGCTG GCGGTATATG ACCGCAGCAA TACGCAGGTG CGCGCTTTCA CGTATGACGC GCAGCACCCG GGCCGGATGG TGGGGCACCG TTATGCGGGA AGGCCGGAGA TGCGCTACCG CTACGACGAT GCGGGGCGGG TGGTGGAGCA ACTGAACCCG GCAGGCCTGA GTTACCACTA CCAGTATGAG CAGGACCGCA TCACCGTCAC GGACAGCCTG AACCGGCGTG AGGTGCTGCA TACAGAAGGC GGGGCCGGGC TGAAGCGGGT GGTGAAAAAA GAACTGGCGG ACGGCAGCGT CACGCACAGC GGGTATGACG CGGCAGGAAG GCTAACGGCG CAGACGGACG CGGCGGGACG GCGGACAGAG TATGGTCTGA ATGTGGTATC CGGCGATATC ACGGACATCA CCACACCGGA CGGGCGGGAG ACGAAATTTT ACTATAACGA CGGGAACCAG CTGACGGCGG TGGTGTCCCC GGACGGGCTG GAGAGCCGCC GGGCATATGA TGAACCGGGC AGGCTGGTAT CGGAGACATC GCGCAGCGGG GAGACAGTAC GCTACCGCTA CGATGATGCG TACAGTGAGT TACCGGCGAC GACAACAGAT GCGACGGGCA GCACCCGGCA GATGACCTGG AGCCGCTACG GTCAGTTGCT GGCGTTCACC GACTGCTCGG GCTACCAGAC CCGCTATGAA TACGACCGCT TCGGCCAGAT GACGGCGGTC CACCGCGAGG AAGGCATCAG CCTTTACCGC CGCTATGACA ACCGTGGCCG GTTAATCTCG GTGAAAGACG CACAGGGCCA TGAAACGCGG TATGAGTACA ACGCCGCAGG CGACCTGACT GCCGTTATCA CCCCGGACGG CAACCGGAGC GAGACACAGT ACGATGCGTG GGGAAAGGCG GTCAGCACCA CGCAGGGCGG GCTGACGCGC AGTATGGAAT ACGACCTTGC CGGACGCATC ACCACGCTGA CCAACGAGAA CGGCAGCCGG AGTGAGTTTA CCTACGATGC GCTTGACCGG CTGGTACAGC AGCGCGGCTT TGACGGGCGG ACGCAACGTT ACCACTATGA CCTGACCGGA AAACTCACGC AGAGTGAAGA TGAGGGGCTT GTCACCCTCT GGCACTACGA CGAATCGGAC CGCCTCACTC ACCGCACGGT GAACGGCGAA CCGGCAGAGC AGTGGCAGTA CGACGAGCAC GGCTGGCTGA CAGAAATCAG CCACCTGAGC GAAGGCCATC AGGTGGCGGT GCATTACGGT TATGATGATA AGGGCCGCCT GGCCGGGGAG CGCCAGACGG TGCATAACCC GGAGACGGGG GAACTGCTGT GGCAGCATGA GACAGAGCAC GCATACAACG AACAGGGTCT GGCAAACCGC GTCACGCCGG ACAGCCTGCC GCGGGTGGAG TGGCTGACCT ACGGCAGCGG TTATCTTGCG GGGATGAAGC TGGGCGGGAC GCCGCTGGTG GAGTTCACGC GCGACAGGCT GCACCGCGAG ACGGTGCGCA GCTTCGGCAA TAACGCATAC GAACTGACCA GCACATACAC TCCCGCAGGC CATTTACAGA GCCAGCGCCT GAACAGCCAG GTGTATGACC GTGACTACGA CTGGAATGAC AATGGCGACC TGGTGCGCAT CAGCGGCCCG CGACAGACGC GGGAATATGG CTACAGCGCC ACGGGCAGGC TGGAGAGCGT GCGCACCCTT GCATCAGACC TGGATATCCG CATCCCGTAT GCGACCGACC CGGCGGGAAA CCGGCTGCCG GACCCGGAGC TACACCCGGA CAGCACGCTC ACGGCGTGGC CGGATAACCG CATCGCGGAG GATGCGCACT ATGTCTACCG ACACGATGAA TACGGCAGGC TGACGGAGAA GACGGACCGC ATCCCGGCGG GTGTGATACG GACGGACGAC GAGCGGACCC ACCACTACCA CTACGACAGC CAGCACCGCC TGGTGTTCTA CACGCGGATA CAGCATGGCG AGCCACTGGT CGAGAGCCGC TACCTCTACG ACCCGCTGGG CCGCCGGACG GGGAAACGGG TGTGGCGTCG CGGGCGTGAC CTGACGGGAT GGATGTCGCT GTCGCGGAAA CCGGAGGTGA CGTGGTACGG GTGGGACGGC GACCGGCTGA CGACGGTACA GACCGACACC ACGCGTATCC AGACGGTATA CGAGCCGGGA AGCTTCGCGC CGCTCATCCG CATCGAAACA GACAACGGTG AGCGGGAGAA AGCGCAGCGC CGCAGCCTGG CGGAGAAGCT GCAGCAGGAA GGGAGCGAGG ACGGGCACGG TGTGGTATTT CCGGCTGAAC TGGTGCGGCT GCTGGACAGA CTGGAGGAAG AAATCCGGGC AGACCGCGTG AGCAGTGAAA GCCGGGCGTG GCTTGCGCAG TGCGGGCTGA CGGTGGAGCA ACTGGCCAGA CAGGTGGAGC CGGAATACAC ACCGGCGAGA AAAGTTCATC TTTACCACTG CGACCACCGG GGCCTGCCGC TGGCGCTCAT CAGCGAAGAC GGCAATACGG CGTGGAGCGG GGAGTATGAT GAATGGGGCA ACCAGCTGAA TGAGGAGAAC CCGCATCACC TGCACCAGCC GTACCGGCTG CCGGGGCAGC AGTATGATAA GGAGTCGGGG CTGTACTACA ACCGGAACCG GTACTACGAT CCGTTGCAGG GGCGGTATAT CACTCAGGAC CCGATAGGGC TGGAGGGGGG ATGGAGTCTG TATGCGTATC CGCTGAATCC GGTGAATGGT ATTGATCCAT TAGGGTTAAG TCCCGCAGAT GTAGCGCTAA TAAGAAGAAA AGATCAACTA AACCATCAAA GAGCATGGGA TATATTATCT GATACTTATG AAGATATGAA GAGATTAAAT TTAGGTGGGA CTGATCAATT TTTCCATTGT ATGGCATTTT GTCGAGTGTC TAAATTAAAT GACGCTGGTG TTAGCCGATC GGCGAAAGGG CTGGGTTATG AAAAAGAGAT TAGAGATTAC GGGTTAAATC TGTTCGGTAT GTACGGCAGA AAAGTAAAGC TATCCCATTC TGAAATGATT GAAGATAATA AAAAAGACTT GGCTGTAAAT GACCATGGGT TGACATGTCC ATCAACAACA GATTGCTCAG ATAGATGTAG TGATTATATT AATCCAGAGC ATAAAGAAAC GATAAAGGCT TTACAAGATG CTGGCTATCT CAAGTAA
|
Protein sequence | MSGKPAARQG DMTQYGGPIV QGSAGVRIGA PTGVACSVCP GGMTSGNPVN PLLGAKVLPG ETDLALPGPL PFILSRTYSS YRTRTPAPVG IFGPGWKAPS DIRLQLRDDA LVLNDNGGRS IHFEPLLPGE AVYSRSESMW LVRGGKAAQP DGHTLARLWG ALPPDIRLSP HLYLATNSAQ GPWWILGWSE RVPGAEDVLP APLPPYRVLT GMADRFGRTL TYRREAAGDL AGEITGVTDG AGREFRLVLT TQAQRAEEAR KQHTASLSSP DTPRPLSDSA FPDTLPGTEY GPDRGIRLSA VWLTHDPAYP ESLPAAPLVR YTYTEAGELL AVYDRSNTQV RAFTYDAQHP GRMVGHRYAG RPEMRYRYDD AGRVVEQLNP AGLSYHYQYE QDRITVTDSL NRREVLHTEG GAGLKRVVKK ELADGSVTHS GYDAAGRLTA QTDAAGRRTE YGLNVVSGDI TDITTPDGRE TKFYYNDGNQ LTAVVSPDGL ESRRAYDEPG RLVSETSRSG ETVRYRYDDA YSELPATTTD ATGSTRQMTW SRYGQLLAFT DCSGYQTRYE YDRFGQMTAV HREEGISLYR RYDNRGRLIS VKDAQGHETR YEYNAAGDLT AVITPDGNRS ETQYDAWGKA VSTTQGGLTR SMEYDLAGRI TTLTNENGSR SEFTYDALDR LVQQRGFDGR TQRYHYDLTG KLTQSEDEGL VTLWHYDESD RLTHRTVNGE PAEQWQYDEH GWLTEISHLS EGHQVAVHYG YDDKGRLAGE RQTVHNPETG ELLWQHETEH AYNEQGLANR VTPDSLPRVE WLTYGSGYLA GMKLGGTPLV EFTRDRLHRE TVRSFGNNAY ELTSTYTPAG HLQSQRLNSQ VYDRDYDWND NGDLVRISGP RQTREYGYSA TGRLESVRTL ASDLDIRIPY ATDPAGNRLP DPELHPDSTL TAWPDNRIAE DAHYVYRHDE YGRLTEKTDR IPAGVIRTDD ERTHHYHYDS QHRLVFYTRI QHGEPLVESR YLYDPLGRRT GKRVWRRGRD LTGWMSLSRK PEVTWYGWDG DRLTTVQTDT TRIQTVYEPG SFAPLIRIET DNGEREKAQR RSLAEKLQQE GSEDGHGVVF PAELVRLLDR LEEEIRADRV SSESRAWLAQ CGLTVEQLAR QVEPEYTPAR KVHLYHCDHR GLPLALISED GNTAWSGEYD EWGNQLNEEN PHHLHQPYRL PGQQYDKESG LYYNRNRYYD PLQGRYITQD PIGLEGGWSL YAYPLNPVNG IDPLGLSPAD VALIRRKDQL NHQRAWDILS DTYEDMKRLN LGGTDQFFHC MAFCRVSKLN DAGVSRSAKG LGYEKEIRDY GLNLFGMYGR KVKLSHSEMI EDNKKDLAVN DHGLTCPSTT DCSDRCSDYI NPEHKETIKA LQDAGYLK
|
| |