Gene Information |
Locus tag | ECH74115_0254 |
Symbol | |
ID | 6972234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 272333 |
End bp | 273841 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643384324 |
Product | RhsG core protein with extension |
Protein accession | YP_002268840 |
Protein GI | 209399453 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
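
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 272333
END_BP = 273841


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under these assumptions the computed values agree with the listed Gene Length (1509 bp) and Protein Length (502 aa).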
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.814194 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGGACGCCC TGCAAGGGAA TTTCCCGTAT GCAACAGACC CGGCAGGAAA CCGGCTGCCG GACCCGGAAC TGCACCCGGA CAGCACCCTC ACGGCATGGC CGGATAACCG CATCGCGGAA GATGCGCACT ATGTCTACCG CTACGATGAA TACGGCAGGC TGGCGGAGAA GACGGACCGC ATCCCGGAAG GGGTTATCCG GATGCACGAC GAGCGCACCC ACCACTATCA CTACGACAGC CAGCACCGCC TGGTGTTCCA CACGCGGATA CAGCACGGCG AACCACAGGT GGAGAGCCGG TACCTCTATG ACCCGCTGGG CCGCCGGACG GGAAAACGGG TGTGGCGGCG GGAGCGTGAC CTGACGGGGT GGATGTCGCT GTCGCGTAAA CCGGAGGAGA CCTGGTACGG GTGGGACGGT GACAGGCTGA CCACTGTACA GACCCAACAG ACAAGAATCC AGACGGTATA CCAGCCGGGA AGCTTCACGC CGCTCCTGAG AATCGAAACA GAGAATGGTG AACAGGCGAA GGCGCGGCAC CGTAGCCTGG CGGAGGTGTT GCAGGAGGAC ACGGGTGTGA CGCTACCGGC GGAGCTGGCG GTGATGCTGG GAAGGCTGGA GCGGGAGCTG CGGCAGGGCA GCGTGAGTGA AGAAAGCCAG CAGTGGCTTG CGCAGTGCGG GCTGACGGCG GAGCAGATGG CCGCGCAGCT GGAGGCGGAA TACATCCCGG AGAGGAAACT TCATCTTTAC CACTGCGACC ACCGGGGACT GCCGCTGGCG CTCATCAGCC CGGAAGGGGA AACGGCGTGG CAGGGGGAGT ATGACGAGTG GGGAAACCTG CTGGGCGAAA CCAGCGCGCA GCACCTTCAA CAGTCACTCC GTCTGCCGGG GCAGCAGTAT GATGAGGAGT TGGGGCTGTA CTACAACCGC AACCGGTACT ATGATCCGTT GCAGGGGAGA TATATCACCC AGGACCCGAT AGGGCTGGAG GGGGGATGGA ACCTGTATCA GTACCCACTC AATCCTATTG AACATATAGA TCCGTTGGGG TTAGCACTTG ATTTGAATTA TTATTCTCCG TCAGATCCTA TATATAAAGG TTCTCTTAAT GTAAGGGAAT TTCCAACGGG TTTTACAGTT GGAGGTCATG GTTCGCCTAC ATCTATGAGT GATGACAGAA TCAAAAAAGG AAGTGATCTG ACAATAAAAC AATTAGCTAG CGACATAAGA GCGAATCCTA AATATCATGA AGGTATGCCT GTAGTCTTGT TTTCCTGCGA AACAGGAAAA GGCAAAAATT CATTTGCACA GAAACTAGCT AACGAACTTG ATGCTACAGT GATTGCTCCT GATGAAATAA TATGGATTTG GCCGGATGGA AACTATGCTA TTATGGGGCA AACAGCTAGG ATAACAATAG GCGGTAAGGA CAACGGGGTG TTTGAATTGG TGCCAGACGA GAAACAACCA GGGGACTTTC ATAAGTTTAC ACCTACAGGA AGCAAATAG
Protein sequence | MDALQGNFPY ATDPAGNRLP DPELHPDSTL TAWPDNRIAE DAHYVYRYDE YGRLAEKTDR IPEGVIRMHD ERTHHYHYDS QHRLVFHTRI QHGEPQVESR YLYDPLGRRT GKRVWRRERD LTGWMSLSRK PEETWYGWDG DRLTTVQTQQ TRIQTVYQPG SFTPLLRIET ENGEQAKARH RSLAEVLQED TGVTLPAELA VMLGRLEREL RQGSVSEESQ QWLAQCGLTA EQMAAQLEAE YIPERKLHLY HCDHRGLPLA LISPEGETAW QGEYDEWGNL LGETSAQHLQ QSLRLPGQQY DEELGLYYNR NRYYDPLQGR YITQDPIGLE GGWNLYQYPL NPIEHIDPLG LALDLNYYSP SDPIYKGSLN VREFPTGFTV GGHGSPTSMS DDRIKKGSDL TIKQLASDIR ANPKYHEGMP VVLFSCETGK GKNSFAQKLA NELDATVIAP DEIIWIWPDG NYAIMGQTAR ITIGGKDNGV FELVPDEKQP GDFHKFTPTG SK
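
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, where GTG is an accepted alternative start codon and an initiating codon is read as methionine. The sketch below is a minimal hand-rolled translator, not the database's own method. It uses the codon layout of the NCBI genetic code tables (bases ordered T, C, A, G) and a deliberately partial start-codon set (ATG, GTG, TTG), which is an assumption made for brevity.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same elongation code as the standard table).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Partial list of table 11 start codons (assumption: only these three are handled here).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS; an initial start codon becomes M, translation stops at '*'."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        index = 16 * BASES.index(codon[0]) + 4 * BASES.index(codon[1]) + BASES.index(codon[2])
        residue = AMINO_ACIDS[index]
        if i == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is read as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 18 bases of the gene sequence listed above.
    print(translate("GTGGACGCCC TGCAAGGG"))
```

For the first six codons of the listed gene sequence this yields MDALQG, matching the start of the listed protein sequence.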