Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2065 |
Symbol | |
ID | 6971881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1960504 |
End bp | 1964706 |
Gene Length | 4203 bp |
Protein Length | 1400 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643385977 |
Product | protein rhsD |
Protein accession | YP_002270466 |
Protein GI | 209397644 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.595023 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGAA AACCAGCGGC GCGTCAGGGA GATATGACTC AGTATGGCGG TCCCATTGTC CAGGGTTCGG CAGGTGTAAG AATTGGCGCG CCAACTGGCG TGGCGTGCTC GGTGTGCCCG GGCGGGATGA CTTCGGGCAA CCCGGTAAAT CCGCTGCTGG GGGCGAAGGT GCTGCCCGGC GAGACGGACC TTGCGCTGCC CGGCCCGCTG CCGTTCATTC TCTCCCGCAC CTACAGCAGC TACCGGACCC GGACGCCTGC GCCGGTGGGG ATTTTTGGCC CCGGCTGGAA AGCGCCTTCT GATATCCGCT TACAGCTACG CGATGATGCA CTGGTACTCA ATGACAACGG CGGGCGGAGC ATTCACTTTG AGCCGCTGCT GCCGGGGGAG GCGGTGTACA GCCGCAGCGA GTCAATGTGG CTGGTGCGCG GTGGTAAGGC AGCGCAGCCG GACGGCCACA CGCTGGCGCG GCTGTGGGGG GCGCTGCCGC CGGATATCCG GTTAAGCCCG CATCTTTACC TGGCGACCAA CAGCGCACAG GGGCCGTGGT GGATACTGGG GTGGTCAGAG CGGGTGCCGG GTGCTGAGGA CGTACTGCCA GCGCCGCTGC CGCCGTACCG GGTGCTTACC GGGATGGCGG ACCGCTTCGG GCGGACGCTG ACGTACAGGC GTGAGGCCGC CGGTGACCTG GCCGGGGAAA TCACCGGCGT GACGGACGGT GCCGGGCGGG AGTTCCGTCT GGTGCTGACC ACGCAGGCGC AGCGGGCGGA AGAGGCCCGT AAACAGCACA CCGCTTCTTT ATCTTCCCCT GACCCCCCCC GCCCTCTTTC AGACTCAGCG TTCCCCGACA CACTGCCCGG TACCGAATAC GGTCCCGACA GAGGTATCCG CCTTTCGGCG GTGTGGCTGA CGCACGACCC GGCATACCCG GAAAGCCTGC CCGGTGCGCC ACTGGCGCGG TACACGTATA CGGAAGCCGG TGAACTGCTG GCGGTATATG ACCGCAGCAA TACGCAGGTG CGCGCTTTCA CGTATGACGC GCAGCATCCG GGCCGGATGG TGGCGCACCG TTACGCGGGA AGGCCGGAGA TGCGCTACCG CTACGACGAT ACCGGGCGGG TGGTGGAGCA GCTGAACCCG GCAGGCCTGA GTTACCGCTA CCAGTATGAG CAGGACCGCA TCACCGTCAC GGACAGCCTG AACCGGCGTG AGGTGCTGCA TACAGAAGGC GGGGCCGGGC TGAAGCGGGT GGTGAAAAAA GAACTGGCGG ACGGCAGCGT CACGCACAGC GGGTATGACG CGGCAGGAAG GCTCACGGCG CAGACGGACG CGGCGGGACG GCGGACAGAG TACGGTCTGA ATGTGGTGTC CGGCGATATC ACGGACATCA CCACACCGGA CGGGCGGGAG ACGAAATTTT ACTATAACGA CGGGAACCAG CTGACGGCGG TGGTGTCCCC GGACGGGCTG GAGAGCCGCC GGGCATATGA TGAACCGGGC AGGCTGGTAT CGGAGACATC GCGCTGTGGG GACGTCATCC GGTATGCTTA TGATAATCCG CACAGTGAAT TACCGGCCAC GACAACAGAT GCGACGGGCA GCACCCGGCA GATGACCTGG AGCCGCTACG GGCAGTTGCT GGCGTTCACC GACTGCTCGG GCTACCAGAC CCGTTATGAA TACGACCGCT TTGGTCAGAT GACGGCGGTC CACCGTGAGG AAGGTATCAG CCGTTACCGC CGCTATGACA ACCGTGGCCG GTTAACCTCG GTGAAAGACG CACAGGGCCA TGAAACGCGG TATGAGTACA ACGCCGCAGG CGACCTGACT GCCGTTATCA CTCCGGACGG CAACCGGAGC GAGACACAGT ACGATGCGTG GGGAAAAGCG GTCAGCACCA CGCAGGGCGG GCTGACGCGC AGTATGGAGT ATGACCTCGC CGGACGCATC ACCACGCTGA CCAACGAGAA CGGCAGCCGG AGTGAGTTTA CCTACGATGC GCTTGACCGG CTGGTACAGC AGCGCGGCTT TGACGGGCGG ACGCAACGTT ACCACTATGA CCTGACCGGA AAACTCACGC AGAGTGAAGA TGAGGGGCTT GTCACCCTCT GGCACTACGA CGAATCGGAC CGCCTCACTC ACCGCACGGT GAACGGCGAA CCGGCAGAGC AGTGGCAGTA CGACGAGCAC GGCTGGCTGA CAGAAATCAG CCACCTGAGC GAAGGCCATC AGGTGGCGGT GCATTACGGT TATGATGATA AGGGCCGCCT GGCCGGGGAG CGCCAGACGG TGCATAACCC GGAGACGGGG GAACTGCTGT GGCAGCATGA GACAGAGCAC GCATACAACG AACAGGGTCT GGCAAACCGC GTCACGCCGG ACAGCCTGCC GCGGGTGGAG TGGCTGACCT ACGGCAGCGG TTATCTTGCG GGGATGAAGC TGGGCGGGAC GCCGCTGGTG GAGTTCACGC GCGACAGGCT GCACCGCGAG ACGGTGCGCA GCTTCGGCAA TAACGCATAC GAACTGACCA GCACATACAC TCCCGCAGGC CATTTACAGA GCCAGCGCCT GAACAGCCAG GTGTATGACC GTGACTACGA CTGGAATGAC AATGGCGACC TGGTGCGCAT CAGCGGCCCG CGACAGACGT GGGAATATGG CTACAGTGCC ACGGGCAGGC TGGAGAGCGT GCGCACCCTT GCATCAGACC TGGATATCCG CATCCCGTAT GCGACCGACC CGGCGGGAAA CCGGCTGCCG GACCCGGAGC TACACCCGGA CAGCACGCTC ACGGCGTGGC CGGATAACCG CATCGCGGAG GATGCGCACT ATGTCTACCG ACACGATGAA TACGGCAGGC TGACGGAGAA GACGGACCGC ATCCCGGCGG GTGTGATACG GACGGACGAC GAGCGGACCC ACCACTACCA CTACGACAGC CAGCACCGCC TGGTGTTCTA CACGCGGATA CAGCATGGCG AGCCACTGGT CGAGAGCCGC TACCTCTACG ACCCGCTGGG ACGGCGAATG GCAAAACGGG TCTGGCGGCG GGAGCGTGAC CTGACGGGGT GGATGTCGCT GTCGCGTAAA CCGGAGGTGA CGTGGTATGG CTGGGACGGA GACAGGCTGA CGACGGTGCA GACTGACACC ACACGTATCC AGACGGTATA CGAGCCGGGA AGCTTCACGC CGCTCATCCG GGTCGAGACA GAGAACGGCG AGCGGGAAAA AGCGCAGCGG CGCAGCCTGG CAGAGACGCT CCAGCAGGAA GGGAGTGAGA ACGGCCACGG CGTGGTGTTC CCGGCTGAAC TGGTGCGGCT GCTGGACAGG CTGGAGGAAG AAATCCGGGC AGACCGCGTG AGCAGTGAAA GCCGGGCGTG GCTTGCGCAG TGCGGGCTGA CGGTGGAGCA ACTGGCCAGA CAGGTGGAGC CGGAATACAC ACCGGCGCGA AAAGTTCATT TTTACCACTG CGACCACCGG GGCCTGCCGC TGGCGCTCAT CAGCGAAGAC GGCAATACGG CGTGGCGCGG GGAGTATGAT GAATGGGGCA ACCAGCTTAA TGAAGAGAAC CCGTATTACC TGCACCAGCC ATACCGTCTG CCGGGGCAGC AGCATGATGA GGAATCAGGG CTGTACTATA ACCGGAACCG GTACTATGAC CCGCTACAGG GGAGGTATAT TACACAAGAC CCCATTGGGC TGGCGGGGGG ATGGAATCTG TATAATTACC CACTGAATCC GATAATAAGG ATGGATCCTT TGGGTTTGTA TAATTTATAT CAATTATTAT ATGATGTTTG GCATGATGAT TCATATGGAA CATCATCAAT TGATATTACT GGCAGTGGAG ATCTAATATC ATTAGGTGGT CATGCAGGAC TTGGCGTTGC GTTTGCTAAA AAGAAAGGTG AAATGTTATC TGATATTTGT ATTTATGCTA CAGCATGCGG ACATGCAGGA ATTGGTGGTG GGATAAATGC GGCTATCACA TATTCTGAGA CCAAGTCTTT ACCTACATCG GGAGTCAGCA ATTCAGTAGG TGTAACGGTT GGCGGCGGAG TTGGGGGGCA TTTTGCGTAT ACTTATGTAG TGGATGTTGA TAATCCAGAA TCATCGACAG AATCTGTTGG TATCGGTGCA GGTGTTGACG CTTCAGTTAT GACTCTGGCT TGTAGAACGT GGCAAGAATG CTGGGTCAAT TAA
|
Protein sequence | MSGKPAARQG DMTQYGGPIV QGSAGVRIGA PTGVACSVCP GGMTSGNPVN PLLGAKVLPG ETDLALPGPL PFILSRTYSS YRTRTPAPVG IFGPGWKAPS DIRLQLRDDA LVLNDNGGRS IHFEPLLPGE AVYSRSESMW LVRGGKAAQP DGHTLARLWG ALPPDIRLSP HLYLATNSAQ GPWWILGWSE RVPGAEDVLP APLPPYRVLT GMADRFGRTL TYRREAAGDL AGEITGVTDG AGREFRLVLT TQAQRAEEAR KQHTASLSSP DPPRPLSDSA FPDTLPGTEY GPDRGIRLSA VWLTHDPAYP ESLPGAPLAR YTYTEAGELL AVYDRSNTQV RAFTYDAQHP GRMVAHRYAG RPEMRYRYDD TGRVVEQLNP AGLSYRYQYE QDRITVTDSL NRREVLHTEG GAGLKRVVKK ELADGSVTHS GYDAAGRLTA QTDAAGRRTE YGLNVVSGDI TDITTPDGRE TKFYYNDGNQ LTAVVSPDGL ESRRAYDEPG RLVSETSRCG DVIRYAYDNP HSELPATTTD ATGSTRQMTW SRYGQLLAFT DCSGYQTRYE YDRFGQMTAV HREEGISRYR RYDNRGRLTS VKDAQGHETR YEYNAAGDLT AVITPDGNRS ETQYDAWGKA VSTTQGGLTR SMEYDLAGRI TTLTNENGSR SEFTYDALDR LVQQRGFDGR TQRYHYDLTG KLTQSEDEGL VTLWHYDESD RLTHRTVNGE PAEQWQYDEH GWLTEISHLS EGHQVAVHYG YDDKGRLAGE RQTVHNPETG ELLWQHETEH AYNEQGLANR VTPDSLPRVE WLTYGSGYLA GMKLGGTPLV EFTRDRLHRE TVRSFGNNAY ELTSTYTPAG HLQSQRLNSQ VYDRDYDWND NGDLVRISGP RQTWEYGYSA TGRLESVRTL ASDLDIRIPY ATDPAGNRLP DPELHPDSTL TAWPDNRIAE DAHYVYRHDE YGRLTEKTDR IPAGVIRTDD ERTHHYHYDS QHRLVFYTRI QHGEPLVESR YLYDPLGRRM AKRVWRRERD LTGWMSLSRK PEVTWYGWDG DRLTTVQTDT TRIQTVYEPG SFTPLIRVET ENGEREKAQR RSLAETLQQE GSENGHGVVF PAELVRLLDR LEEEIRADRV SSESRAWLAQ CGLTVEQLAR QVEPEYTPAR KVHFYHCDHR GLPLALISED GNTAWRGEYD EWGNQLNEEN PYYLHQPYRL PGQQHDEESG LYYNRNRYYD PLQGRYITQD PIGLAGGWNL YNYPLNPIIR MDPLGLYNLY QLLYDVWHDD SYGTSSIDIT GSGDLISLGG HAGLGVAFAK KKGEMLSDIC IYATACGHAG IGGGINAAIT YSETKSLPTS GVSNSVGVTV GGGVGGHFAY TYVVDVDNPE SSTESVGIGA GVDASVMTLA CRTWQECWVN
|
| |