Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4965 |
Symbol | |
ID | 6966670 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4607989 |
End bp | 4612218 |
Gene Length | 4230 bp |
Protein Length | 1409 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643388647 |
Product | Rhs family protein |
Protein accession | YP_002273074 |
Protein GI | 209396291 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.832182 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGAA AACCGGCGGC GCGTCAGGGC GACATGACGC AGTATGGCGG TAGCATTGTT CAGGGTTCAG CCGGGGTGCG CATTGGTGCC CCCACCGGCG TGGCCTGTTC GGTGTGCCCC GGCGGAGTGA CGTCCGGCCA TCCGGTCAAT CCCCTGCTCG GTGCAAAGGT CCTTCCCGGT GAAACCGACA TCGCCCTGCC CGGCCCGCTG CCGTTCATCC TCTCCCGCAC CTACAGCAGT TACCGGACAA AAACGCCCGC GCCGGTGGGG AGCCTCGGCC CCGGCTGGAA AATGCCTGCG GATATCCGCT TACAGCTGCG CGATAACACA CTGATACTCA GTGATAACGG CGGCAGAAGC CTGTATTTTG AGCACCTGTT TCCCGGTGAG GACGGTTACA GCCGCAGCGA GTCACTCTGG CTGGTGCGCG GCGGCGTGGC GAAACTGGAT GAAGGTCACC GGCTGGCCGC ACTCTGGCAG GCGCTGCCGG AAGAACTCCG CTTAAGTCCG CATCGTTATC TGGCGACAAA CAGTCCGCAG GGGCCGTGGT GGCTGCTCGG CTGGTGTGAG CGGGTGCCGG AAGCGGATGA GGTGCTGCCT GCGCCGCTGC CGCCGTACCG GGTACTGACC GGGCTGGTGG ACCGCTTCGG GCGCACACAG ACGTTCCACC GCGAAGCCGC CGGTGAATTC AGCGGCGAAA TCACCGGCGT GACGGATGGT GCCGGGCGTC ACTTCCGGCT GGTACTGACC ACGCAGGCGC AGCGGGCAGA AGAAGCCCGG CAGCAGGCCA TTTCCGGCGG GACGGAACCG TCCGCTTTTC CTGATACCCT GCCGGGTTAC ACCGAATATG GCCGGGACAA CGGCATCCGT CTGTCTGCCG TGTGGCTGAC GCACGACCCG GAATACCCGG AGAATTTACC TGCCGCGCCG CTGGTGCGCT ATGGCTGGAC GCCGCGCGGC GAACTGGCGG TGGTGTATGA CCGTAGTGGC AAACAGGTGC GCAGCTTTAC TTACGATGAT AAATACCGGG GCCGGATGGT GGCGCACCGT CACACGGGCC GGCCGGAAAT CCGTTACCGT TACGACAGCG ACGGGCGGGT GACAGAACAG CTAAACCCGG CAGGCTTAAG CTACACGTAT CAGTATGAGA AAGACCGCAT CACCATCACC GACAGCCTGA ACCGCCGTGA AGTCCTGCAC ACGCAGGGTG AAGGCGGGCT GAAGCGGGTG GTGAAAAAGG AACACGCGGA CGGCAGCGTC ACGCAGAGTC AGTTTGACGC GGTGGGCAGG CTCAGGGCAC AGACGGATGC CGCAGGCAGG ACAACAGAAT ACAGCCCGGA TGTGGTGACG GGCCTCATCA CGCGCATCAC CACGCCGGAT GGCAGGGCAT CGGCGTTTTA CTATAACCAC CACAGCCAGT TAACGTCAGC CACCGGGCCT GACGGGCTGG AAATACGCCG GGAATATGAT GAATGGGGCC GTCTGATTCA GGAAACTGCC CCTGACGGCG ATATCACCCG CTACCGTTAT GATAATCCAC ACAGTGACTT ACCCTGCGCA ACGGAAGATG CCACCGGCAG CCGGAAAACC ATGACGTGGA GCCGTTACGG TCAGTTGCTG AGCTTCACCG ACTGTTCCGG TTATGTAACC CGTTATGACC ATGACCGCTT CGGGCAGATG ACGGCGGTGC ACCGCGAGGA AGGGCTGAGT CAGTACCGCG CATACGACAG CCGTGGACAG TTAATTGCCG TGAAAGACAC GCAGGGCCAT GAAACGCGGT ATGAATACAA CGCCGCCGGT GACCTGACCA CCGTCATTGC CCCGGACGGC AGCAGAAACG GGACACAGTA CGATGCGTGG GGAAAAGCCA TCTGTACCAC GCAGGGCGGT CTGACGCGCA GTATGGAATA CGATGCTGCC GGACGGGTCA TCCGCCTGAC CAGTGAAAAC GGCAGCCACA CCACCTTCCG TTACGATGTA CTCGACCGGC TGATACAGGA AACCGGCTTT GACGGCCGCA CACAGCGTTA TCACCACGAC CTGACCGGCA AACTTATCCG CAGCGAGGAT GAGGGGCTGG TCACCCACTG GCACTATGAC GAAGCAGACC GCCTCACGCA CCGCACCGTG AAGGGTGAAA CCGCAGAGCG CTGGCAGTAT GACGAACGCG GCTGGCTGAC AGACATCAGC CATATCAGCG AAGGGCACCG GGTGACGGTG CATTACGGGT ATGATGAGAA AGGCCGGCTG ACCGGTGAGC GTCAGACGGT GCATCACCCG CAGACGGAAG CACTGCTCTG GCAGCATGAG ACCAGACACG CTTACAACGC GCAGGGGCTG GCGAACCGCT GTATACCGGA CAGCCTGCCC GCCGTGGAAT GGCTGACCTA TGGCAGCGGC TGGCTGGCAG GCATGAAGCT CGGCGACACA CCGCTGGTGG ATTTCACCCG CGACCGCCTG CACCGGAAAA CGCTGCGCAG ATTCGGCCGT TATGAACTCA CCACCGCTTA TACCCCTGCC GGGCAGTTAC AGAGCCAGCA CCTGAACAGC CTGCAGTATG ACCGCGATTA CACCTGGAAC GACAACGGCG AACTCATCCG CATCAGCAGC CCGCGCCAGA CCCGGAGTTA CAGCTACAGC GACTCCGGCA GGCTGACCGG CGTTCACACC ACCGCAGCGA ATCTGGATAT CCGCATCCCG TATGCCACGG ACCCGGCAGG TAACCGCCTG CCCGACCCGG AGCTGCACCC GGACAGCACC CTCAGCATGT GGCCGGATAA CCGTATCGCC CGTGACGCGC ACTATCTTTA CTGGTATGAC CGTCACGGCA GGCTGACAGA GAAAACCGAC CTCATCCCGG AAGGGGTTAT CCGCACGGAT GATGAGCGGA CTCACCGGTA CCATTACGAC AGTCAGCACC GGCTGGTGCA CTACACGCGG ACACAATATG AAGAGCCGCT GGTCGAAAGC CGCTATCTTT ACGACCCGCT GGGCCGCAGG GTGGCAAAAC GGGTGTGGCG ACGTGAACGG GACCTGACGG GCTGGATGTC GCTGTCACGG AAACCGCAAG TGACCTGGTA CGGCTGGGAC GGCGACCGGC TGACCACAAT ACAGAACGAC AGAACCCGCA TCCAGACGAT TTATCAGCCG GGGAGCTTCA CGCCACTCAT CAGGGTTGAA ACCGCCACCG GTGAGCTGGC GAAAACGCAG CGCCGCAGCC TGGCGGATGC GCTTCAGCAG TCCGGCGGCG AAGACGGTGG CAGTGTGGTG TTCCCGCCGG TGCTGGTGCA GATGCTCGAC CGGCTGGAAA GTGAAATCCT GGCTGACCGG GTGAGTGAGG AAAGCCGCCG CTGGCTGGCA TCGTGCGGCC TGACTGTGGC GCAGATGCAA AGCCAGATGG ACCCGGTATA CACGCCGGCG CGAAAAATCC ACCTGTACCA CTGCGACCAT CGCGGCCTGC CGCTGGCCCT TATCAGTAAG GAAGGGGCAA CAGAATGGTG CGCAGAATAC GATGAGTGGG GCAACCTGCT GAATGAAGAG AACCCGCATC AGCTGCAGCA GCTTATCCGC CTGCCGGGGC AGCAGTATGA TGAGGAGTCC GGCCTGTATT ACAACCGCCA CCGCTATTAT GACCCGCTGC ACGGGCGATA TATCACTCAG GATCCGATTG GACTGAAGGG GGGATGGAAT TTTTATCAGT ATCCGTTGAA TCCGGTCATA AATGTAGATC CGCAAGGTTT GGTTGATATA AATTTATACC CCGAAAGTGA TCTTATCCAT TCTGTAGCTG ATGAGATTAA TATCCCAGGC GTTTTCACAA TCGGGGGGCA TGGTACCCCC ACATCTATTG AATCCGCAAC GCGCAGTATC ATGACAGCTA AAGATCTAGC ATATCTAATT AAATTTGATG GGAATTATAA AGATGGGATG ACAGTTTGGT TATTTTCTTG TAATACAGGT AAAGGACAAA ATTCATTTGC TAGCCAATTA GCTAAAGAGT TACATACAAA TGTAATAGGA CCTGACACGC TATGGACGTG GTGGGGGCGA GGAACTAATG GTAAGTTAAA AATGGATACA GTGCTAACAG CACCAACGAA CCTTAATTCA AATAAGGATC TAATGGCTAT AACAACAAAA GACCTTGGTA ATTGGATAAC ATATGGGCCA TCTGGGCACC CCATTTCTAA TATGCAAGGT ACGCCAGAAA AACCCAGTGA TATAAGATAG
|
Protein sequence | MSGKPAARQG DMTQYGGSIV QGSAGVRIGA PTGVACSVCP GGVTSGHPVN PLLGAKVLPG ETDIALPGPL PFILSRTYSS YRTKTPAPVG SLGPGWKMPA DIRLQLRDNT LILSDNGGRS LYFEHLFPGE DGYSRSESLW LVRGGVAKLD EGHRLAALWQ ALPEELRLSP HRYLATNSPQ GPWWLLGWCE RVPEADEVLP APLPPYRVLT GLVDRFGRTQ TFHREAAGEF SGEITGVTDG AGRHFRLVLT TQAQRAEEAR QQAISGGTEP SAFPDTLPGY TEYGRDNGIR LSAVWLTHDP EYPENLPAAP LVRYGWTPRG ELAVVYDRSG KQVRSFTYDD KYRGRMVAHR HTGRPEIRYR YDSDGRVTEQ LNPAGLSYTY QYEKDRITIT DSLNRREVLH TQGEGGLKRV VKKEHADGSV TQSQFDAVGR LRAQTDAAGR TTEYSPDVVT GLITRITTPD GRASAFYYNH HSQLTSATGP DGLEIRREYD EWGRLIQETA PDGDITRYRY DNPHSDLPCA TEDATGSRKT MTWSRYGQLL SFTDCSGYVT RYDHDRFGQM TAVHREEGLS QYRAYDSRGQ LIAVKDTQGH ETRYEYNAAG DLTTVIAPDG SRNGTQYDAW GKAICTTQGG LTRSMEYDAA GRVIRLTSEN GSHTTFRYDV LDRLIQETGF DGRTQRYHHD LTGKLIRSED EGLVTHWHYD EADRLTHRTV KGETAERWQY DERGWLTDIS HISEGHRVTV HYGYDEKGRL TGERQTVHHP QTEALLWQHE TRHAYNAQGL ANRCIPDSLP AVEWLTYGSG WLAGMKLGDT PLVDFTRDRL HRKTLRRFGR YELTTAYTPA GQLQSQHLNS LQYDRDYTWN DNGELIRISS PRQTRSYSYS DSGRLTGVHT TAANLDIRIP YATDPAGNRL PDPELHPDST LSMWPDNRIA RDAHYLYWYD RHGRLTEKTD LIPEGVIRTD DERTHRYHYD SQHRLVHYTR TQYEEPLVES RYLYDPLGRR VAKRVWRRER DLTGWMSLSR KPQVTWYGWD GDRLTTIQND RTRIQTIYQP GSFTPLIRVE TATGELAKTQ RRSLADALQQ SGGEDGGSVV FPPVLVQMLD RLESEILADR VSEESRRWLA SCGLTVAQMQ SQMDPVYTPA RKIHLYHCDH RGLPLALISK EGATEWCAEY DEWGNLLNEE NPHQLQQLIR LPGQQYDEES GLYYNRHRYY DPLHGRYITQ DPIGLKGGWN FYQYPLNPVI NVDPQGLVDI NLYPESDLIH SVADEINIPG VFTIGGHGTP TSIESATRSI MTAKDLAYLI KFDGNYKDGM TVWLFSCNTG KGQNSFASQL AKELHTNVIG PDTLWTWWGR GTNGKLKMDT VLTAPTNLNS NKDLMAITTK DLGNWITYGP SGHPISNMQG TPEKPSDIR
|
| |