Gene ECH74115_0258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0258 
Symbol 
ID6967418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp276352 
End bp278112 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content57% 
IMG OID643384328 
ProductRHS Repeat family protein 
Protein accessionYP_002268844 
Protein GI209400909 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCCTGGC TGCACCGTGA GACGGCCCGC AGCTTCGGCG GGGCGGGCAG TACAGCGGGG 
TACGAACAGG TCACGGCGTA CACCCTCACA GGGCAGCTAC AGAGCAGGCA CCTGAACCTC
CCGCAGCTTG ACTGTGACTA CGACTGGAAC GACAACGGAC AGCTAATCCG CATCAGCGGC
CCGCAGGAGA GCCGGGAGTA CCGTTACAGC GACACGGGAA GGCTGACGGG CGTCCACACC
ACGGCAGCGA ACCTGGATAT CGATATCCCG TATGCAACGG ACCCGGCAGG AAACCGGCTG
CCGGACCCGG AACTGCATCC GGACAGCACG CTCACGGCGT GGCCGGATAA CCGCATCGCG
GAAGATGCGC ACTATGTCTA CCGCTACGAT GAATACGGCA GGCTGACGGA GAAGACGGAC
CGCATTCCGG AAGGGGTTAT CCGGATGCAC GACGAGCGGA CCCACCACTA CCATTACGAC
AACCAGCACC GCCTGGTGTT CTACACGCGG ATACAATACG GCGAGCCGCT GGTCGAGAGC
CGCTACCTCT ATGACCCGCT GGGCCGCCGG ACGGGGAAAC GGGTGTGGCG GCGGGAGCGT
GACCTGACGG GGTGGATGTC GCTGTCGCGT AAACCGGAGG TGACCTGGTA CGGGTGGGAC
GGCGACAGGC TGACGACGGT ACAGACCGGC ACCACACGTA TCCAGACGGT ATACCGGCCG
GGGAGCTTCA CACCGCTCAT CCGCATCGAA ACGGAGAACG GCGAGCGGGA GAAAGCGCAG
CGCCGCAGCC TGGCGGAGAA ACTCCAGCAG GAAGGGAGTG AGGACGGTCA CGGTGTGGTG
TTCCCGGCAG AACTGGTGAG GATGCTGGAC AGGCTGGAGG AAGAAATCCG GGCAGACCGC
GTGAGCAGTG AAAGCCGGGC GTGGCTTGCG CAGTGCGGAC TGACGGTGGA GCAACTGGAA
AAACAGGTGG AGCCGGAATA CACGCCGGCG CGCACGCTGC ATCTGTACCA CTGTGACCAC
CGGGGACTGC CGCTGGCGCT TATCAGCGAA GACGGCAATA CGGCGTGGAG CGCGGAATAT
GATGAATGGG GCAACCAGCT TAATGAGGAG AACCCGCATC ACCTGCACCA GCCGTACCGT
CTGCCGGGCC AGCAGCATGA TGAGGAGTCG GGGCTGTACT ATAACCGTCA CCGGTACTAC
GATCCGTTGC AGGGGCGGTA TATCACCCCG GACCCGATTG GGTTGAGAGG TGGATGGAAT
ATGTATCAGT ATCCGTTGAA TCCCATACAA GTGATAGACC CAATGGGGTT AGATGCGATT
GAGAATATGA CATCAGGTGG ACTAATTTAT GCCGTATCTG GTGTACCTGG ATTGATTGTT
GCAAACAGCA TTACTAACAG TGCTTACCAG TTCGGTTATG ATATGGATGC TATTGTTGGC
GGAGCTCATA ATGGGGCCGC CGATGCAATG AGATATTGTT ACTTGATGTG TCGAATGACT
AAGACATTTG GATCAACAAT AGCTGACGTG ATAGGTAAAA ATCATGAGGC GGCTGGGGAT
AGACAAGGTC AGCCAGCTAA AGAAAGAATC ATGGATCTTA AAAATAACAC TGTCGGTATT
GCTTGTGGCG ATTTTTCTGC CAAATGTAGC GATGCATGTA TTGAAAAATA TAACATTGGG
CAACTCTTCG GGTTAGATGG TATAAAAGCA GATAATCCAA TAAAAGCAAA GCAAGGGAGT
TCAGATGCTT CAAATTATTA G
 
Protein sequence
MPWLHRETAR SFGGAGSTAG YEQVTAYTLT GQLQSRHLNL PQLDCDYDWN DNGQLIRISG 
PQESREYRYS DTGRLTGVHT TAANLDIDIP YATDPAGNRL PDPELHPDST LTAWPDNRIA
EDAHYVYRYD EYGRLTEKTD RIPEGVIRMH DERTHHYHYD NQHRLVFYTR IQYGEPLVES
RYLYDPLGRR TGKRVWRRER DLTGWMSLSR KPEVTWYGWD GDRLTTVQTG TTRIQTVYRP
GSFTPLIRIE TENGEREKAQ RRSLAEKLQQ EGSEDGHGVV FPAELVRMLD RLEEEIRADR
VSSESRAWLA QCGLTVEQLE KQVEPEYTPA RTLHLYHCDH RGLPLALISE DGNTAWSAEY
DEWGNQLNEE NPHHLHQPYR LPGQQHDEES GLYYNRHRYY DPLQGRYITP DPIGLRGGWN
MYQYPLNPIQ VIDPMGLDAI ENMTSGGLIY AVSGVPGLIV ANSITNSAYQ FGYDMDAIVG
GAHNGAADAM RYCYLMCRMT KTFGSTIADV IGKNHEAAGD RQGQPAKERI MDLKNNTVGI
ACGDFSAKCS DACIEKYNIG QLFGLDGIKA DNPIKAKQGS SDASNY