Gene ECH74115_5215 details


Gene Information

Locus tag: ECH74115_5215
Symbol: rho
ID: 6967884
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4862919
End bp: 4864178
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 50%
IMG OID: 643388880
Product: transcription termination factor Rho
Protein accession: YP_002273300
Protein GI: 209400658
COG category: [K] Transcription
COG ID: [COG1158] Transcription termination factor
TIGRFAM ID: [TIGR00767] transcription termination factor Rho
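
The coordinates and lengths listed in this record are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4862919, 4864178)
print(length, protein_length(length))  # 1260 419
```

Both values agree with the Gene Length and Protein Length fields of the record.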


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 74
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCTTA CCGAATTAAA GAATACGCCG GTTTCTGAGC TGATCACTCT CGGCGAAAAT 
ATGGGGCTGG AAAACCTGGC TCGTATGCGT AAGCAGGACA TTATTTTTGC TATCCTGAAG
CAGCACGCAA AGAGTGGCGA AGATATCTTT GGTGATGGCG TACTGGAGAT ATTGCAGGAT
GGATTTGGTT TCCTCCGTTC CGCAGACAGC TCCTACCTCG CCGGTCCTGA TGACATCTAC
GTTTCCCCTA GCCAAATCCG CCGTTTCAAC CTCCGCACTG GTGATACCAT CTCTGGTAAG
ATTCGCCCGC CGAAAGAAGG TGAACGCTAT TTTGCGCTGC TGAAAGTTAA CGAAGTTAAC
TTCGACAAAC CTGAAAACGC CCGCAACAAA ATCCTCTTTG AGAACTTAAC CCCGCTGCAC
GCAAACTCTC GTCTGCGTAT GGAACGTGGT AACGGTTCTA CTGAAGATTT AACCGCTCGC
GTACTGGATC TGGCATCACC TATCGGTCGT GGTCAGCGTG GTCTGATTGT GGCACCGCCG
AAAGCCGGTA AAACCATGCT GCTGCAGAAC ATTGCTCAGA GCATTGCTTA CAACCACCCG
GATTGTGTGC TGATGGTTCT GCTGATCGAC GAACGTCCGG AAGAAGTAAC CGAGATGCAG
CGACTGGTAA AAGGTGAAGT TGTTGCTTCT ACCTTTGACG AACCCGCATC TCGCCACGTT
CAGGTTGCGG AAATGGTGAT CGAGAAGGCG AAACGCCTGG TTGAGCACAA GAAAGACGTT
ATCATTCTGC TCGACTCCAT CACTCGTCTG GCGCGCGCTT ACAACACCGT TGTTCCGGCG
TCAGGTAAAG TGTTGACCGG TGGTGTGGAT GCCAACGCCC TGCATCGTCC GAAACGCTTC
TTCGGTGCGG CGCGTAACGT GGAAGAGGGC GGCAGCCTGA CCATTATCGC GACGGCGCTT
ATCGATACCG GTTCTAAAAT GGACGAAGTT ATTTACGAAG AGTTTAAAGG TACAGGCAAC
ATGGAACTGC ACCTCTCTCG TAAGATCGCT GAAAAACGCG TCTTCCCGGC TATCGACTAC
AACCGTTCCG GTACCCGTAA AGAAGAGCTG CTCACGACTC AGGAAGAACT GCAGAAAATG
TGGATCCTGC GCAAAATCAT TCACCCGATG GGCGAAATCG ATGCAATGGA ATTCCTCATT
AATAAACTGG CAATGACCAA GACCAATGAC GATTTCTTCG AAATGATGAA ACGCTCATAA
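
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The following is a minimal sketch of how a GC content figure like the one in the record could be recomputed. The helper names are hypothetical, and the method the database itself uses (for example its rounding) is not stated in the source.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here only as sample input.
first_line = "ATGAATCTTA CCGAATTAAA GAATACGCCG GTTTCTGAGC TGATCACTCT CGGCGAAAAT"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_content(first_line), 1))
```

Passing the full 1260 bp sequence instead of the first line gives the value for the whole gene.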
 
Protein sequence
MNLTELKNTP VSELITLGEN MGLENLARMR KQDIIFAILK QHAKSGEDIF GDGVLEILQD 
GFGFLRSADS SYLAGPDDIY VSPSQIRRFN LRTGDTISGK IRPPKEGERY FALLKVNEVN
FDKPENARNK ILFENLTPLH ANSRLRMERG NGSTEDLTAR VLDLASPIGR GQRGLIVAPP
KAGKTMLLQN IAQSIAYNHP DCVLMVLLID ERPEEVTEMQ RLVKGEVVAS TFDEPASRHV
QVAEMVIEKA KRLVEHKKDV IILLDSITRL ARAYNTVVPA SGKVLTGGVD ANALHRPKRF
FGAARNVEEG GSLTIIATAL IDTGSKMDEV IYEEFKGTGN MELHLSRKIA EKRVFPAIDY
NRSGTRKEEL LTTQEELQKM WILRKIIHPM GEIDAMEFLI NKLAMTKTND DFFEMMKRS
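
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand rather than relying on a library. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits, so the sketch does not handle alternative start codons (this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
print(translate("ATGAATCTTA CCGAATTAAA GAATACGCCG GTTTCTGAGC TGATCACTCT CGGCGAAAAT"))
# MNLTELKNTPVSELITLGEN
```

The output matches the first twenty residues of the protein sequence listed above.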