Gene ECH74115_1692 details


Gene Information

Locus tag: ECH74115_1692
Symbol: prfA
ID: 6971675
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1632390
End bp: 1633472
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 53%
IMG OID: 643385651
Product: peptide chain release factor 1
Protein accession: YP_002270145
Protein GI: 209398404
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0216] Protein chain release factor A
TIGRFAM ID: [TIGR00019] peptide chain release factor 1
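
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not state this) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only and do not come from the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1632390
END_BP = 1633472
GENE_LENGTH_BP = 1083
PROTEIN_LENGTH_AA = 360


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS that ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span is 1633472 - 1632390 + 1 = 1083 bp, and 1083 / 3 - 1 = 360 aa, which agrees with the listed Gene Length and Protein Length.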


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0012914
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCCTT CTATCGTTGC CAAACTGGAA GCCCTGCATG AACGCCATGA AGAAGTTCAG 
GCGTTGCTGG GTGACGCGCA AACTATCGCC GACCAGGAAC GTTTTCGCGC ATTATCACGC
GAATATGCGC AGTTAAGTGA TGTTTCGCGC TGTTTTACCG ACTGGCAACA GGTTCAGGAA
GATATCGAAA CCGCACAGAT GATGCTCGAT GATCCTGAAA TGCGTGAGAT GGCGCAGGAT
GAACTGCGCG AAGCTAAAGA AAAAAGCGAG CAACTGGAAC AGCAATTACA GGTTCTGTTA
CTGCCAAAAG ATCCTGATGA CGAACGTAAC GCCTTCCTCG AAGTCCGAGC CGGAACCGGC
GGCGACGAAG CAGCGCTGTT CGCAGGCGAT CTGTTCCGTA TGTACAGCCG TTACGCCGAA
GCCCGCCGCT GGCGGGTAGA AATCATGAGC GCCAGCGAGG GTGAACATGG TGGTTATAAA
GAGATCATCG CCAAAATTAG CGGTGATGGT GTGTATGGTC GTCTGAAATT TGAATCTGGC
GGTCATCGCG TGCAGCGTGT TCCTGCTACG GAATCGCAGG GTCGTATTCA TACTTCTGCT
TGTACCGTTG CGGTAATGCC AGAACTGCCT GACGCAGAAC TGCCGGACAT CAACCCAGCA
GATTTACGCA TTGATACTTT CCGCTCGTCA GGGGCGGGTG GTCAGCACGT TAACACCACC
GATTCGGCAA TTCGTATTAC TCACTTGCCG ACCGGGATTG TTGTTGAATG TCAGGACGAA
CGTTCACAAC ATAAAAACAA AGCTAAAGCA CTTTCTGTAC TCGGTGCTCG CATCCACGCT
GCTGAAATGG CAAAACGGCA ACAGGCCGAA GCGTCTACCC GTCGTAACCT GCTGGGGAGT
GGCGATCGCA GCGACCGTAA CCGTACTTAC AACTTCCCGC AGGGGCGCGT TACCGATCAC
CGCATCAACC TGACGCTCTA CCGCCTGGAT GAAGTGATGG AAGGTAAGCT GGATATGCTG
ATTGAACCGA TTATCCAGGA ACATCAGGCC GACCAACTGG CGGCGTTGTC CGAGCAGGAA
TAA
 
Protein sequence
MKPSIVAKLE ALHERHEEVQ ALLGDAQTIA DQERFRALSR EYAQLSDVSR CFTDWQQVQE 
DIETAQMMLD DPEMREMAQD ELREAKEKSE QLEQQLQVLL LPKDPDDERN AFLEVRAGTG
GDEAALFAGD LFRMYSRYAE ARRWRVEIMS ASEGEHGGYK EIIAKISGDG VYGRLKFESG
GHRVQRVPAT ESQGRIHTSA CTVAVMPELP DAELPDINPA DLRIDTFRSS GAGGQHVNTT
DSAIRITHLP TGIVVECQDE RSQHKNKAKA LSVLGARIHA AEMAKRQQAE ASTRRNLLGS
GDRSDRNRTY NFPQGRVTDH RINLTLYRLD EVMEGKLDML IEPIIQEHQA DQLAALSEQE
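
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the database's own pipeline: it builds the standard codon assignments (which translation table 11 shares for internal codons), ignores the alternative start codons that table 11 permits because this gene begins with ATG, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_content` are hypothetical. Only the first 60 bases of the gene sequence are included here; the full sequence can be pasted in the same way.

```python
from itertools import product

# Standard codon assignments in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First line of the gene sequence above, kept in its 10-base blocks.
FIRST_60_NT = (
    "ATGAAGCCTT CTATCGTTGC CAAACTGGAA "
    "GCCCTGCATG AACGCCATGA AGAAGTTCAG"
)


def clean(seq: str) -> str:
    """Remove whitespace and upper-case a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    nt = clean(seq)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(nt) - len(nt) % 3, 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    nt = clean(seq)
    return (nt.count("G") + nt.count("C")) / len(nt)


if __name__ == "__main__":
    print(translate(FIRST_60_NT))
    print(round(gc_content(FIRST_60_NT), 2))
```

Translating the first 60 bases gives MKPSIVAKLEALHERHEEVQ, which matches the first two blocks of the protein sequence above. Running `gc_content` on the full gene sequence is one way to compare against the listed GC content field.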