Gene EcSMS35_1931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1931 
SymbolprfA 
ID6147402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1951001 
End bp1952083 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content53% 
IMG OID641616807 
Productpeptide chain release factor 1 
Protein accessionYP_001743983 
Protein GI170682630 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0216] Protein chain release factor A 
TIGRFAM ID[TIGR00019] peptide chain release factor 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000608431 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.0690317 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCCTT CTATCGTTGC CAAACTGGAA GCCCTGCATG AACGCCATGA AGAAGTTCAG 
GCGTTGCTGG GTGACGCGCA AACTATCGCC GACCAGGAAC GTTTTCGCGC ATTATCACGC
GAATATGCGC AGTTAAGTGA TGTTTCGCGC TGTTTTACCG ACTGGCAACA GGTTCAGGAA
GATATCGAAA CCGCACAGAT GATGCTCGAC GATCCTGAAA TGCGTGAGAT GGCGCAGGAT
GAACTGCGCG AAGCTAAAGA AAAAAGCGAG CAACTGGAAC AGCAATTACA GGTTCTGTTA
CTACCAAAAG ATCCTGATGA CGAACGTAAC GCCTTCCTCG AAGTCCGTGC CGGAACCGGC
GGCGACGAAG CGGCGCTGTT CGCTGGCGAT CTGTTCCGTA TGTACAGCCG TTACGCCGAA
GCCCGCCGCT GGCGTGTAGA AATCATGAGC GCCAGCGAGG GTGAACATGG TGGTTATAAA
GAGATCATCG CCAAAATTAG CGGTGATGGT GTATATGGTC GTTTGAAATT CGAATCTGGC
GGTCATCGCG TGCAGCGTGT TCCTGCTACG GAATCGCAGG GTCGTATTCA TACTTCTGCT
TGTACCGTTG CGGTAATGCC AGAACTGCCT GACGCAGAAC TGCCGGACAT CAACCCAGCA
GATTTGCGCA TTGATACTTT CCGCTCGTCA GGGGCGGGGG GGCAGCACGT TAACACCACC
GATTCGGCAA TTCGTATTAC TCACTTGCCG ACCGGAATTG TTGTTGAATG TCAGGACGAA
CGTTCACAAC ATAAAAACAA AGCTAAAGCA CTTTCTGTAC TCGGTGCTCG TATCCACGCT
GCTGAAATGG CAAAACGGCA ACAGGCCGAA GCGTCTACCC GTCGTAACCT GCTGGGGAGT
GGCGATCGCA GCGACCGTAA CCGTACTTAC AACTTCCCGC AGGGGCGCGT TACCGATCAC
CGCATCAACC TGACGCTCTA CCGCCTGGAT GAAGTGATGG AAGGTAAGCT GGATATGCTA
ATTGAACCGA TTATCCAGGA ACATCAGGCC GACCAACTGG CGGCGTTGTC CGAGCAGGAA
TAA
 
Protein sequence
MKPSIVAKLE ALHERHEEVQ ALLGDAQTIA DQERFRALSR EYAQLSDVSR CFTDWQQVQE 
DIETAQMMLD DPEMREMAQD ELREAKEKSE QLEQQLQVLL LPKDPDDERN AFLEVRAGTG
GDEAALFAGD LFRMYSRYAE ARRWRVEIMS ASEGEHGGYK EIIAKISGDG VYGRLKFESG
GHRVQRVPAT ESQGRIHTSA CTVAVMPELP DAELPDINPA DLRIDTFRSS GAGGQHVNTT
DSAIRITHLP TGIVVECQDE RSQHKNKAKA LSVLGARIHA AEMAKRQQAE ASTRRNLLGS
GDRSDRNRTY NFPQGRVTDH RINLTLYRLD EVMEGKLDML IEPIIQEHQA DQLAALSEQE