Gene EcSMS35_4922 details

Gene Information

Locus tag: EcSMS35_4922
Symbol: prfC
ID: 6147399
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 5037780
End bp: 5039369
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 54%
IMG OID: 641619725
Product: peptide chain release factor 3
Protein accession: YP_001746829
Protein GI: 170683721
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG4108] Peptide chain release factor RF-3
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain; [TIGR00503] peptide chain release factor 3
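
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(5037780, 5039369)
print(length)                     # 1590
print(protein_length_aa(length))  # 529
```

Both results agree with the Gene Length and Protein Length fields of the record.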


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.55632
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTTGT CTCCTTATTT GCAAGAGGTG GCGAAGCGCC GCACTTTTGC CATTATTTCT 
CACCCGGACG CCGGTAAAAC TACCATCACC GAGAAGGTGC TGCTGTTCGG ACAGGCCATT
CAGACCGCCG GTACAGTAAA AGGCCGTGGT TCCAACCAGC ACGCTAAGTC GGACTGGATG
GAGATGGAAA AGCAGCGTGG GATCTCCATT ACTACTTCTG TGATGCAGTT TCCGTATCAC
GATTGCCTGG TTAACCTGCT CGACACCCCG GGGCACGAAG ACTTCTCGGA AGATACCTAT
CGTACCCTGA CGGCGGTGGA CTGCTGCCTG ATGGTTATCG ACGCTGCAAA AGGTGTTGAA
GATCGTACCC GTAAGCTGAT GGAAGTTACC CGTCTGCGCG ATACGCCGAT CCTCACCTTT
ATGAACAAAC TTGACCGTGA TATCCGCGAC CCGATGGAAC TGCTCGATGA AGTTGAGAAC
GAGCTGAAAA TCGGCTGCGC ACCGATCACC TGGCCGATTG GCTGCGGCAA GCTGTTTAAA
GGCGTTTACC ACCTTTATAA AGATGAAACC TATCTCTATC AGAGCGGTAA AGGCCACACC
ATTCAGGAAG TCCGCATTGT TAAAGGGCTG AATAACCCGG ATCTCGACGC TGCGGTTGGT
GAAGATCTGG CACAGCAACT GCGTGACGAA CTGGAACTGG TGAAAGGCGC GTCTAACGAG
TTCGACAAAG AATTGTTCCT TGCGGGCGAA ATTACTCCAG TGTTCTTTGG TACTGCGCTG
GGTAACTTCG GCGTTGATCA TATGCTGGAT GGCCTGGTGG AGTGGGCCCC AGCGCCGATG
CCGCGTCAGA CTGATACCCG TACCGTAGAG GCGAGCGAAG ATAAATTTAC CGGCTTCGTA
TTTAAAATTC AGGCCAACAT GGACCCGAAA CACCGCGACC GCGTGGCGTT TATGCGTGTG
GTGTCCGGTA AATATGAAAA AGGCATGAAG CTGCGCCAGG TGCGTACTGC GAAAGATGTG
GTGATCTCCG ACGCGCTGAC CTTTATGGCG GGTGACCGTT CGCACGTTGA AGAAGCGTAT
CCTGGCGATA TCCTTGGTCT GCACAACCAC GGCACCATTC AGATCGGCGA CACCTTTACC
CAGGGTGAGA TGATGAAGTT CACCGGTATT CCGAACTTCG CGCCAGAACT GTTCCGTCGT
ATCCGCCTGA AAGATCCGCT GAAGCAAAAA CAGCTGCTCA AAGGGCTGGT ACAGCTTTCC
GAAGAGGGCG CGGTGCAGGT GTTCCGTCCG ATCTCCAACA ACGACCTGAT CGTTGGTGCA
GTTGGTGTGC TGCAGTTTGA TGTGGTGGTA GCGCGCCTGA AGAGCGAATA CAACGTTGAA
GCAGTGTATG AGTCAGTCAA CGTTGCCACT GCCCGCTGGG TAGAATGTGC GGACGCGAAG
AAATTCGAAG AGTTCAAGCG TAAGAACGAA AGCCAACTGG CGCTTGATGG CGGCGATAAC
CTCGCTTACA TCGCTACCAG CATGGTCAAC CTGCGCCTGG CGCAGGAACG TTATCCGGAC
GTTCAGTTCC ACCAGACCCG CGAGCATTAA
 
Protein sequence
MTLSPYLQEV AKRRTFAIIS HPDAGKTTIT EKVLLFGQAI QTAGTVKGRG SNQHAKSDWM 
EMEKQRGISI TTSVMQFPYH DCLVNLLDTP GHEDFSEDTY RTLTAVDCCL MVIDAAKGVE
DRTRKLMEVT RLRDTPILTF MNKLDRDIRD PMELLDEVEN ELKIGCAPIT WPIGCGKLFK
GVYHLYKDET YLYQSGKGHT IQEVRIVKGL NNPDLDAAVG EDLAQQLRDE LELVKGASNE
FDKELFLAGE ITPVFFGTAL GNFGVDHMLD GLVEWAPAPM PRQTDTRTVE ASEDKFTGFV
FKIQANMDPK HRDRVAFMRV VSGKYEKGMK LRQVRTAKDV VISDALTFMA GDRSHVEEAY
PGDILGLHNH GTIQIGDTFT QGEMMKFTGI PNFAPELFRR IRLKDPLKQK QLLKGLVQLS
EEGAVQVFRP ISNNDLIVGA VGVLQFDVVV ARLKSEYNVE AVYESVNVAT ARWVECADAK
KFEEFKRKNE SQLALDGGDN LAYIATSMVN LRLAQERYPD VQFHQTREH