Gene EcolC_3682 details

Gene Information

Locus tag: EcolC_3682
Symbol: prfC
ID: 6067181
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4031972
End bp: 4033561
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 54%
IMG OID: 641603097
Product: peptide chain release factor 3
Protein accession: YP_001726620
Protein GI: 170021666
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG4108] Peptide chain release factor RF-3
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR00503] peptide chain release factor 3
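
The coordinates and lengths listed for this gene agree with each other, which a little arithmetic confirms. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4031972
end_bp = 4033561

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be the stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values match the 1590 bp gene length and 529 aa protein length reported in the record.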


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.297475
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00075439
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGACGTTGT CTCCTTATTT GCAAGAGGTG GCGAAGCGCC GCACTTTTGC CATTATTTCT 
CACCCGGACG CCGGTAAGAC TACCATCACC GAGAAGGTGC TGCTGTTCGG ACAGGCCATT
CAGACCGCCG GTACAGTAAA AGGCCGTGGC TCTAACCAGC ACGCTAAGTC GGACTGGATG
GAGATGGAAA AGCAGCGTGG GATCTCCATT ACTACGTCTG TGATGCAGTT TCCGTATCAC
GATTGCCTGG TTAACCTGCT CGACACCCCG GGGCACGAAG ACTTCTCGGA AGATACCTAT
CGTACCCTGA CGGCGGTGGA CTGCTGTCTG ATGGTTATCG ACGCCGCAAA AGGTGTTGAA
GATCGTACCC GCAAGCTGAT GGAAGTTACC CGTCTGCGCG ACACGCCGAT CCTCACCTTT
ATGAACAAAC TTGACCGTGA TATCCGCGAC CCGATGGAAC TGCTCGATGA AGTTGAGAAC
GAGCTGAAAA TCGGCTGCGC ACCGATCACC TGGCCGATCG GCTGCGGCAA GCTGTTTAAA
GGCGTTTACC ACCTTTATAA AGACGAAACC TATCTCTATC AGAGCGGTAA AGGCCACACC
ATTCAGGAAG TCCGCATTGT TAAAGGGCTG AATAACCCGG ATCTCGATGC AGCGGTCGGT
GAAGATCTGG CACAGCAGCT GCGTGACGAA CTGGAACTGG TGAAAGGCGC GTCTAATGAG
TTCGACAAAG AGCTGTTCCT TGCGGGCGAA ATCACTCCGG TATTCTTCGG TACTGCGCTG
GGTAACTTCG GTGTCGATCA TATGCTGGAT GGCCTGGTGG AGTGGGCTCC TGCGCCGATG
CCGCGTCAGA CTGATACCCG TACCGTAGAG GCGAGCGAAG ACAAATTTAC CGGCTTCGTA
TTTAAAATTC AGGCCAACAT GGACCCGAAA CACCGCGACC GCGTGGCGTT TATGCGCGTG
GTATCCGGTA AATATGAAAA AGGCATGAAG CTGCGCCAGG TGCGCACTGC GAAAGATGTG
GTGATCTCCG ACGCGCTGAC CTTTATGGCA GGTGACCGTT CGCACGTTGA AGAAGCATAT
CCGGGCGATA TCCTCGGCCT GCACAACCAC GGCACCATTC AGATTGGCGA CACCTTTACC
CAGGGTGAGA TGATGAAGTT CACCGGTATT CCGAACTTCG CGCCAGAACT GTTCCGTCGC
ATCCGCCTGA AAGATCCGCT GAAGCAAAAA CAGCTGCTCA AAGGGCTGGT ACAGCTTTCC
GAAGAGGGCG CGGTGCAGGT GTTCCGTCCA ATCTCCAACA ACGATCTGAT CGTTGGTGCG
GTTGGTGTGC TGCAGTTTGA TGTGGTGGTA TCGCGCCTGA AGAGCGAATA CAACGTTGAA
GCAGTATATG AATCAGTCAA CGTTGCCACC GCCCGCTGGG TAGAATGTGC AGACGCGAAG
AAATTCGAAG AGTTCAAGCG TAAGAACGAA AGCCAACTGG CGCTTGATGG CGGCGATAAC
CTCGCTTACA TCGCTACCAG CATGGTCAAC CTGCGCCTGG CGCAGGAACG TTATCCGGAC
GTTCAGTTCC ACCAGACCCG CGAGCATTAA
 
Protein sequence
MTLSPYLQEV AKRRTFAIIS HPDAGKTTIT EKVLLFGQAI QTAGTVKGRG SNQHAKSDWM 
EMEKQRGISI TTSVMQFPYH DCLVNLLDTP GHEDFSEDTY RTLTAVDCCL MVIDAAKGVE
DRTRKLMEVT RLRDTPILTF MNKLDRDIRD PMELLDEVEN ELKIGCAPIT WPIGCGKLFK
GVYHLYKDET YLYQSGKGHT IQEVRIVKGL NNPDLDAAVG EDLAQQLRDE LELVKGASNE
FDKELFLAGE ITPVFFGTAL GNFGVDHMLD GLVEWAPAPM PRQTDTRTVE ASEDKFTGFV
FKIQANMDPK HRDRVAFMRV VSGKYEKGMK LRQVRTAKDV VISDALTFMA GDRSHVEEAY
PGDILGLHNH GTIQIGDTFT QGEMMKFTGI PNFAPELFRR IRLKDPLKQK QLLKGLVQLS
EEGAVQVFRP ISNNDLIVGA VGVLQFDVVV SRLKSEYNVE AVYESVNVAT ARWVECADAK
KFEEFKRKNE SQLALDGGDN LAYIATSMVN LRLAQERYPD VQFHQTREH
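
The sequences above are printed in space-separated blocks of ten characters, so the spaces must be removed before any analysis. The sketch below, in Python, shows one way to do that and to spot-check the gene sequence against the protein sequence. It uses only the first and last rows of the gene sequence as copied from this record. The helper names `clean_sequence` and `translate_prefix` are hypothetical, and the codon table is deliberately partial: it holds only the standard-code assignments needed for the first ten codons, not the whole of translation table 11.

```python
# First and last rows of the gene sequence, copied from the record.
first_row = "ATGACGTTGT CTCCTTATTT GCAAGAGGTG GCGAAGCGCC GCACTTTTGC CATTATTTCT"
last_row = "GTTCAGTTCC ACCAGACCCG CGAGCATTAA"

# Partial codon table (illustrative): only the codons in the first ten triplets.
PARTIAL_CODON_TABLE = {
    "ATG": "M", "ACG": "T", "TTG": "L", "TCT": "S", "CCT": "P",
    "TAT": "Y", "CAA": "Q", "GAG": "E", "GTG": "V",
}

STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text):
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(text.split()).upper()


def translate_prefix(dna, codon_table, n_codons):
    """Translate the first n_codons triplets of dna using codon_table."""
    codons = [dna[i:i + 3] for i in range(0, 3 * n_codons, 3)]
    return "".join(codon_table[codon] for codon in codons)


head = clean_sequence(first_row)
tail = clean_sequence(last_row)

starts_with_atg = head.startswith("ATG")
ends_with_stop = tail[-3:] in STOP_CODONS
first_ten = translate_prefix(head, PARTIAL_CODON_TABLE, 10)

print(starts_with_atg, ends_with_stop, first_ten)
```

The translated prefix can be compared with the first block of the protein sequence. A full translation would need the complete codon table for translation table 11, which this sketch does not include.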