Gene EcHS_A4609 details

Gene Information

Locus tag: EcHS_A4609
Symbol: prfC
ID: 5594921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4612729
End bp: 4614318
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 54%
IMG OID: 640923703
Product: peptide chain release factor 3
Protein accession: YP_001461140
Protein GI: 157163822
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG4108] Peptide chain release factor RF-3
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR00503] peptide chain release factor 3
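
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are inclusive (an assumption that is consistent with the listed gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4612729
end_bp = 4614318
gene_length_bp = 1590
protein_length_aa = 529

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last one is assumed to be the
# stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # 1590 0 529
```

Under these assumptions the span matches the listed gene length, the length is a whole number of codons, and the expected residue count matches the listed protein length.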


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.00121865
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGTTGT CTCCTTATTT GCAAGAGGTG GCGAAGCGCC GCACTTTTGC CATTATTTCT 
CACCCGGACG CCGGTAAGAC TACCATCACC GAGAAGGTGC TGCTGTTCGG ACAGGCCATT
CAGACCGCCG GTACAGTAAA AGGCCGTGGT TCCAACCAGC ACGCTAAGTC GGACTGGATG
GAGATGGAAA AGCAGCGTGG GATCTCCATT ACTACGTCTG TGATGCAGTT TCCGTATCAC
GATTGCCTGG TTAACCTGCT CGACACCCCG GGGCACGAAG ACTTCTCGGA AGATACCTAT
CGTACCCTGA CGGCGGTGGA CTGCTGCCTG ATGGTTATCG ACGCTGCAAA AGGTGTTGAA
GATCGTACCC GTAAGCTGAT GGAAGTTACC CGTCTGCGCG ACACGCCGAT CCTCACCTTT
ATGAACAAAC TTGACCGTGA TATCCGCGAC CCGATGGAAC TGCTCGATGA AGTTGAGAAC
GAGCTGAAAA TCGGCTGCGC ACCGATCACC TGGCCGATTG GCTGCGGCAA GCTGTTTAAA
GGCGTTTACC ACCTTTATAA AGACGAAACC TATCTCTATC AGAGCGGTAA AGGCCACACG
ATTCAGGAAG TCCGTATTGT TAAAGGGCTG AATAACCCGG ATCTCGACGC TGCGGTTGGT
GAAGATCTGG CACAGCAGCT ACGTGACGAA CTGGAACTGG TGAAAGGCGC GTCTAACGAG
TTCGACAAAG AATTGTTCCT TGCGGGCGAA ATCACTCCGG TATTCTTCGG TACTGCGCTG
GGTAACTTCG GCGTTGATCA TATGCTGGAT GGCCTGGTGG AGTGGGCTCC TGCGCCGATG
CCGCGTCAGA CTGATACCCG TACCGTAGAG GCGAGCGAAG ACAAATTTAC CGGCTTCGTA
TTTAAAATTC AGGCCAACAT GGACCCGAAA CACCGCGACC GCGTGGCGTT TATGCGCGTA
GTATCCGGTA AATATGAAAA AGGCATGAAG CTGCGCCAGG TGCGTACTGC GAAAGATGTG
GTTATCTCCG ACGCGCTGAC CTTTATGGCG GGCGACCGTT CGCACGTTGA AGAAGCGTAT
CCGGGCGATA TCCTCGGCCT GCACAACCAC GGCACTATTC AGATCGGCGA CACCTTTACC
CAGGGTGAGA TGATGAAGTT CACCGGTATT CCGAACTTCG CACCAGAACT GTTCCGTCGT
ATCCGCCTGA AAGATCCGCT GAAGCAAAAA CAGCTGCTCA AAGGGCTGGT ACAGCTTTCC
GAAGAGGGCG CGGTGCAGGT GTTCCGTCCG ATCTCCAACA ACGATTTGAT CGTTGGTGCT
GTTGGTGTGC TGCAGTTTGA TGTGGTGGTA TCGCGCCTGA AGAGCGAATA CAACGTTGAA
GCAGTATATG AATCAGTCAA CGTTGCCACT GCCCGCTGGG TAGAATGTGC GGACGCGAAG
AAATTCGAAG AGTTCAAGCG TAAGAACGAA AGCCAACTGG CGCTTGATGG CGGCGATAAC
CTCGCTTACA TCGCTACCAG CATGGTCAAC CTGCGCCTGG CGCAGGAACG TTATCCGGAC
GTTCAGTTCC ACCAGACCCG CGAGCATTAA
 
Protein sequence
MTLSPYLQEV AKRRTFAIIS HPDAGKTTIT EKVLLFGQAI QTAGTVKGRG SNQHAKSDWM 
EMEKQRGISI TTSVMQFPYH DCLVNLLDTP GHEDFSEDTY RTLTAVDCCL MVIDAAKGVE
DRTRKLMEVT RLRDTPILTF MNKLDRDIRD PMELLDEVEN ELKIGCAPIT WPIGCGKLFK
GVYHLYKDET YLYQSGKGHT IQEVRIVKGL NNPDLDAAVG EDLAQQLRDE LELVKGASNE
FDKELFLAGE ITPVFFGTAL GNFGVDHMLD GLVEWAPAPM PRQTDTRTVE ASEDKFTGFV
FKIQANMDPK HRDRVAFMRV VSGKYEKGMK LRQVRTAKDV VISDALTFMA GDRSHVEEAY
PGDILGLHNH GTIQIGDTFT QGEMMKFTGI PNFAPELFRR IRLKDPLKQK QLLKGLVQLS
EEGAVQVFRP ISNNDLIVGA VGVLQFDVVV SRLKSEYNVE AVYESVNVAT ARWVECADAK
KFEEFKRKNE SQLALDGGDN LAYIATSMVN LRLAQERYPD VQFHQTREH
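
The sequences above are laid out in blocks of 10 characters, six blocks per line. A minimal sketch of how such a layout might be joined into one continuous string, and how a GC fraction could then be computed, is given below. The helper names clean_sequence and gc_fraction are hypothetical, chosen for illustration; only the first line of the gene sequence is used as sample input, so the printed percentage describes that line alone and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as displayed (blocks of 10 bases).
first_line = "ATGACGTTGT CTCCTTATTT GCAAGAGGTG GCGAAGCGCC GCACTTTTGC CATTATTTCT "

seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC percentage of this one line only
```

Passing the full gene sequence text to clean_sequence instead of first_line would give a string whose length can be compared with the listed gene length.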