Gene EcolC_2415 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2415 
SymbolprfA 
ID6066487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2659513 
End bp2660595 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content53% 
IMG OID641601824 
Productpeptide chain release factor 1 
Protein accessionYP_001725376 
Protein GI170020422 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0216] Protein chain release factor A 
TIGRFAM ID[TIGR00019] peptide chain release factor 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00102463 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000030401 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGCCTT CTATCGTTGC CAAACTGGAA GCCCTGCATG AACGCCATGA AGAAGTTCAG 
GCGTTGCTGG GTGACGCGCA AACTATCGCC GACCAGGAAC GTTTTCGCGC ATTATCACGC
GAATATGCGC AGTTAAGTGA TGTTTCGCGC TGTTTTACCG ACTGGCAACA GGTTCAGGAA
GATATCGAAA CCGCACAGAT GATGCTCGAT GATCCTGAAA TGCGTGAGAT GGCGCAGGAT
GAACTGCGCG AAGCTAAAGA AAAAAGCGAG CAACTGGAAC AGCAATTACA GGTTCTGTTA
CTGCCAAAAG ATCCTGATGA CGAACGTAAC GCCTTCCTCG AAGTCCGAGC CGGAACCGGC
GGCGACGAAG CGGCGCTGTT CGCGGGCGAT CTGTTCCGTA TGTACAGCCG TTATGCCGAA
GCCCGCCGCT GGCGGGTAGA AATCATGAGC GCCAGCGAGG GTGAACATGG TGGTTATAAA
GAGATCATCG CCAAAATTAG CGGTGATGGT GTGTATGGTC GTCTGAAATT TGAATCCGGC
GGTCATCGCG TGCAACGTGT TCCTGCTACG GAATCGCAGG GTCGTATTCA TACTTCTGCT
TGTACCGTTG CGGTAATGCC AGAACTGCCT GACGCAGAAC TGCCGGACAT CAACCCAGCA
GATTTACGCA TTGATACTTT CCGCTCGTCA GGGGCGGGTG GTCAGCACGT TAACACCACC
GATTCGGCAA TTCGTATTAC TCACTTGCCG ACCGGGATTG TTGTTGAATG TCAGGACGAA
CGTTCACAAC ATAAAAACAA AGCTAAAGCA CTTTCTGTTC TCGGTGCTCG CATCCACGCT
GCTGAAATGG CAAAACGCCA ACAGGCCGAA GCGTCTACCC GTCGTAACCT GCTGGGGAGT
GGCGATCGCA GCGACCGTAA CCGTACTTAC AACTTCCCGC AGGGGCGCGT TACCGATCAC
CGCATCAACC TGACGCTCTA CCGCCTGGAT GAAGTGATGG AAGGTAAGCT GGATATGCTG
ATTGAACCGA TTATCCAGGA ACATCAGGCC GACCAACTGG CGGCGTTGTC CGAGCAGGAA
TAA
 
Protein sequence
MKPSIVAKLE ALHERHEEVQ ALLGDAQTIA DQERFRALSR EYAQLSDVSR CFTDWQQVQE 
DIETAQMMLD DPEMREMAQD ELREAKEKSE QLEQQLQVLL LPKDPDDERN AFLEVRAGTG
GDEAALFAGD LFRMYSRYAE ARRWRVEIMS ASEGEHGGYK EIIAKISGDG VYGRLKFESG
GHRVQRVPAT ESQGRIHTSA CTVAVMPELP DAELPDINPA DLRIDTFRSS GAGGQHVNTT
DSAIRITHLP TGIVVECQDE RSQHKNKAKA LSVLGARIHA AEMAKRQQAE ASTRRNLLGS
GDRSDRNRTY NFPQGRVTDH RINLTLYRLD EVMEGKLDML IEPIIQEHQA DQLAALSEQE