Gene EcHS_A1316 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1316 
SymbolprfA 
ID5593790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1313451 
End bp1314533 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content53% 
IMG OID640920473 
Productpeptide chain release factor 1 
Protein accessionYP_001458034 
Protein GI157160716 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0216] Protein chain release factor A 
TIGRFAM ID[TIGR00019] peptide chain release factor 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000000610156 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCCTT CTATCGTTGC CAAACTGGAA GCCCTGCATG AACGCCATGA AGAAGTTCAG 
GCGTTGCTGG GTGACGCGCA AACTATCGCC GACCAGGAAC GTTTTCGCGC ATTATCACGC
GAATATGCGC AGTTAAGTGA TGTTTCGCGC TGTTTTACCG ACTGGCAACA GGTTCAGGAA
GATATCGAAA CCGCACAGAT GATGCTCGAT GATCCTGAAA TGCGTGAGAT GGCGCAGGAT
GAACTGCGCG AAGCTAAAGA AAAAAGCGAG CAACTGGAAC AGCAATTACA GGTTCTGTTA
CTGCCAAAAG ATCCTGATGA CGAACGTAAC GCCTTCCTCG AAGTCCGAGC CGGAACCGGC
GGCGACGAAG CGGCGCTGTT CGCGGGCGAT CTGTTCCGTA TGTACAGCCG TTATGCCGAA
GCCCGCCGCT GGCGGGTAGA AATCATGAGC GCCAGCGAGG GTGAACATGG TGGTTATAAA
GAGATCATCG CCAAAATTAG CGGTGATGGT GTGTATGGTC GTCTGAAATT TGAATCCGGC
GGTCATCGCG TGCAACGTGT TCCTGCTACG GAATCGCAGG GTCGTATTCA TACTTCTGCT
TGTACCGTTG CGGTAATGCC AGAACTGCCT GACGCAGAAC TGCCGGACAT CAACCCAGCA
GATTTACGCA TTGATACTTT CCGCTCGTCA GGGGCGGGTG GTCAGCACGT TAACACCACC
GATTCGGCAA TTCGTATTAC TCACTTGCCG ACCGGGATTG TTGTTGAATG TCAGGACGAA
CGTTCACAAC ATAAAAACAA AGCTAAAGCA CTTTCTGTTC TCGGTGCTCG CATCCACGCT
GCTGAAATGG CAAAACGCCA ACAGGCCGAA GCGTCTACCC GTCGTAACCT GCTGGGGAGT
GGCGATCGCA GCGACCGTAA CCGTACTTAC AACTTCCCGC AGGGGCGCGT TACCGATCAC
CGCATCAACC TGACGCTCTA CCGCCTGGAT GAAGTGATGG AAGGTAAGCT GGATATGCTG
ATTGAACCGA TTATCCAGGA ACATCAGGCC GACCAACTGG CGGCGTTGTC CGAGCAGGAA
TAA
 
Protein sequence
MKPSIVAKLE ALHERHEEVQ ALLGDAQTIA DQERFRALSR EYAQLSDVSR CFTDWQQVQE 
DIETAQMMLD DPEMREMAQD ELREAKEKSE QLEQQLQVLL LPKDPDDERN AFLEVRAGTG
GDEAALFAGD LFRMYSRYAE ARRWRVEIMS ASEGEHGGYK EIIAKISGDG VYGRLKFESG
GHRVQRVPAT ESQGRIHTSA CTVAVMPELP DAELPDINPA DLRIDTFRSS GAGGQHVNTT
DSAIRITHLP TGIVVECQDE RSQHKNKAKA LSVLGARIHA AEMAKRQQAE ASTRRNLLGS
GDRSDRNRTY NFPQGRVTDH RINLTLYRLD EVMEGKLDML IEPIIQEHQA DQLAALSEQE