Gene SeHA_C1971 details

Gene Information

Locus tag: SeHA_C1971
Symbol: prfA
ID: 6490858
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 1921139
End bp: 1922221
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 58%
IMG OID: 642742176
Product: peptide chain release factor 1
Protein accession: YP_002045819
Protein GI: 194448370
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0216] Protein chain release factor A
TIGRFAM ID: [TIGR00019] peptide chain release factor 1
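
The start and end coordinates, gene length, and protein length listed above agree with one another, and a short check can confirm this. The sketch below assumes inclusive 1-based coordinates and a final stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1921139, 1922221)
print(length)                  # 1083
print(protein_length(length))  # 360
```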


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00228451
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 76
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCCTT CTATCGTTGC CAAACTGGAA GCCCTGCACG AACGCCATGA GGAAGTTCAG 
GCGTTGCTGG GCGATGCGGG AATTATCGCC GACCAGGACC GCTTTCGCGC ATTGTCGCGC
GAATATGCGC AATTAAGCGA CGTTTCTCGC TGTTTTACGG ACTGGCAACA GGTTCAGGAC
GATATCGAGA CGGCTCAGAT GATGCTCGAC GATCCTGAAA TGCGGGAAAT GGCGCAGGAA
GAACTGCGCG AAGCGAAAGA AAAAAGCGAA CAACTGGAGC AACAGTTACA GGTACTGCTG
CTGCCGAAAG ATCCGGACGA TGAACGAAAC GCGTTCCTTG AGGTTCGCGC CGGCACTGGC
GGCGACGAAG CCGCGCTGTT TGCCGGCGAT CTGTTCCGCA TGTACAGCCG TTATGCCGAA
GCGCGCCGCT GGCGCGTGGA GATCATGAGC ATGAGCGAAG GCGAGCATGG CGGTTATAAA
GAGATCATCG CCAAAATCAG CGGCGACGGC GTGTATGGCC GACTGAAATT TGAGTCCGGC
GGACACCGCG TACAGCGTGT TCCGGCGACC GAGTCGCAGG GGCGTATCCA TACCTCCGCC
TGTACCGTCG CCGTGATGCC GGAGCTGCCG GAAGCCGAGC TGCCGGATAT TAACCCGGCG
GATCTGCGCA TTGATACGTT TCGTTCTTCC GGCGCGGGCG GTCAGCACGT TAACACCACC
GACTCCGCTA TCCGAATTAC CCACTTGCCG ACCGGCATCG TGGTGGAATG CCAGGACGAG
CGTTCGCAGC ATAAAAACAA AGCGAAAGCG CTCTCGGTGC TCGGGGCGCG CATTCACGCC
GCCGAAACGG CAAAACGCCA GCAGGCCGAG GCGTCAACGC GCCGCAACCT GCTTGGCAGC
GGCGATCGCA GCGATCGTAA CCGGACCTAT AATTTCCCGC AGGGGCGCGT GACCGATCAT
CGTATTAATC TGACGTTATA TCGCCTTGAT GAAACGATGG AAGGTAAGCT GGATATGCTG
ATTGAGCCGA TTGTTCAGGA ACACCAGGCT GACCTGTTAG CCGCCTTATC CGAGCAGGAA
TAA
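
The GC content reported in the gene information could be recomputed from a sequence laid out like the one above. This is a minimal sketch that assumes the sequence is plain text in whitespace-separated blocks of 10 bases. The helper name `gc_percent` is hypothetical, and the sketch is not necessarily how the database derived its 58% figure.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First block of the gene sequence: 4 of its 10 bases are G or C.
print(gc_percent("ATGAAGCCTT"))  # 40.0
```

Passing the full 1083 bp sequence as a single string gives the value for the whole gene.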
 
Protein sequence
MKPSIVAKLE ALHERHEEVQ ALLGDAGIIA DQDRFRALSR EYAQLSDVSR CFTDWQQVQD 
DIETAQMMLD DPEMREMAQE ELREAKEKSE QLEQQLQVLL LPKDPDDERN AFLEVRAGTG
GDEAALFAGD LFRMYSRYAE ARRWRVEIMS MSEGEHGGYK EIIAKISGDG VYGRLKFESG
GHRVQRVPAT ESQGRIHTSA CTVAVMPELP EAELPDINPA DLRIDTFRSS GAGGQHVNTT
DSAIRITHLP TGIVVECQDE RSQHKNKAKA LSVLGARIHA AETAKRQQAE ASTRRNLLGS
GDRSDRNRTY NFPQGRVTDH RINLTLYRLD ETMEGKLDML IEPIVQEHQA DLLAALSEQE
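
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below assumes the standard codon assignments, which table 11 shares with table 1 (the two differ in their permitted start codons). Because this gene begins with ATG, the sketch does not handle alternative start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAAGCCTT CTATCGTTGC CAAACTGGAA"))  # MKPSIVAKLE
```

A sequence containing ambiguity codes such as N would raise a `KeyError` here; a fuller implementation would need to decide how to report those codons.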