Gene SbBS512_E1375 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1375 
SymbolprfA 
ID6269304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1254552 
End bp1255634 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content53% 
IMG OID641725484 
Productpeptide chain release factor 1 
Protein accessionYP_001879994 
Protein GI187734065 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0216] Protein chain release factor A 
TIGRFAM ID[TIGR00019] peptide chain release factor 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000017097 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCCTT CTATCGTTGC CAAACTGGAA GCCCTGCATG AACGCCATGA AGAAGTTCAG 
GCGTTGCTGG GTGACGCGCA AACTATCGCC GACCAGGAAC GTTTTCGCGC ATTATCACGC
GAATATGCGC AGTTAAGTGA TGTTTCGCGC TGTTTTACCG ACTGGCAACA GGTTCAGGAA
GATATCGAAA CCGCACAGAT GATGCTCGAT GATCCTGAAA TGCGTGAGAT GGCGCAGGAT
GAACTGCGCA AAGCTAAAGA AAAAAGCGAG CAACTGGAAC AGCAATTACA GGTTCTGTTA
CTGCCAAAAG ATCCTGATGA CGAACGTAAC GCCTTCCTCG AAGTCCGAGC CGGAACCGGC
GGCGACGAAG CGGCGCTGTT CGCAGGCGAT CTGTTCCGTA TGTACAGCCG TTACGCCGAA
GCCCGCCGCT GGCGGGTAGA AATCATGAGC GCCAGCGAGG GTGAACATGG TGGTTATAAA
GAGATCATCG CCAAAATTAG CGGTGATGGT GTGTATGGTC GTCTGAAATT TGAATCTGGC
GGTCATCGCG TGCAGCGTGT TCCTGCTACG GAATCGCAGG GTCGTATTCA TACTTCTGCT
TGTACCGTTG CGGTAATGCC AGAACTGCCT GACGCAGAAC TGCCGGACAT CAACCCAGCA
GATTTACGCA TTGATACTTT CCGCTCGTCA GGGGCGGGTG GTCAGCACGT TAATACCACC
GATTCGGCAA TTCGTATTAC TCACTTGCCG ACCGGGATTG TTGTTGAATG TCAGGATGAA
CGTTCACAAC ATAAAAACAA AGCGAAGGCG CTTTCAGTGC TCGGTGCTCG CATCCACGCT
GCTGAAATGG CAAAACGCCA ACAGGCCGAA GCGTCTACCC GTCGTAACCT GCTGGGGAGT
GGCGATCGCA GCGACCGTAA CCGTACTTAC AATTTCCCGC AGGGGCGCGT TACCGATCAC
CGCATCAACC TGACGCTCTA CCGCCTGGAT GAAGTGATGG AAGGTAAGCT GGATATGCTG
ATTGAACCGA TTATCCAGGA ACATCAGGCC GACCAACTGG CGGCGTTGTC CGAGCAGGAA
TAA
 
Protein sequence
MKPSIVAKLE ALHERHEEVQ ALLGDAQTIA DQERFRALSR EYAQLSDVSR CFTDWQQVQE 
DIETAQMMLD DPEMREMAQD ELRKAKEKSE QLEQQLQVLL LPKDPDDERN AFLEVRAGTG
GDEAALFAGD LFRMYSRYAE ARRWRVEIMS ASEGEHGGYK EIIAKISGDG VYGRLKFESG
GHRVQRVPAT ESQGRIHTSA CTVAVMPELP DAELPDINPA DLRIDTFRSS GAGGQHVNTT
DSAIRITHLP TGIVVECQDE RSQHKNKAKA LSVLGARIHA AEMAKRQQAE ASTRRNLLGS
GDRSDRNRTY NFPQGRVTDH RINLTLYRLD EVMEGKLDML IEPIIQEHQA DQLAALSEQE