Gene ECH_0597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0597 
SymbolprfA 
ID3927422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp602148 
End bp603227 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content32% 
IMG OID637901719 
Productpeptide chain release factor 1 
Protein accessionYP_507408 
Protein GI88657945 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0216] Protein chain release factor A 
TIGRFAM ID[TIGR00019] peptide chain release factor 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.349785 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTTTG ATAATAGTTT AGAAGAGTTG TCTCAAAAAT TCTACAAACT AAAAAGTATG 
TTAGAAGATC CAAGTCAATT GAGTGTGGAT TCCTTTGTTG CTGCTTCAAA AGAATATTCA
GAATTATTGC CTGTGATATC AGTGATAGAC CAATATAATA TCTTACAAAA AGATATAGCA
GGCTTAGAAG AACTGATAAA TAATCCAGAA ACTGATCATG AATTAAAAAG TCTAGCTAAA
GAAGAATTCT ATGAACGGCA AAAACAATTA CCTAAAGTTA AGCATAAATT AAAATTATCC
TTACTTCCCA AGGATAAAGA TGATGCACGT AATGCTATTT TAGAAATTAG AGCAGGTACA
GGTGGAGAAG AAGCTGCATT ATTTGTGACT GATTTATATA GAATGTATAC AAAATATGCT
GAACAAAAGA ATTGGAAATT TGAACAGATT AACTCATCTT CAACCGGTAT AGGCGGACAT
AAGGAAATAT CATTATGTAT AAGCGGATCT AATGTATTTG CAAGGTTAAA ATTTGAATCT
GGAGTGCATA GAGTACAAAG GGTACCGGAA ACTGAAGCTT CTGGAAGACT TCATACTTCA
GCTGCTACAG TAGCAGTTTT ACCAGAAATT GAAGAAGTAG ATTTAAAGAT AGATGAAAAA
GATTTAAGAA TAGATGTATA TCGTTCAAGC GGTCCAGGAG GACAATCTGT GAATACTACT
GATAGTGCTG TACGTATTAC GCATATACCA AGCGGAATTG TCGTTATACA GCAAGATGAG
AAATCTCAAC ATAAAAATAA AAGTAAAGCT CTTAAGGTAT TAAGAGCAAG GCTTTATAAC
CTAGAAAAAC AAAAAAGAGA TGCAGAAATT TCACAAATGA GAAAAAGTCA GATAGGATCA
GGAGACCGTT CTGAGCGTAT AAGAACTTAC AATTTTCCTC AATCTAGAAT TACAGATCAT
AGGATAAATC TTACATTATA TAGATTAGAT GATATTATGA AAGAAGGAAA TTTGGATGAG
TTTATTGAAG CATTAATAGC CGAAGATGAA GCAAATAAAT TAAAGAACCT GCATATTTGA
 
Protein sequence
MSFDNSLEEL SQKFYKLKSM LEDPSQLSVD SFVAASKEYS ELLPVISVID QYNILQKDIA 
GLEELINNPE TDHELKSLAK EEFYERQKQL PKVKHKLKLS LLPKDKDDAR NAILEIRAGT
GGEEAALFVT DLYRMYTKYA EQKNWKFEQI NSSSTGIGGH KEISLCISGS NVFARLKFES
GVHRVQRVPE TEASGRLHTS AATVAVLPEI EEVDLKIDEK DLRIDVYRSS GPGGQSVNTT
DSAVRITHIP SGIVVIQQDE KSQHKNKSKA LKVLRARLYN LEKQKRDAEI SQMRKSQIGS
GDRSERIRTY NFPQSRITDH RINLTLYRLD DIMKEGNLDE FIEALIAEDE ANKLKNLHI