Gene RPD_1289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1289 
SymbolhisS 
ID4021766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1452863 
End bp1454458 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content62% 
IMG OID637961482 
Producthistidyl-tRNA synthetase 
Protein accessionYP_568428 
Protein GI91975769 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.219331 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAAA AACCGAAGAA AACGCAGAAA CTCCGCGCCC GGCTGCCGCG CGGGCTGGCC 
GACCGCGGCC CCGCCGAGAT CGCGGCGACG CGGTCGATGG TCGAGATCAT CCGCGAGGTC
TATGAGCGCT ACGGCTTCGA GCCGGTCGAG ACCCCGGCGT TCGAATACAC CGACGCCCTC
GGCAAGTTCC TGCCCGACCA GGACCGCCCC AACGAGGGCG TTTTCTCGCT GCAGGACGAC
GACGAGCAGT GGATCAGCCT GCGCTACGAC CTGACCGCGC CGCTCGCCCG CTACGTCGCG
GAGAATTTTG ATGCACTGCC GAAGCCGTAT CGCAGCTACC GCTTCGGCTG GGTGTTCCGC
AACGAAAAGC CCGGTCCCGG CCGCTTCCGC CAGTTCATGC AGTTCGACGC CGACACCGTG
GGCGCGCCGA CCGCCGCCGC CGACGCCGAG ATGTGCATGA TGGCGGCGGA CACCATGGAA
GCGCTGGGCA TTCCGCGCGG CAGCTATGTG GTGAAGGTGA ATAACCGCAA GGTGCTTGAT
GGCGTGCTGG AGGCCATTGG GCTTGGCGGC GAAGAAAATG CTGGTCGCAG GCTGACCGTT
CTGAGGGCCA TCGATAAGCT GGACAAGTTT CCGATCACGG AAATCGAGAA GCTACTCGGA
AACGGTCGGT GGGATGGTGG TGAGGAAGGC AAAGGTGACT TCACTAAAGG CGCTGGCCTG
ACTGAAGAGC AGATGCTGCG CATCTTCAAT TATGTTTCGT TTGGGGCTGT GCTTTCTACC
TCGCCAGGTG ACTCAAATTT TGCCGGCTCG ATACAGAATC GCAATCCGAG CGAACTGCGT
CGGTCGAACG AAGAGGTGGT ATCTGGACTC AAAGACTCTT TCGGGGACTC GCCGATCGCA
CAGCAAGGGC TAGATGAGCT TGGCGAGATA GCTCGGCTTG CGACTGCGGG AGGATATGAT
CAGATGAGAA TTCTCATCGA CCCTTCCGTC GTCCGCGGCC TCGAATACTA CACCGGTCCG
GTCTACGAGG TCGAACTGCT GCTCGACACC AAGGACGACA AGGGCCGACC GATTCGGTTC
GGCTCGGTCG GCGGCGGCGG GCGTTATGAT GGTCTGGTGT CGCGTTTCCG CGGCGAGCCG
GTGCCGGCGA CCGGGTTCTC GATCGGCGTG TCGCGGCTGC AGGCGGCGCT GGAGATCGTC
AACCGGGACA AGCCGAAACA GCCGAGCTAC GGCCCCGTCG TCGTCACCGT GTTCGGCGGC
GATATCGCCG GCTATCAAAA AATGGTGTCG ACGCTGCGCC AGGCCGGCAT TCGCGCCGAA
TTGTATCTCG GCAATCCGAA GCACTCGCTC GGCCAGCAGA TGAAATACGC CGACAAGCGC
AACTCGCCCT GCGCCATCAT TCAGGGCTCC GACGAGAAAG AGCGTGGCGA AGTTCAGATC
AAGGACCTGA TCCTCGGTGC CGAACTGGCG TCGCTGGAGA AGGATCGTGA GGAGTATCTG
AAGAAACAGG CCGATGCGCA GTTCTCCTGC GGCGTGGATG ACCTCGTCGC CAAGGTCCGC
GATGTGCTGG CGCGGCACAG CGTGAAGTGG AGTTAG
 
Protein sequence
MAEKPKKTQK LRARLPRGLA DRGPAEIAAT RSMVEIIREV YERYGFEPVE TPAFEYTDAL 
GKFLPDQDRP NEGVFSLQDD DEQWISLRYD LTAPLARYVA ENFDALPKPY RSYRFGWVFR
NEKPGPGRFR QFMQFDADTV GAPTAAADAE MCMMAADTME ALGIPRGSYV VKVNNRKVLD
GVLEAIGLGG EENAGRRLTV LRAIDKLDKF PITEIEKLLG NGRWDGGEEG KGDFTKGAGL
TEEQMLRIFN YVSFGAVLST SPGDSNFAGS IQNRNPSELR RSNEEVVSGL KDSFGDSPIA
QQGLDELGEI ARLATAGGYD QMRILIDPSV VRGLEYYTGP VYEVELLLDT KDDKGRPIRF
GSVGGGGRYD GLVSRFRGEP VPATGFSIGV SRLQAALEIV NRDKPKQPSY GPVVVTVFGG
DIAGYQKMVS TLRQAGIRAE LYLGNPKHSL GQQMKYADKR NSPCAIIQGS DEKERGEVQI
KDLILGAELA SLEKDREEYL KKQADAQFSC GVDDLVAKVR DVLARHSVKW S