Gene RPB_1185 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1185 
SymbolhisS 
ID3910120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1359230 
End bp1360810 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content63% 
IMG OID637883079 
Producthistidyl-tRNA synthetase 
Protein accessionYP_484806 
Protein GI86748310 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.445744 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.165943 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAGA AACCCAAGAA ACCACAGAAG TTGCGCGCCC GTCTGCCGCG CGGCCTGACC 
GATCGCGGCC CCGCCGAGAT CGCGGCGACG CGCGCGATGG TGGAAACCAT CCGCGAGGTC
TATGAGCGCT ACGGCTTCGA GCCGGTCGAG ACCCCGGCGT TCGAATACAC CGACGCGCTC
GGCAAGTTCC TGCCCGACCA GGATCGCCCC AACGAGGGCG TGTTCTCGCT GCAGGACGAC
GACGAGCAAT GGATCAGCCT GCGCTACGAC CTGACCGCGC CGCTCGCCCG CTACGTCGCC
GAAAATTTCG ACGCACTGCC GAAGCCCTAT CGCAGCTACC GTTTCGGCTG GGTGTTCCGC
AACGAAAAGC CCGGCCCCGG CCGCTTCCGC CAGTTCATGC AGTTCGACGC CGACACCGTG
GGCTCCGGCT CGCCGGCCGC CGATGCCGAG ATGTGCATGA TGGCGGCGGA CACGATGGAA
GCGTTGGGCA TCCCGCGCGG CAGCTACGTG GTGAAGGTGA ACAACCGCAA GGTGCTGGAT
GGGGTGCTGG AGGCCATTGG CCTTGGCGGG GATGAGAATG CGGGGCGCCG GCTCACGGTG
CTGAGGGCTA TCGATAAGTC GGATAAATTT CCGCCCGAGG AGATCAAGAA GCTTCTGGGG
CCGGGACGTT GGGATGGTGG CGAAGAAGGC AAAGGCGATT TCACAAAAGG CGCGATGCTT
GGTGATGATC AGATTGAGCT GATTCTCAGA GCAACTTCGC CAAGCTTCAT AGCAGGCCGC
TTCAATGCCG ATGGAAGCGG CGGCGTTAGC AATGCTGATA CAGTCGAACT CCTTCGCTCG
ACGGCGGACA ACGAGACTTT AAAGCAAGGA TGCGATGAAC TGTCGGTAAT CTCGGATCTG
CTGGATTCTG GCGGCTATGG TGCAACGTCC ACAAATCCTA ATGTGCGCGT TGTCATCGAT
CCCTCCGTCG TCCGAGGCCT CGAATACTAC ACCGGCCCGG TCTACGAGGT CGAACTGCTG
CTCGAGACCA AGGACGAAAA GGGGCGCCCG GTGCGGTTCG GCTCGGTCGG CGGCGGCGGT
CGTTACGATG GTCTGGTGTC GCGCTTCCGC GGCGAGCCGG TGCCGGCGAC CGGGTTCTCG
ATCGGTGTGT CGCGGCTGCA GGCGGCGCTG ACGATGATCG GCAAGCTTGG GACCCGGCCC
GCGACCGGCC CGGTGGTGGT GACGGTGTTC GACCGCGAGC GGCTCGCCGA CTACCAGAAG
ATGGTGTCGC AGCTCCGCGC CGAGGACATC CGCGCCGAGC TCTATCTCGG CAATCCGAAG
AACATGGGCA ACCAGCTCAA ATACGCCGAC AAGCGCAACT CGCCTTGCGT GATCATCCAG
GGCTCCGATG AGAAGAACGA TCCGGACGGC CCGCAGGTGA TCGTCAAGGA CCTGATCCTC
GGCGCCGAAC TCGCCGCTCT GGACAAGGAT CGCGACGATT ATCTGCAGCG TCAGGCCGAC
GCCCAGCGCA AAGTGCCGCA GCTCGGGATG ATCGACGAAG TGCGGCGGAT TCTGGCGCGG
CACGACATCG ACTGGAATTG A
 
Protein sequence
MAEKPKKPQK LRARLPRGLT DRGPAEIAAT RAMVETIREV YERYGFEPVE TPAFEYTDAL 
GKFLPDQDRP NEGVFSLQDD DEQWISLRYD LTAPLARYVA ENFDALPKPY RSYRFGWVFR
NEKPGPGRFR QFMQFDADTV GSGSPAADAE MCMMAADTME ALGIPRGSYV VKVNNRKVLD
GVLEAIGLGG DENAGRRLTV LRAIDKSDKF PPEEIKKLLG PGRWDGGEEG KGDFTKGAML
GDDQIELILR ATSPSFIAGR FNADGSGGVS NADTVELLRS TADNETLKQG CDELSVISDL
LDSGGYGATS TNPNVRVVID PSVVRGLEYY TGPVYEVELL LETKDEKGRP VRFGSVGGGG
RYDGLVSRFR GEPVPATGFS IGVSRLQAAL TMIGKLGTRP ATGPVVVTVF DRERLADYQK
MVSQLRAEDI RAELYLGNPK NMGNQLKYAD KRNSPCVIIQ GSDEKNDPDG PQVIVKDLIL
GAELAALDKD RDDYLQRQAD AQRKVPQLGM IDEVRRILAR HDIDWN