Gene RPC_0893 details


Gene Information

Locus tag: RPC_0893
Symbol: hisS
ID: 3969773
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 988691
End bp: 990250
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 63%
IMG OID: 637924009
Product: histidyl-tRNA synthetase
Protein accession: YP_530782
Protein GI: 90422412
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
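
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that does not contribute an amino acid; neither assumption is stated in the record itself. The variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 988691
end_bp = 990250
gene_length_bp = 1560
protein_length_aa = 519

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)  # 1560 520 519
```

Under these assumptions the span matches the listed gene length, and the codon count minus one matches the listed protein length.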


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.200439
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGAAA AACCCAAAAA ATCCCAGAAA CTCCGCGCCC GGCTGCCGCG CGGCCTCGCC 
GATCGCGGCC CCGCCGAACT CGCCGCGACG AGGGCGATGG TCGAGACCAT CCGCGCGGTC
TATGAGCGCT ACGGCTTCGA GCCGGTGGAG ACCCCGGCGT TCGAATTCAC CGACGCGCTC
GGCAAGTTCC TGCCCGACCA GGACCGCCCC AACGAGGGCG TGTTCTCGTT CCAGGACGAC
GACGAGCAAT GGATTTCCTT GCGCTATGAC CTCACCGCGC CGCTGGCGCG CTATGTGGCG
GAGAATTTCG ACTCTTTGCC GAAGCCGTAT CGCAGCTACC GCAACGGCTA CGTCTACCGC
AACGAGAAGC CCGGCCCCGG CCGCTTCCGG CAATTCATGC AGTTCGACGC CGACACGGTC
GGCAGCGCTT CGCCCGCCGC CGACGCCGAG ATGTGCATGA TGGCTGCGGA TGCGATGGAG
GCCTTGGGCA TCCCGCGCGG GTCGTATGTG GTGAAGGTGA ACAACCGCAA GGTGCTCGAC
GGGGTGATGG AGTCGATCGG TCTCGGCGGC GAGGAGAACG CGGGTCGCAG GCTCACGGTG
CTCAGGGCGA TCGATAAGCT GGATCGGCTT GGTATCGAAG GGGTAAAGCT CCTGTTGGGG
GAGGGCCGTT GGGACGGGGG CGAGCAAGGT AAAGGCGATT TCACAATTGG CGCTCAGCTC
TCCCCGGAAA CCTCCACTCC AATCTTGAAT TATCTAGATC TCGGTATTCG TGTTGCCCGT
GATCGAGTCG AATCACCTAA TCGCGAACTC GGCATAGTAG GGTATTTGGA GCAGATAGTA
AGCGGCTCCG AAACTGGAGC TCAGGGAACG ACGGAATTAG CTCAAATCGT CCGTCTAGTT
GAAGCAGCCG GATACGATGA TGGTCGCATC CGCATCGACC CTTCGGTCGT CCGCGGCCTC
GAATACTACA CCGGCCCAGT CTACGAAGTG GAATTGCTGC TCGACACCAA GGACGAAAAA
GGCCGCCCGG TCCGCTTTGG CTCAGTCGGC GGCGGCGGGC GCTATGATGG ATTGGTGTCG
CGATTCCGCG GCGAGCCGGT GCCGGCCACC GGCTTCTCGA TCGGCGTGTC GCGGCTGCAG
GCGGCTCTCA CGCTGCTCGG CAAGCTCGAC ACCAGGCCGC AGGCCGGCCC CGTGGTGGTC
ACGGTGTTCG ACCGCGACCG CGTCGCGGAC TATCAGAAGA TGGTGGCGCG CCTGCGCGCC
GAAAACATTC GCGCCGAACT CTATCTCGGC AATCCGAAGA ACATGGGCAA CCAGCTGAAA
TACGCCGACA AGCGCAATTC GCCTTGCGTG ATCATCCAGG GCTCGGACGA GAAGAACGAT
CCGGACGGCG CGCAGATCAT CGTCAAGGAC CTTGTGCTCG GCGCCGAATT GGCGTCGCTG
GAAAAGGACC GCGAGGAATA TCTGCAGAAA CAGGCCGAGG CGCAGCGCAA GGTGCCGGAA
GCCGACCTCG TCGACGAGGT CCGCCGCATC CTCGCCAAGC ATAGCGTGCG CTGGAGCTGA
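
The gene sequence is laid out in groups of ten bases, so the whitespace has to be removed before any computation. As a minimal sketch, the helpers below (`clean_sequence` and `gc_fraction`, both hypothetical names) strip the layout and compute the fraction of G and C bases. The sample is only the first line of the sequence, so its value is not expected to equal the GC content reported for the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-base groups."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used here as a small sample.
sample = clean_sequence(
    "ATGGCTGAAA AACCCAAAAA ATCCCAGAAA CTCCGCGCCC GGCTGCCGCG CGGCCTCGCC"
)
print(len(sample), round(gc_fraction(sample), 2))
```

Passing the full sequence block to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the GC content field.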
 
Protein sequence
MAEKPKKSQK LRARLPRGLA DRGPAELAAT RAMVETIRAV YERYGFEPVE TPAFEFTDAL 
GKFLPDQDRP NEGVFSFQDD DEQWISLRYD LTAPLARYVA ENFDSLPKPY RSYRNGYVYR
NEKPGPGRFR QFMQFDADTV GSASPAADAE MCMMAADAME ALGIPRGSYV VKVNNRKVLD
GVMESIGLGG EENAGRRLTV LRAIDKLDRL GIEGVKLLLG EGRWDGGEQG KGDFTIGAQL
SPETSTPILN YLDLGIRVAR DRVESPNREL GIVGYLEQIV SGSETGAQGT TELAQIVRLV
EAAGYDDGRI RIDPSVVRGL EYYTGPVYEV ELLLDTKDEK GRPVRFGSVG GGGRYDGLVS
RFRGEPVPAT GFSIGVSRLQ AALTLLGKLD TRPQAGPVVV TVFDRDRVAD YQKMVARLRA
ENIRAELYLG NPKNMGNQLK YADKRNSPCV IIQGSDEKND PDGAQIIVKD LVLGAELASL
EKDREEYLQK QAEAQRKVPE ADLVDEVRRI LAKHSVRWS
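
The protein sequence can be related back to the gene sequence by translating codons. The sketch below builds a codon table from the NCBI-style amino acid string in TCAG base order; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons. The function name `translate` is illustrative, and the code assumes the input contains only A, C, G and T (an ambiguous base would raise a `KeyError`).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered with the third base varying fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block: str) -> str:
    """Translate a nucleotide block, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGGCTGAAA AACCCAAAAA ATCCCAGAAA CTCCGCGCCC GGCTGCCGCG CGGCCTCGCC"
print(translate(first_line))  # MAEKPKKSQKLRARLPRGLA
```

The 20 residues produced from the first 60 bases agree with the first two groups of the protein sequence above.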