Gene Information |
Locus tag | RPD_1289 |
Symbol | hisS |
ID | 4021766 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1452863 |
End bp | 1454458 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637961482 |
Product | histidyl-tRNA synthetase |
Protein accession | YP_568428 |
Protein GI | 91975769 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0124] Histidyl-tRNA synthetase |
TIGRFAM ID | [TIGR00442] histidyl-tRNA synthetase |
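The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends with a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1452863, 1454458)   # Start bp / End bp from the record
    print(length, protein_length(length))
```

With the values in this record, the sketch yields 1596 bp and 531 aa, matching the Gene Length and Protein Length fields.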
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.219331 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCTGAAA AACCGAAGAA AACGCAGAAA CTCCGCGCCC GGCTGCCGCG CGGGCTGGCC GACCGCGGCC CCGCCGAGAT CGCGGCGACG CGGTCGATGG TCGAGATCAT CCGCGAGGTC TATGAGCGCT ACGGCTTCGA GCCGGTCGAG ACCCCGGCGT TCGAATACAC CGACGCCCTC GGCAAGTTCC TGCCCGACCA GGACCGCCCC AACGAGGGCG TTTTCTCGCT GCAGGACGAC GACGAGCAGT GGATCAGCCT GCGCTACGAC CTGACCGCGC CGCTCGCCCG CTACGTCGCG GAGAATTTTG ATGCACTGCC GAAGCCGTAT CGCAGCTACC GCTTCGGCTG GGTGTTCCGC AACGAAAAGC CCGGTCCCGG CCGCTTCCGC CAGTTCATGC AGTTCGACGC CGACACCGTG GGCGCGCCGA CCGCCGCCGC CGACGCCGAG ATGTGCATGA TGGCGGCGGA CACCATGGAA GCGCTGGGCA TTCCGCGCGG CAGCTATGTG GTGAAGGTGA ATAACCGCAA GGTGCTTGAT GGCGTGCTGG AGGCCATTGG GCTTGGCGGC GAAGAAAATG CTGGTCGCAG GCTGACCGTT CTGAGGGCCA TCGATAAGCT GGACAAGTTT CCGATCACGG AAATCGAGAA GCTACTCGGA AACGGTCGGT GGGATGGTGG TGAGGAAGGC AAAGGTGACT TCACTAAAGG CGCTGGCCTG ACTGAAGAGC AGATGCTGCG CATCTTCAAT TATGTTTCGT TTGGGGCTGT GCTTTCTACC TCGCCAGGTG ACTCAAATTT TGCCGGCTCG ATACAGAATC GCAATCCGAG CGAACTGCGT CGGTCGAACG AAGAGGTGGT ATCTGGACTC AAAGACTCTT TCGGGGACTC GCCGATCGCA CAGCAAGGGC TAGATGAGCT TGGCGAGATA GCTCGGCTTG CGACTGCGGG AGGATATGAT CAGATGAGAA TTCTCATCGA CCCTTCCGTC GTCCGCGGCC TCGAATACTA CACCGGTCCG GTCTACGAGG TCGAACTGCT GCTCGACACC AAGGACGACA AGGGCCGACC GATTCGGTTC GGCTCGGTCG GCGGCGGCGG GCGTTATGAT GGTCTGGTGT CGCGTTTCCG CGGCGAGCCG GTGCCGGCGA CCGGGTTCTC GATCGGCGTG TCGCGGCTGC AGGCGGCGCT GGAGATCGTC AACCGGGACA AGCCGAAACA GCCGAGCTAC GGCCCCGTCG TCGTCACCGT GTTCGGCGGC GATATCGCCG GCTATCAAAA AATGGTGTCG ACGCTGCGCC AGGCCGGCAT TCGCGCCGAA TTGTATCTCG GCAATCCGAA GCACTCGCTC GGCCAGCAGA TGAAATACGC CGACAAGCGC AACTCGCCCT GCGCCATCAT TCAGGGCTCC GACGAGAAAG AGCGTGGCGA AGTTCAGATC AAGGACCTGA TCCTCGGTGC CGAACTGGCG TCGCTGGAGA AGGATCGTGA GGAGTATCTG AAGAAACAGG CCGATGCGCA GTTCTCCTGC GGCGTGGATG ACCTCGTCGC CAAGGTCCGC GATGTGCTGG CGCGGCACAG CGTGAAGTGG AGTTAG
Protein sequence | MAEKPKKTQK LRARLPRGLA DRGPAEIAAT RSMVEIIREV YERYGFEPVE TPAFEYTDAL GKFLPDQDRP NEGVFSLQDD DEQWISLRYD LTAPLARYVA ENFDALPKPY RSYRFGWVFR NEKPGPGRFR QFMQFDADTV GAPTAAADAE MCMMAADTME ALGIPRGSYV VKVNNRKVLD GVLEAIGLGG EENAGRRLTV LRAIDKLDKF PITEIEKLLG NGRWDGGEEG KGDFTKGAGL TEEQMLRIFN YVSFGAVLST SPGDSNFAGS IQNRNPSELR RSNEEVVSGL KDSFGDSPIA QQGLDELGEI ARLATAGGYD QMRILIDPSV VRGLEYYTGP VYEVELLLDT KDDKGRPIRF GSVGGGGRYD GLVSRFRGEP VPATGFSIGV SRLQAALEIV NRDKPKQPSY GPVVVTVFGG DIAGYQKMVS TLRQAGIRAE LYLGNPKHSL GQQMKYADKR NSPCAIIQGS DEKERGEVQI KDLILGAELA SLEKDREEYL KKQADAQFSC GVDDLVAKVR DVLARHSVKW S
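The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own tooling, that strips the spacing, computes the GC fraction, and translates a coding sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons); the helper names `clean`, `gc_fraction`, and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11 uses
# the same codon-to-amino-acid assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_blocks = "ATGGCTGAAA AACCGAAGAA AACGCAGAAA"  # start of the gene sequence
    print(translate(first_blocks))
    print(gc_fraction(first_blocks))
```

Applied to the first three blocks of the gene sequence, `translate` returns `MAEKPKKTQK`, which agrees with the first block of the protein sequence. Running the same functions on the full gene sequence is one way to cross-check the reported protein length and GC content.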