Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1185 |
Symbol | hisS |
ID | 3910120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1359230 |
End bp | 1360810 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637883079 |
Product | histidyl-tRNA synthetase |
Protein accession | YP_484806 |
Protein GI | 86748310 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0124] Histidyl-tRNA synthetase |
TIGRFAM ID | [TIGR00442] histidyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.445744 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.165943 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGAGA AACCCAAGAA ACCACAGAAG TTGCGCGCCC GTCTGCCGCG CGGCCTGACC GATCGCGGCC CCGCCGAGAT CGCGGCGACG CGCGCGATGG TGGAAACCAT CCGCGAGGTC TATGAGCGCT ACGGCTTCGA GCCGGTCGAG ACCCCGGCGT TCGAATACAC CGACGCGCTC GGCAAGTTCC TGCCCGACCA GGATCGCCCC AACGAGGGCG TGTTCTCGCT GCAGGACGAC GACGAGCAAT GGATCAGCCT GCGCTACGAC CTGACCGCGC CGCTCGCCCG CTACGTCGCC GAAAATTTCG ACGCACTGCC GAAGCCCTAT CGCAGCTACC GTTTCGGCTG GGTGTTCCGC AACGAAAAGC CCGGCCCCGG CCGCTTCCGC CAGTTCATGC AGTTCGACGC CGACACCGTG GGCTCCGGCT CGCCGGCCGC CGATGCCGAG ATGTGCATGA TGGCGGCGGA CACGATGGAA GCGTTGGGCA TCCCGCGCGG CAGCTACGTG GTGAAGGTGA ACAACCGCAA GGTGCTGGAT GGGGTGCTGG AGGCCATTGG CCTTGGCGGG GATGAGAATG CGGGGCGCCG GCTCACGGTG CTGAGGGCTA TCGATAAGTC GGATAAATTT CCGCCCGAGG AGATCAAGAA GCTTCTGGGG CCGGGACGTT GGGATGGTGG CGAAGAAGGC AAAGGCGATT TCACAAAAGG CGCGATGCTT GGTGATGATC AGATTGAGCT GATTCTCAGA GCAACTTCGC CAAGCTTCAT AGCAGGCCGC TTCAATGCCG ATGGAAGCGG CGGCGTTAGC AATGCTGATA CAGTCGAACT CCTTCGCTCG ACGGCGGACA ACGAGACTTT AAAGCAAGGA TGCGATGAAC TGTCGGTAAT CTCGGATCTG CTGGATTCTG GCGGCTATGG TGCAACGTCC ACAAATCCTA ATGTGCGCGT TGTCATCGAT CCCTCCGTCG TCCGAGGCCT CGAATACTAC ACCGGCCCGG TCTACGAGGT CGAACTGCTG CTCGAGACCA AGGACGAAAA GGGGCGCCCG GTGCGGTTCG GCTCGGTCGG CGGCGGCGGT CGTTACGATG GTCTGGTGTC GCGCTTCCGC GGCGAGCCGG TGCCGGCGAC CGGGTTCTCG ATCGGTGTGT CGCGGCTGCA GGCGGCGCTG ACGATGATCG GCAAGCTTGG GACCCGGCCC GCGACCGGCC CGGTGGTGGT GACGGTGTTC GACCGCGAGC GGCTCGCCGA CTACCAGAAG ATGGTGTCGC AGCTCCGCGC CGAGGACATC CGCGCCGAGC TCTATCTCGG CAATCCGAAG AACATGGGCA ACCAGCTCAA ATACGCCGAC AAGCGCAACT CGCCTTGCGT GATCATCCAG GGCTCCGATG AGAAGAACGA TCCGGACGGC CCGCAGGTGA TCGTCAAGGA CCTGATCCTC GGCGCCGAAC TCGCCGCTCT GGACAAGGAT CGCGACGATT ATCTGCAGCG TCAGGCCGAC GCCCAGCGCA AAGTGCCGCA GCTCGGGATG ATCGACGAAG TGCGGCGGAT TCTGGCGCGG CACGACATCG ACTGGAATTG A
|
Protein sequence | MAEKPKKPQK LRARLPRGLT DRGPAEIAAT RAMVETIREV YERYGFEPVE TPAFEYTDAL GKFLPDQDRP NEGVFSLQDD DEQWISLRYD LTAPLARYVA ENFDALPKPY RSYRFGWVFR NEKPGPGRFR QFMQFDADTV GSGSPAADAE MCMMAADTME ALGIPRGSYV VKVNNRKVLD GVLEAIGLGG DENAGRRLTV LRAIDKSDKF PPEEIKKLLG PGRWDGGEEG KGDFTKGAML GDDQIELILR ATSPSFIAGR FNADGSGGVS NADTVELLRS TADNETLKQG CDELSVISDL LDSGGYGATS TNPNVRVVID PSVVRGLEYY TGPVYEVELL LETKDEKGRP VRFGSVGGGG RYDGLVSRFR GEPVPATGFS IGVSRLQAAL TMIGKLGTRP ATGPVVVTVF DRERLADYQK MVSQLRAEDI RAELYLGNPK NMGNQLKYAD KRNSPCVIIQ GSDEKNDPDG PQVIVKDLIL GAELAALDKD RDDYLQRQAD AQRKVPQLGM IDEVRRILAR HDIDWN
|
| |