Gene Information |
Locus tag | Smed_0403 |
Symbol | hisS |
ID | 5321237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 432749 |
End bp | 434272 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640789338 |
Product | histidyl-tRNA synthetase |
Protein accession | YP_001326095 |
Protein GI | 150395628 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0124] Histidyl-tRNA synthetase |
TIGRFAM ID | [TIGR00442] histidyl-tRNA synthetase |
|
|
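The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption that is consistent with the listed Gene Length) and that the CDS ends with a single stop codon that is not counted in the Protein Length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and that the final
    codon is a stop codon, which does not encode an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Smed_0403, hisS).
length = gene_length_bp(432749, 434272)
print(length)                     # 1524
print(protein_length_aa(length))  # 507
```

Both results agree with the Gene Length and Protein Length fields of the record.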
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.326468 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0636459 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTTCCA TGAATGAAAA ACAGAAGAAA ACGCAGAAGC TCAGGGCGCG CTTGCCGCGT GGTTTCGTAG ATCGTCCGGC CGCGGACATC CGCGCCGTGG ACGAGATGGT TTCGAAGATC CGCGAGGTCT ATGAACGCTA CGGCTTCGAT CCGGTGGAAA CCCCGCTTTT TGAATATACT GACGCGCTCG GAAAATTCCT GCCGGACAGC GACCGGCCGA ATGAAGGCGT CTTCTCGCTC ACCGACGACG ACGACCAGTG GCTTAGTCTT CGCTACGACC TGACGGCGCC GCTGGCGCGC CACGTGGCCG AGAATTTCAA CGAGATCCAG CTTCCCTACC GGACCTATCG CGCCGGCTAC GTCTTCCGCA ACGAGAAGCC GGGGCCGGGC CGTTTCCGCC AGTTCATGCA GTTCGATGCC GATACGGTGG GTGCCGCCGG CGTGCAGGCC GATGCCGAAA TGTGCATGAT GATGGCCGAC ACGATGGAGG CGCTCGGCAT TGCCCGCGGC GACTACGTGA TCCGCGTCAA CAACCGCAAG GTTCTCGACG GCGTTCTGGA GGCGATAGGT CTCGGCGGCG GCGAGCAGAT GAATACGCGC CTGACGGTGC TGCGTGCCAT CGACAAGCTC GACAAATTCG GCCCGGAAGG CGTGCGGCTG TTGCTGGGCG AAGGACGCAA GGACGAGAGC GGCGACTTCA CCAAGGGCGC CGGGCTCGAC GATGAGCAGA TCGGCAAGAT CCTGTTCTTT GTCGGCATCA CCGACTATGC CGAGAGCGCG GACGCGTTGG CGGCACTCGT CGCCGGGACG GCCCGGGGCG CCGAGGGCGT CAATGAGTTG AACACGATCC GCAGCCTTGT GCTCAGCGCC GGCTACGAGG CGGACCGCAT CAAGATCGAC CCCTCGGTCG TGCGTGGTCT GGAATATTAT ACGGGTCCCG TTTTCGAGGC GGAGTTGCAG TTCGCCGTCA CCAACGAAAA GGGGGAGAAG GTCGTCTTCG GCTCCGTCGG CGGAGGCGGC CGCTATGACG GTCTCGTGTC GCGCTTCATG GGACAGCCGG TGCCGGCGAC CGGCTTTTCG ATCGGCGTGT CTCGGCTGAT GACGGCGCTC AAGAATCTGG GCAAACTGGG CGTAGAACAG GTCACCGCGC CGGTCGTCGT TTGCGTCATG GACCGCGACA TCGAAAGCAT GGGGCGCTAC CAGCGCTTTG TGCAGGATCT GCGTCATGCC GGCATCCGCG CCGAAATGTA CCAGGGCAAC AAGAAGAACT TCGGCGACCA GCTCAAATAT GCCGATCGTC GCGGTTCGCC GATCGCTGTC ATTCAGGGCG GCGACGAGCG CGCTTCCGGC GTCGTACAGA TCAAGGACCT GATCGAAGGC AAGCGGCTCT CTGGCGAAAT CGAAGATAAT GTCGCTTGGC GCGAGGCGCG TGCCGCTCAG ATTTCCGTCC CGGAAGGCGA ACTCGTCGTT AAGATTCGCG AAATTCTCGC GGACCAGGCC GAGGACCGGA AAAGGGCGGG TTGA
|
Protein sequence | MLSMNEKQKK TQKLRARLPR GFVDRPAADI RAVDEMVSKI REVYERYGFD PVETPLFEYT DALGKFLPDS DRPNEGVFSL TDDDDQWLSL RYDLTAPLAR HVAENFNEIQ LPYRTYRAGY VFRNEKPGPG RFRQFMQFDA DTVGAAGVQA DAEMCMMMAD TMEALGIARG DYVIRVNNRK VLDGVLEAIG LGGGEQMNTR LTVLRAIDKL DKFGPEGVRL LLGEGRKDES GDFTKGAGLD DEQIGKILFF VGITDYAESA DALAALVAGT ARGAEGVNEL NTIRSLVLSA GYEADRIKID PSVVRGLEYY TGPVFEAELQ FAVTNEKGEK VVFGSVGGGG RYDGLVSRFM GQPVPATGFS IGVSRLMTAL KNLGKLGVEQ VTAPVVVCVM DRDIESMGRY QRFVQDLRHA GIRAEMYQGN KKNFGDQLKY ADRRGSPIAV IQGGDERASG VVQIKDLIEG KRLSGEIEDN VAWREARAAQ ISVPEGELVV KIREILADQA EDRKRAG
|
| |
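The sequences above are displayed in space-separated blocks of 10 characters, so they need light cleaning before any computation. The following is a minimal sketch of how a pasted gene sequence could be normalized and checked. All function names are hypothetical, and the completeness check is deliberately simplified: it only accepts ATG as a start codon, although translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, recognized stop codon."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# The first two blocks of the gene sequence above, used only as a short demo.
fragment = clean_sequence("ATGCTTTCCA TGAATGAAAA")
print(len(fragment))                    # 20
print(f"{gc_fraction(fragment):.2f}")   # 0.30
```

Applying clean_sequence to the full Gene sequence field and then gc_fraction to the result would be one way to cross-check the GC content field of the record.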