Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_2853 |
Symbol | valS |
ID | 6410522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 3111615 |
End bp | 3114488 |
Gene Length | 2874 bp |
Protein Length | 957 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642712733 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001991836 |
Protein GI | 192291231 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.250427 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGAGA AAACCTACCA GCCAGCCGAC ATCGAGGCCC GCATTTCGCG CGCCTGGGAA GACGCCGAGG CCTTCAAGGC CGGCCGTCCG GAGCGCCGCG ACGCCGTGCC ATACTCGATC GTCATTCCGC CGCCGAACGT CACCGGCTCG CTGCATATGG GCCACGCGCT CAACAATACG CTGCAGGACA TCCTGTGCCG GTTCGAGCGG ATGCGTGGCC GCGACGTGCT GTGGCAGCCC GGCACCGACC ACGCCGGCAT CGCCACCCAA ATGGTGGTCG AACGCCAGTT GATGGAGCGG CAGGAGCCGA GCCGGCGCGA CATGGGCCGC GCCAAGTTCC TGGGGCGCGT TTGGCAGTGG AAGGCGGAGA GCGGCGGCGT CATCGTCAAC CAGCTCAAGC GGCTCGGTGC GTCGTGCGAC TGGTCGCGCG AACGCTTCAC CATGGACGAG GGGCTGTCCC GCGCCGTCGC CAAGGTGTTC GTCGAGCTGC ACCGCCAGGG CCTGATCTAC AAGGACAAGC GGCTGGTCAA TTGGGACCCG AAGCTGCTGA CCGCGATCTC GGATCTGGAA GTTCAGCAGA TCGAGGTGAA GGGCAATCTC TGGCATCTGC GCTATCCGAT CGAAGGCAAG ACCTTCGATC CGGCCGATCC GTCGAGCTTC ATCGTCGTCG CGACCACGCG TCCCGAAACC ATGCTCGGCG ACACTGCGGT CGCGGTGAAT CCCGAAGACG AGCGCTATAC GCATCTGGTC GGCAAGCACG TCATCCTGCC GCTGGTCGGC CGGCGGATTC CGATCGTCGC CGACGAATAC TCCGACCCCG AGAAGGGATC GGGCGCGGTG AAGATCACGC CGGCGCACGA CTTCAACGAC TTCGAGGTCG GCAAGCGCCA TTATTTGCCG CAGATCAATG TGCTCGATAT CGAGGGCAAG ATCTCGGTCG CCGACAACAG TGCCTATCTC GAAGGTCTGC CGGAAGGCGC GCGCGAATTC GCCGGGGAGA TCGACGGCAC CGACCGCTTC GTCGCCCGCA AGATCATCGT GGCGCGGCTG GACGATTTCG GCTTCCTGGA GAAGATCGAG CCGAACGTGC ACATGGTGCC GCACGGCGAC CGCTCCGGCG TGGTGATCGA GCCGTTCCTC ACCGACCAGT GGTACGTCGA CGCCAAGACG CTGGCGCAGC CGGCGATCGC CGCCGTGCGC TCGGGCGAGA CGACCTTCGT GCCCAAGAAC TGGGAGAAGA CCTACTTCGA GTGGATGGAA AACATCCAGC CGTGGTGCAT CTCGCGCCAG CTGTGGTGGG GTCACCAGAT CCCGGCGTGG TATGGCCCGG ACGGCAAGGT GTTCGTCGCC GAGACCGAGG AAGAGGCGGT CGGCAACGCG CTCGGCTATT ACGTCGAGCA GGAAGTGATC ACGCCTGCGC AGGCGCACGA CATGGCGGAA GATCCCGCCA AGCGTGAGGG CTTCATCACC CGTGACGAGG ACGTGCTCGA CACCTGGTTC TCGTCGGCGC TGTGGCCGTT CTCGACGCTC GGCTGGCCGG ACGAGACGCC GGAGCTCGAC CGTTACTACC CGACCAACGT GCTGGTCACC GGCTTCGACA TCATCTTCTT CTGGGTCGCC CGGATGATGA TGATGGGCCT GCACTTCATG GACGACGTGC CGTTCCCGAC CGTCTACATC CACGCGCTCG TCCGCGACGA GAAGGGCGCC AAGATGTCGA AGTCGAAGGG CAACGTCATC GATCCGCTCA ACCTGATCGA CGAATACGGC GCCGACGCGC TGCGCTTCAC GCTGGCTGCG ATGGCGGCGC AGGGCCGCGA CATCAAGCTC GCGACCAGCC GCGTCGAAGG CTATCGCAAC TTCGCCACCA AACTCTGGAA CGCCTGCCGC TTCGCCGAGA TGAACGGCTG CGTCGCGCCG GTCGGGTTCG ACTACACCGC GGCCAAGGAA ACGCTGAACC GCTGGATCGC GCACGAGACG GTGCGGGCGG TGCGTGAGGT GACCGAAGCG ATCGAATCCT ATCGCTTCAA CGACGCCGCC GAGGCTGCGT ATCGCTTCGT CTGGAACGTG TATTGCGACT GGTATCTCGA ACTCGCCAAG CCGGTGCTGA TGGGCGAGGA GGGCGCTGCC AAGACCGAGA CCCGTGCGAT GGTGGCGTGG GCGCGCGACG AGATCCTGAA GATCCTGCAT CCCTTCATGC CGTTCATCAC CGAAGAGCTG TGGGCGGTGA CGGCTCCGCG CGACGGACTG CTGGCGCTGG CGCCGTGGTC GCGCAAGGCG GGTATCTCGG ACGAAGAGGT GTCGGTGCTG GCGGCCTCCG CCGCGACCGA CCCGATGGCC GGGCCGGCGA TGCTGGCGAT TCCGGAGCCG CAGGAGCCGG ACTTCACCGA CGATGCCGCT GAAGCGGAAA TCGGCTGGGT GGTCGATCTC GTCACTGCGA TCCGCTCGGT GCGCGCCGAA ATGAACATCG TGCCCTCGAC CCTCACGCCG CTGGTGCTGG CCGGCGCTTC TGCCGACACC AATGCGCGGG CGAGCCGCTG GAGCGACGTG ATCAAGCGGC TGGCTCGGGT CGGCGAGATC TCGTTCGCTG ACGCCGCTCC GCAAGGCGCC GTGCAGCTCC TGGTGCGCGG CGAGGTCGCG GCGCTGCCGC TGAAGGGCGT GGTCGATTTC GCCGCCGAGC AGGCACGGCT CGAGAAGGAG CTCGGCAAGG CCGAAGCCGA CATCAAGCGC GCCGAGGCCA AGCTGGCGAA CGAGAAGTTC GTCGCCAACG CCGCCGAAGA AGTCGTCGAG GAAGAGCGCG AAAAGCGCGA GGCTGCGGTC GCGCGCAAGG TCAAGATCCT CGAGGCGCTG CTTCGGCTGA AGAACGCGAG CTGA
|
Protein sequence | MIEKTYQPAD IEARISRAWE DAEAFKAGRP ERRDAVPYSI VIPPPNVTGS LHMGHALNNT LQDILCRFER MRGRDVLWQP GTDHAGIATQ MVVERQLMER QEPSRRDMGR AKFLGRVWQW KAESGGVIVN QLKRLGASCD WSRERFTMDE GLSRAVAKVF VELHRQGLIY KDKRLVNWDP KLLTAISDLE VQQIEVKGNL WHLRYPIEGK TFDPADPSSF IVVATTRPET MLGDTAVAVN PEDERYTHLV GKHVILPLVG RRIPIVADEY SDPEKGSGAV KITPAHDFND FEVGKRHYLP QINVLDIEGK ISVADNSAYL EGLPEGAREF AGEIDGTDRF VARKIIVARL DDFGFLEKIE PNVHMVPHGD RSGVVIEPFL TDQWYVDAKT LAQPAIAAVR SGETTFVPKN WEKTYFEWME NIQPWCISRQ LWWGHQIPAW YGPDGKVFVA ETEEEAVGNA LGYYVEQEVI TPAQAHDMAE DPAKREGFIT RDEDVLDTWF SSALWPFSTL GWPDETPELD RYYPTNVLVT GFDIIFFWVA RMMMMGLHFM DDVPFPTVYI HALVRDEKGA KMSKSKGNVI DPLNLIDEYG ADALRFTLAA MAAQGRDIKL ATSRVEGYRN FATKLWNACR FAEMNGCVAP VGFDYTAAKE TLNRWIAHET VRAVREVTEA IESYRFNDAA EAAYRFVWNV YCDWYLELAK PVLMGEEGAA KTETRAMVAW ARDEILKILH PFMPFITEEL WAVTAPRDGL LALAPWSRKA GISDEEVSVL AASAATDPMA GPAMLAIPEP QEPDFTDDAA EAEIGWVVDL VTAIRSVRAE MNIVPSTLTP LVLAGASADT NARASRWSDV IKRLARVGEI SFADAAPQGA VQLLVRGEVA ALPLKGVVDF AAEQARLEKE LGKAEADIKR AEAKLANEKF VANAAEEVVE EEREKREAAV ARKVKILEAL LRLKNAS
|
| |