Gene Information |
Locus tag | RPC_2570 |
Symbol | valS |
ID | 3970909 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 2789421 |
End bp | 2792279 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637925680 |
Product | valyl-tRNA synthetase |
Protein accession | YP_532439 |
Protein GI | 90424069 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
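The coordinate and length fields above are mutually consistent, and the check is simple arithmetic. The following is a minimal sketch, assuming the Start/End coordinates are 1-based and inclusive (the listed gene length supports this) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2789421, 2792279)
print(length)                     # 2859
print(protein_length_aa(length))  # 952
```

Both values match the Gene Length and Protein Length rows of the record.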
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.252314 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGAAA AAACCTATCA GCCCGCCGAC ATCGAAGGCC GTATCGCACA AGCCTGGGAC GCCGAAGGCG CCTTCAAAGC CGGCCGCCCG GAGCGCCGCG ACGCCATTCC GTTCTCGATC GTGATCCCGC CGCCGAACGT CACCGGCTCG CTGCATATGG GCCACGCGCT CAACAACACG CTGCAGGACA TCCTGTGCCG GTTCGAGCGG ATGCGCGGCC GCGACGTGCT GTGGCAGCCC GGCACCGACC ACGCCGGCAT CGCCACCCAG ATGGTGGTCG AGCGGCAGCT GATGGAGCGC CAGGAGCCGA GCCGCCGCGA CATGGGCCGC GAAAAGTTTC TCGAGCGGGT GTGGCAGTGG AAGGCCGAAA GCGGCGGCGT CATCGTCAAC CAGCTGAAGC GGCTCGGGGC CTCCTGCGAC TGGTCGCGCG AGCGCTTCAC CATGGACGAG GGGCTGTCGC GCGCGGTCGC CAAGGTGTTC GTCGAGCTGC ACCGCCAAGG CCTGATCTAC AAGGACAAGC GGCTGGTCAA TTGGGACCCC AAGCTGCTCA CCGCGATCTC CGATCTCGAA GTGCAGCAGG TCGAAGTCAA GGGCAACCTG TGGCACCTGC GCTATCCGCT GGACGGCGTC GAATTCAATC CGGACGATCC GGAGACCTTC ATCGTGGTCG CCACCACGCG GCCGGAGACC ATGCTCGGCG ACACCGCGGT TGCGGTCAAT CCGGACGACG TGCGCTATTC GCATGTGGTC GGCCGCAATG TCATCCTGCC GCTGGTCGGG CGCAAGATTC CGATCATCGC CGACGATTAC TCCGATCCCG AAAAGGGCTC CGGCGCGGTG AAGATCACGC CGGCGCACGA CTTCAACGAC TTCGAGGTCG GCCGAAGGCA CAACCTGCCG CAGATCAACG TGCTCGACAT CGAAGGCCGC ATCGACCTCG TCGACAACGA AGCCTATCTG CACGGACTGC CGGATGACGC CGCGCAGTTT GCCGAAGAAC TGCATGGGCT GGAACGCTTC GCCGCGCGCA AGAGCATCGT GGCGCGGCTC GAAGAGCTCG GCTACCTGGA GAAGATCGAG CCCAACGTCC ACATGGTGCC GCACGGCGAC CGCTCCGGCG TGGTGATCGA GCCGTATCTG ACCGACCAGT GGTACGTCGA CGCCAAGACG CTGGCGCAGC CGGCGATCGC CGCGGTGCGC TCGGGCGCGA CGACCTTCGT GCCGAAGAAC TGGGAAAAGA CCTATTTCGA ATGGATGGAC AACATCCAGC CCTGGTGCAT CTCGCGGCAA TTGTGGTGGG GCCATCAGAT CCCGGCCTGG TACGGCCCGG ACGGCAAGGT GTTCGTCGCC GAGACCGAGG ACGAAGCGGT CGGCAACGCG CTCGGCTACT ACGTCGAGCA GGAAGTCATC ACCCCGGCGC AGGCGCATGC GATGACCCAG GACGCTTCGC TTCGCGACGG CTTCATCACC CGCGACGAGG ACGTGCTCGA CACCTGGTTC TCCTCGGCGC TGTGGCCGTT CTCGACGCTG GGCTGGCCGG ACGACGACAC CGATTGCAAG CGCTACTACC CGACCAACGT GCTGGTCACC GGCTTCGACA TCATCTTCTT CTGGGTCGCC CGGATGATGA TGATGGGGCT GCACTTCATG GACGACGTGC CGTTCCCGAC CGTCTACATC CACGCCCTCG TCCGCGACGA GAAGGGCGCC AAGATGTCGA AGTCGAAGGG CAACGTGATC GATCCGTTGC ACCTGATCGA CGATTACGGC GCCGACGCGC TGCGCTTCAC GCTTGCCGCG 
ATGGCGGCGC AGGGCCGCGA CATCAAGCTC GCCTCCAGCC GCGTCGAGGG CTACCGCAAT TTCGCGACCA AGCTGTGGAA TTCGAGCCGC TTCGCCGAGA TGAACGCCTG CGCGCTGCCC GCGGATTTCG ATCCGACCGC GGCGAAGCAG ACGCTGAACC GCTGGATCGC GCACGAGACC GTGCGCGCGG TGCGCGAGGT CACCGAGGCG ATCGAGGCCT ATCGCTTCAA CGACGCGGCG GGCGCCGCTT ATCGTTTCGT CTGGAACGTG TATTGCGACT GGTATCTCGA ACTCGCCAAG CCGGTGCTGC TCGGCGAGGA CAGCCCGGCC AAGGCCGAGA CCCGCGCCAT GGTGGCCTGG GCGCGCGATG AAATCCTCAA GCTGCTGCAT CCGTTCATGC CGTTCATCAC CGAAGAGCTG TGGGCGGTGA CCGCGGAGCG CGACACGCTG CTGACGCTGA CGCCGTGGTC GCGCAAGGCC GAGACCGCGG TCGAGGTGTT CGACGCCCCG GCGCTGACCG ACCCGATGGT GCCGCCGGTG ATCCTGCCGG CGCCGCACGT CGCCGACTTC AGCGATCCCG CGGCCGAGGC CGAAATCGGC TGGGTGGTCG ACCTGGTCAC CGCGATCCGT TCGGTGCGCG CCGAGATGAA CATCACGCCG TCGACGCTGA CGCCGCTGGT GCTGGCCGGC GCTTCGGCGG AGACCCGCGA CCGCGCGCAG CGCTGGAGCG ATGTGGTGAA GCGGATGGCC CGGCTCGGCG AGATTTCGTT TGCCGACAGC GCACCGCAGG GCGCGGTGCA ACTGTTGATC CGCGGCGAGG TCGCAGCATT GCCGCTGAAG GGCGTGATCG ACGTCGCCGC CGAGCAGGCC CGGCTCGACA AGGAACTGGC GAAGGCCGAG GCCGACATCA AGCGGGTCGA CGCCAAGCTC TCCAACGAGA AATTCGTCGC CAACGCGCCG GAAGAGATCG TCGAGGAAGA GAAGGAAAAG CGCGAGGCCG CGGTCGCGCG CAAGGCCAAG ATTTTGGAAG CGCTGGAGCG GCTGAAGAAC GCCACCTGA
|
Protein sequence | MIEKTYQPAD IEGRIAQAWD AEGAFKAGRP ERRDAIPFSI VIPPPNVTGS LHMGHALNNT LQDILCRFER MRGRDVLWQP GTDHAGIATQ MVVERQLMER QEPSRRDMGR EKFLERVWQW KAESGGVIVN QLKRLGASCD WSRERFTMDE GLSRAVAKVF VELHRQGLIY KDKRLVNWDP KLLTAISDLE VQQVEVKGNL WHLRYPLDGV EFNPDDPETF IVVATTRPET MLGDTAVAVN PDDVRYSHVV GRNVILPLVG RKIPIIADDY SDPEKGSGAV KITPAHDFND FEVGRRHNLP QINVLDIEGR IDLVDNEAYL HGLPDDAAQF AEELHGLERF AARKSIVARL EELGYLEKIE PNVHMVPHGD RSGVVIEPYL TDQWYVDAKT LAQPAIAAVR SGATTFVPKN WEKTYFEWMD NIQPWCISRQ LWWGHQIPAW YGPDGKVFVA ETEDEAVGNA LGYYVEQEVI TPAQAHAMTQ DASLRDGFIT RDEDVLDTWF SSALWPFSTL GWPDDDTDCK RYYPTNVLVT GFDIIFFWVA RMMMMGLHFM DDVPFPTVYI HALVRDEKGA KMSKSKGNVI DPLHLIDDYG ADALRFTLAA MAAQGRDIKL ASSRVEGYRN FATKLWNSSR FAEMNACALP ADFDPTAAKQ TLNRWIAHET VRAVREVTEA IEAYRFNDAA GAAYRFVWNV YCDWYLELAK PVLLGEDSPA KAETRAMVAW ARDEILKLLH PFMPFITEEL WAVTAERDTL LTLTPWSRKA ETAVEVFDAP ALTDPMVPPV ILPAPHVADF SDPAAEAEIG WVVDLVTAIR SVRAEMNITP STLTPLVLAG ASAETRDRAQ RWSDVVKRMA RLGEISFADS APQGAVQLLI RGEVAALPLK GVIDVAAEQA RLDKELAKAE ADIKRVDAKL SNEKFVANAP EEIVEEEKEK REAAVARKAK ILEALERLKN AT
|
| |
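The sequences above are printed in space-separated blocks of ten, so they need light cleanup before any computation. The sketch below, under stated assumptions, strips the whitespace, computes the GC fraction, and translates a CDS with the amino-acid assignments of NCBI translation table 11 (which are the same as those of the standard code). The helper names are hypothetical, and alternative start codons (GTG, TTG), which table 11 permits, are not special-cased because this gene starts with ATG.

```python
from itertools import product

# Codon order follows the NCBI genetic-code listing (T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence shown above.
print(translate("ATGATCGAAA AAACCTATCA GCCCGCCGAC"))  # MIEKTYQPAD
```

Applied to the full gene sequence, `gc_fraction` can be compared against the GC content row, and `translate` against the protein sequence row.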