Gene Information |
Locus tag | RPD_2579 |
Symbol | valS |
ID | 4023075 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2890251 |
End bp | 2893157 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637962777 |
Product | valyl-tRNA synthetase |
Protein accession | YP_569710 |
Protein GI | 91977051 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
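
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are inclusive, 1-based positions and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2890251, 2893157)
print(length, protein_length_aa(length))  # prints "2907 968"
```

Under these assumptions the result agrees with the Gene Length (2907 bp) and Protein Length (968 aa) fields.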
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0215604 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGCGT TCGCCTTTGG CGAAGCGCCC TTTTTTGATT GCATGGCCAT GATCGAGAAA ACCTACCAGC CCGCCGAGAT CGAAGGCCGC ATCGCGCGCG CCTGGGAGGA CGCCGACGCC TTCAAGGCCG GACGCCCCGA CCGCCGCGAC GCCGAACCCT ATTCGATCGT GATCCCGCCG CCGAACGTGA CCGGTTCGCT GCATATGGGA CACGCGCTCA ACAACACACT GCAGGACATT CTGTGCCGGT TCGAGCGGAT GCGCGGCCGT GACGTGCTGT GGCAACCAGG CACGGACCAC GCCGGCATCG CGACTCAGAT GGTCGTCGAG CGGCAGTTGA TGGAGCGGCA GGAGCCGGGC CGTCGCGAGA TGGGCCGCGC CAAGTTTTTG GAGCGGGTCT GGCAGTGGAA GGCCGAGAGC GGCGGCGTCA TCGTCAACCA GCTGAAGCGG CTCGGGGCCT CCTGCGACTG GTCGCGCGAG CGCTTCACGA TGGACGAGGG GCTGTCCCGC GCCGTCGTCA AGGTGTTCGT CGAGCTGCAC CGGCAGGGGC TGATCTATAA GGACAAGCGG CTGGTCAATT GGGACCCGAA GCTGCTCACC GCGATCTCCG ATCTCGAAGT GCAGCAGATT GAGGTCAAGG GCAATCTCTG GCACCTGCGC TATCCGATCG AGGGCGTCAG TTTCGATCCG GAGAATCCGG CGAGCTACAT CGTGGTGGCG ACGACCCGGC CGGAGACGAT GCTCGGCGAC ACCGCGGTGG CGGTGAATCC GGACGATGAG CGCTATGTCG ATCTGGTCGG CAAGCACGTC ATTCTGCCGC TGGTCGGCCG GCGGATTCCG ATCGTCGCTG ACGAATATTC CGATCCGGAG AAGGGCTCCG GCGCGGTGAA GATCACGCCG GCGCACGATT TCAACGACTT CGAGGTCGGA AAGCGGCATC ATTTGCCGCA GATCAATGTG CTCGACATCG AGGGCAAGAT CGCGATCGCC GACAACAGCG CCTATCTCGA AGGCCTACCC GAGGGCGCGC GGGAATTCGC CGAAGAGATC GATGGCACCG ACCGCTTCGC CGCGCGCAAA CTGATCGTGG GGCGGCTCGA CGAATTCGGA TTCCTCGAAA AGATCGAGCC CAACGTCCAC ATGGTTCCGC ATGGCGACCG CTCCGGCGTG GTGATCGAAC CCTATCTGAC CGACCAGTGG TACGTCGACG CCAAGACGAT GGCGCAGCCG GCGATCGCCG CGGTGCGCTC CGGCGCCACC ACCTTCGTGC CGAAGAACTG GGAGAAGACC TATTACGAGT GGATGGATAA CATCCAGCCG TGGTGCATCT CGCGGCAATT GTGGTGGGGC CACCAAATTC CGGCCTGGTA CGGCCCCGAT GGCAAGGTGT TCGTCGCCGA GACCGAGGAA GAGGCAATCG GCAACGCGCT CGGCTATTAC GTCGAGCAGG AAGTCATCAC GCCCGCGCAG GCGCACGAGA TGACGCAGGA CGCCAGCAAG CGTGACGGCT TCATCACCCG CGATGAAGAC GTGCTCGACA CCTGGTTCTC CTCGGCGCTG TGGCCGTTCT CGACACTCGG CTGGCCCGAC GAGACGTCCG AACTCGAGCG CTACTACCCG ACAAACGTTC TCGTCACCGG CTTCGACATC ATCTTCTTCT GGGTCGCCCG GATGATGATG ATGGGCCTGC ATTTCAAGAA CGACGTGCCT TTCCCGACGG TCTACATCCA CGCCCTCGTC CGCGACGAGA AGGGCGCCAA GATGTCGAAG TCGAAAGGCA ACGTCATCGA TCCGCTGCAT 
CTGATCGACG ACTACGGCGC CGATGCGCTG CGCTTCACGC TGGCGGCGAT GGCGGCACAG GGCCGTGACA TCAAACTGGC GTCGAGCCGG GTCGAGGGCT ATCGCAATTT CGCGACCAAG CTATGGAATG CCTGCCGCTT CGCCGAGATG AACGCCTGCG CCGCGCCGAT CGGCTTCGAT TACACGACGG CGAAAGAGAC CCTGAACCGC TGGATCGCGC ACGAGACCGT GCGGGCGGTC CGCGACGTCA CCGAGGCGAT CGAGTCGTAT CGCTTCAACG ATGCAGCGGG CGCCGCCTAT CGCTTCGTCT GGAACGTGTA TTGCGATTGG TATCTCGAAC TCGCCAAGCC CGTCTTGATG GGCGAGGACA GCACGGCCAA GACCGAGACC CGGGCGATGG TGGCGTGGGC GCGCGACGAG ATTCTGAAGC TGCTGCATCC GTTCATGCCG TTCATCACCG AGGAGTTGTG GGCGGTGACG GGCCAGCGCG ACGGCCTTCT GGTGCTGGCG CCGTGGTCCC GCAAGGCCGA ACTCAGCGTC GAGGTTTTTG CGAGTCCGGT GCTCACCGAC CCGATGGTGC CGCCGGTGAT CCTGCCGGCG CGGCACGACG CCGAATTCAG CGATCCGGCT GCGGAAGCCG AGATCGGCTG GGTGGTCGAT CTGGTGACCG CGATCCGCTC GGTGCGCGCC GAGATGAACA TCGTGCCGTC GACACTGACG CCGTTGGTGC TGGCCGGCGC GTCTGCGGAG ACCCGCGCCC GTGCCGAACG CTGGAGTGAC GTCATCAAGC GGCTGGCGCG GGTCGGCGAG ATTTCGTTCA CCGACGCCGC GCCGCAGGGC GCGGTACAGC TCCTGGTGCG CGGCGAGGTG GCGGCGCTGC CGCTGAAGGG CGTGATCGAT CTTGCCGCCG AGCAGGCGCG GCTGGACAAG GAACTGGCCA AGGCGGAAGC CGACATCAAG CGCGTCGACG CCAAGCTCTC GAACGAGAAG TTCGTCGCCA ACGCGCCGGA GGAGATCGTC GAGGAAGAGA AGGAAAAGCG CGAGGCCGCC GTCGCGCGCA AGGTCAAGAT CCTCGAAGCC TTGCTGCGGC TGAAGAACGC GACGTAA
|
Protein sequence | MRAFAFGEAP FFDCMAMIEK TYQPAEIEGR IARAWEDADA FKAGRPDRRD AEPYSIVIPP PNVTGSLHMG HALNNTLQDI LCRFERMRGR DVLWQPGTDH AGIATQMVVE RQLMERQEPG RREMGRAKFL ERVWQWKAES GGVIVNQLKR LGASCDWSRE RFTMDEGLSR AVVKVFVELH RQGLIYKDKR LVNWDPKLLT AISDLEVQQI EVKGNLWHLR YPIEGVSFDP ENPASYIVVA TTRPETMLGD TAVAVNPDDE RYVDLVGKHV ILPLVGRRIP IVADEYSDPE KGSGAVKITP AHDFNDFEVG KRHHLPQINV LDIEGKIAIA DNSAYLEGLP EGAREFAEEI DGTDRFAARK LIVGRLDEFG FLEKIEPNVH MVPHGDRSGV VIEPYLTDQW YVDAKTMAQP AIAAVRSGAT TFVPKNWEKT YYEWMDNIQP WCISRQLWWG HQIPAWYGPD GKVFVAETEE EAIGNALGYY VEQEVITPAQ AHEMTQDASK RDGFITRDED VLDTWFSSAL WPFSTLGWPD ETSELERYYP TNVLVTGFDI IFFWVARMMM MGLHFKNDVP FPTVYIHALV RDEKGAKMSK SKGNVIDPLH LIDDYGADAL RFTLAAMAAQ GRDIKLASSR VEGYRNFATK LWNACRFAEM NACAAPIGFD YTTAKETLNR WIAHETVRAV RDVTEAIESY RFNDAAGAAY RFVWNVYCDW YLELAKPVLM GEDSTAKTET RAMVAWARDE ILKLLHPFMP FITEELWAVT GQRDGLLVLA PWSRKAELSV EVFASPVLTD PMVPPVILPA RHDAEFSDPA AEAEIGWVVD LVTAIRSVRA EMNIVPSTLT PLVLAGASAE TRARAERWSD VIKRLARVGE ISFTDAAPQG AVQLLVRGEV AALPLKGVID LAAEQARLDK ELAKAEADIK RVDAKLSNEK FVANAPEEIV EEEKEKREAA VARKVKILEA LLRLKNAT
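
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, assuming the sequence text has been copied into a Python string; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the sample uses only the first two blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that split the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used only as a small sample.
sample = clean_sequence("ATGCGTGCGT TCGCCTTTGG")
print(len(sample), gc_fraction(sample))
```

Applied to the full gene sequence, the same function gives a value that can be compared with the GC content field in the Gene Information table.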