Gene Information |
Locus tag | RPB_2892 |
Symbol | valS |
ID | 3910686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3291322 |
End bp | 3294240 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637884793 |
Product | valyl-tRNA synthetase |
Protein accession | YP_486505 |
Protein GI | 86750009 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
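The coordinate and length fields above are linked by simple arithmetic. The following is a minimal Python sketch of that check, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon; both assumptions agree with the values listed. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3291322, 3294240)
print(length)                  # 2919
print(protein_length(length))  # 972
```

Both results match the Gene Length (2919 bp) and Protein Length (972 aa) fields in the record.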
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCGC TCGCCCGCCC GGCAAGCCCC CATTTTGATT GCATGGCCAT GATCGAGAAA ACCTACCAGC CCGCCGACAT CGAAAGCCGC ATCGCGCGCG CCTGGGAAGA CGCCGAGGCG TTCAAGGCCG GCCGCGCCGA CCGCCGCGAC GCCGAGCCGT ATTCGATCGT GATCCCGCCG CCGAACGTCA CCGGCTCGCT GCATATGGGC CACGCGCTCA ACAATACGCT CCAGGACATC CTGTGCCGGT TCGAGCGGAT GCGCGGCCGC GACGTGCTGT GGCAGCCCGG CACCGACCAC GCCGGCATCG CCACCCAGAT GGTCGTCGAG CGGCAACTGA TGGAGCGGCA GGAGCCGGGC CGGCGTGAGA TGGGCCGGGC GAAATTCCTG GAGCGGGTGT GGCAGTGGAA GGCCGAGAGC GGTGGCGTCA TCGTCAACCA GCTCAAGCGG CTCGGCGCCT CCTGCGACTG GTCGCGCGAG CGCTTCACCA TGGACGAGGG CCTGTCCCGC GCGGTGGTGA AGGTGTTCGT CGAACTGCAC CGGCAGGGGC TGATCTACAA GGACAAGCGG CTGGTCAATT GGGACCCGAA GCTGCTCACC GCGATCTCCG ATCTCGAAGT GCAGCAGATC GAAGTGAAGG GCCATCTCTG GCACCTGCGC TACCCGCTCG AAGGCGTGCC GTTCGATCCC GAGAATCCGT CGAGCTACAT CGTCGTCGCC ACCACGCGGC CGGAAACCAT GCTCGGCGAT ACCGCGGTCG CGGTGAATCC GGACGATGAT CGCTATGTCG ATCTGATCGG CAAGCACGTC GTCCTGCCGC TGGTCGGCCG GCGGATTCCG ATCGTCGCCG ACGAATATTC CGATCCCGAG AAGGGCTCCG GCGCGGTGAA GATCACGCCG GCGCACGACT TCAACGATTT CGAGGTCGGC CGCCGGCACC ACCTGCCGCA GATCAACGTG CTCGACATCG AGGGCCGCAT CGCGGTCGCC GACAACAGCG CCTATCTCGA CGGCCTGCCG GAAGGCGCGC GCGACTTCGC CGCCGAGATG GAAGGCGTCG ATCGATTCGT CGCGCGAAAG ATCATCGTGC AGCGGCTCGA CGATTTCGGC TTCCTGGAGA AGATCGAGCC CAACGTCCAC ATGGTGCCGC ATGGCGACCG ATCGGGCGTG GTGATCGAGC CGTTCCTGAC CGACCAATGG TACGTCGACG CCAAGACCAT GGCGCAGCCG GCGATCGCCG CGGTGCGCTC GGGTGCGACG AATTTCGTGC CGAAGAACTG GGAGAAGACC TACTACGAGT GGATGGAGAA CATCCAGCCC TGGTGCATTT CACGGCAATT GTGGTGGGGC CATCAGATCC CGGCCTGGTA CGGGCCGGAC GGCCGTGTGT TCGTCGCCGA GACCGAGGAA GAGGCGGTGG GCAATGCGCT CGGCTATTAT GTCGAGCAGG AGGTGCTGAC ACCCGAGCAG GCGCACGACA TGGCGGAAGA TCCGCTCAAG CGCGAGGGCT TCATCACCCG CGATGAAGAC GTGCTCGACA CCTGGTTCTC CTCGGCGCTG TGGCCGTTCT CGACGCTGGG CTGGCCGGAC GAGACGCCGG AGCTCGATCG CTACTACCCG ACCAACGTGC TGGTCACCGG CTTCGACATC ATCTTCTTCT GGGTCGCCCG GATGATGATG ATGGGGCTGC ACTTCAAGAG CGATGTGCCG TTCCCGACGG TCTACATCCA CGCCCTCGTC CGCGATGAGA AGGGCGCCAA GATGTCGAAG TCGAAGGGCA ACGTCATCGA TCCGCTCAAC 
CTGATCGACC AATACGGCGC CGACGCGCTG CGCTTCACAC TGGCGGCGAT GGCGGCGCAG GGCCGCGACA TCAAGCTGGC GTCGAGCCGC GTCGAAGGCT ATCGCAACTT CGCCACCAAG CTGTGGAACG CCTGCCGCTT CGCCGAAATG AACGGCTGCG CGGCGCCGAA GGAATTCGAC TACACCGCGG CGAAGGAAAC GCTGAACCGC TGGATCGCGC ACGAGGCGGT GCGCGCGACC CGCGAGGTCA CCGAGGCGAT CGAGAGCTAT CGCTTCAACG ACGCGGCGGG CGCGGCCTAT CGCTTCGTCT GGAACGTGTA TTGCGACTGG TATCTCGAAC TCGCCAAACC CGTGCTGATG GGCGAGGAGA GCGCCGCCAA GGCCGAGACC CGCGCGATGG TGGCGTGGGC GCGCGACGAG ATCCTCAAAC TGCTGCATCC GTTCATGCCG TTCATCACCG AGGAGCTGTG GGCGGTGACG GCGTCGCGTG ACGGCCTGCT GGTGCTGGCG CCGTGGTCGC GCAAATCCGG GATGAGTGAC GAGCAGATCG CGGTGATGAG CGCGTCGGCG GCGAGCGATC CGATGGTCGG GGCGGCGCTG CTCAGCATTC CGGCCGCTGC GCCGGACTTC ACCGACGACG CGGCCGAAGC CGAGATCGGC TGGGTGGTCG ATCTGATCAC CGCGATGCGC TCGGTGCGCG CCGAGATGAA CATCACGCCG GCGACGCTAA CGCCGCTGGT GCTGGTCGGC GCCTCCGCCG CCACGCGCGG CCGCGCCGAG CGCTGGAGCG ACGTGATCAA GCGGCTGGCG CGCGTCGGCG AGATTTCGTT CGCCGACAGC GCACCGCAGG GCGCCGTGCA GTTGCTGGTG CGCGGCGACG TCGCCGCCTT GCCGCTGAAA GGGCTGATCG ACTTCGCCGC CGAGCAGGCG CGGCTGGAGA AGGAGCTCGG CAAGGCCGAA GCCGACATCA AGCGCGCGGA AGCCAAGCTG GCCAACGAGA AATTCGTCGC CAACGCCGCC GAAGAAGTCG TCGAGGAAGA GAAGGAAAAG CGCGAGGCCG CGGTCGCCCG CAAGATGAAG ATCCTCGAAG CGCTGCTGCG GATCAAGAAC GCGTCGTAA
|
Protein sequence | MRALARPASP HFDCMAMIEK TYQPADIESR IARAWEDAEA FKAGRADRRD AEPYSIVIPP PNVTGSLHMG HALNNTLQDI LCRFERMRGR DVLWQPGTDH AGIATQMVVE RQLMERQEPG RREMGRAKFL ERVWQWKAES GGVIVNQLKR LGASCDWSRE RFTMDEGLSR AVVKVFVELH RQGLIYKDKR LVNWDPKLLT AISDLEVQQI EVKGHLWHLR YPLEGVPFDP ENPSSYIVVA TTRPETMLGD TAVAVNPDDD RYVDLIGKHV VLPLVGRRIP IVADEYSDPE KGSGAVKITP AHDFNDFEVG RRHHLPQINV LDIEGRIAVA DNSAYLDGLP EGARDFAAEM EGVDRFVARK IIVQRLDDFG FLEKIEPNVH MVPHGDRSGV VIEPFLTDQW YVDAKTMAQP AIAAVRSGAT NFVPKNWEKT YYEWMENIQP WCISRQLWWG HQIPAWYGPD GRVFVAETEE EAVGNALGYY VEQEVLTPEQ AHDMAEDPLK REGFITRDED VLDTWFSSAL WPFSTLGWPD ETPELDRYYP TNVLVTGFDI IFFWVARMMM MGLHFKSDVP FPTVYIHALV RDEKGAKMSK SKGNVIDPLN LIDQYGADAL RFTLAAMAAQ GRDIKLASSR VEGYRNFATK LWNACRFAEM NGCAAPKEFD YTAAKETLNR WIAHEAVRAT REVTEAIESY RFNDAAGAAY RFVWNVYCDW YLELAKPVLM GEESAAKAET RAMVAWARDE ILKLLHPFMP FITEELWAVT ASRDGLLVLA PWSRKSGMSD EQIAVMSASA ASDPMVGAAL LSIPAAAPDF TDDAAEAEIG WVVDLITAMR SVRAEMNITP ATLTPLVLVG ASAATRGRAE RWSDVIKRLA RVGEISFADS APQGAVQLLV RGDVAALPLK GLIDFAAEQA RLEKELGKAE ADIKRAEAKL ANEKFVANAA EEVVEEEKEK REAAVARKMK ILEALLRIKN AS
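Both sequences are displayed in space-separated blocks of 10 characters, so the spacing has to be removed before any length or composition check. The following is a minimal Python sketch, not part of the original record; the helper names are hypothetical, and the example string is just the first two blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display spacing and line breaks, and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (display spacing is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


example = "ATGCGCGCGC TCGCCCGCCC"  # first two blocks of the gene sequence
print(len(clean_sequence(example)))  # 20
print(gc_fraction(example))          # 0.85
```

Applied to the full gene sequence, `gc_fraction` gives a value to compare with the GC content field, and `len(clean_sequence(...))` gives lengths to compare with the Gene Length and Protein Length fields.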