Gene Rpal_2853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2853 
SymbolvalS 
ID6410522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp3111615 
End bp3114488 
Gene Length2874 bp 
Protein Length957 aa 
Translation table11 
GC content66% 
IMG OID642712733 
Productvalyl-tRNA synthetase 
Protein accessionYP_001991836 
Protein GI192291231 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.250427 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGAGA AAACCTACCA GCCAGCCGAC ATCGAGGCCC GCATTTCGCG CGCCTGGGAA 
GACGCCGAGG CCTTCAAGGC CGGCCGTCCG GAGCGCCGCG ACGCCGTGCC ATACTCGATC
GTCATTCCGC CGCCGAACGT CACCGGCTCG CTGCATATGG GCCACGCGCT CAACAATACG
CTGCAGGACA TCCTGTGCCG GTTCGAGCGG ATGCGTGGCC GCGACGTGCT GTGGCAGCCC
GGCACCGACC ACGCCGGCAT CGCCACCCAA ATGGTGGTCG AACGCCAGTT GATGGAGCGG
CAGGAGCCGA GCCGGCGCGA CATGGGCCGC GCCAAGTTCC TGGGGCGCGT TTGGCAGTGG
AAGGCGGAGA GCGGCGGCGT CATCGTCAAC CAGCTCAAGC GGCTCGGTGC GTCGTGCGAC
TGGTCGCGCG AACGCTTCAC CATGGACGAG GGGCTGTCCC GCGCCGTCGC CAAGGTGTTC
GTCGAGCTGC ACCGCCAGGG CCTGATCTAC AAGGACAAGC GGCTGGTCAA TTGGGACCCG
AAGCTGCTGA CCGCGATCTC GGATCTGGAA GTTCAGCAGA TCGAGGTGAA GGGCAATCTC
TGGCATCTGC GCTATCCGAT CGAAGGCAAG ACCTTCGATC CGGCCGATCC GTCGAGCTTC
ATCGTCGTCG CGACCACGCG TCCCGAAACC ATGCTCGGCG ACACTGCGGT CGCGGTGAAT
CCCGAAGACG AGCGCTATAC GCATCTGGTC GGCAAGCACG TCATCCTGCC GCTGGTCGGC
CGGCGGATTC CGATCGTCGC CGACGAATAC TCCGACCCCG AGAAGGGATC GGGCGCGGTG
AAGATCACGC CGGCGCACGA CTTCAACGAC TTCGAGGTCG GCAAGCGCCA TTATTTGCCG
CAGATCAATG TGCTCGATAT CGAGGGCAAG ATCTCGGTCG CCGACAACAG TGCCTATCTC
GAAGGTCTGC CGGAAGGCGC GCGCGAATTC GCCGGGGAGA TCGACGGCAC CGACCGCTTC
GTCGCCCGCA AGATCATCGT GGCGCGGCTG GACGATTTCG GCTTCCTGGA GAAGATCGAG
CCGAACGTGC ACATGGTGCC GCACGGCGAC CGCTCCGGCG TGGTGATCGA GCCGTTCCTC
ACCGACCAGT GGTACGTCGA CGCCAAGACG CTGGCGCAGC CGGCGATCGC CGCCGTGCGC
TCGGGCGAGA CGACCTTCGT GCCCAAGAAC TGGGAGAAGA CCTACTTCGA GTGGATGGAA
AACATCCAGC CGTGGTGCAT CTCGCGCCAG CTGTGGTGGG GTCACCAGAT CCCGGCGTGG
TATGGCCCGG ACGGCAAGGT GTTCGTCGCC GAGACCGAGG AAGAGGCGGT CGGCAACGCG
CTCGGCTATT ACGTCGAGCA GGAAGTGATC ACGCCTGCGC AGGCGCACGA CATGGCGGAA
GATCCCGCCA AGCGTGAGGG CTTCATCACC CGTGACGAGG ACGTGCTCGA CACCTGGTTC
TCGTCGGCGC TGTGGCCGTT CTCGACGCTC GGCTGGCCGG ACGAGACGCC GGAGCTCGAC
CGTTACTACC CGACCAACGT GCTGGTCACC GGCTTCGACA TCATCTTCTT CTGGGTCGCC
CGGATGATGA TGATGGGCCT GCACTTCATG GACGACGTGC CGTTCCCGAC CGTCTACATC
CACGCGCTCG TCCGCGACGA GAAGGGCGCC AAGATGTCGA AGTCGAAGGG CAACGTCATC
GATCCGCTCA ACCTGATCGA CGAATACGGC GCCGACGCGC TGCGCTTCAC GCTGGCTGCG
ATGGCGGCGC AGGGCCGCGA CATCAAGCTC GCGACCAGCC GCGTCGAAGG CTATCGCAAC
TTCGCCACCA AACTCTGGAA CGCCTGCCGC TTCGCCGAGA TGAACGGCTG CGTCGCGCCG
GTCGGGTTCG ACTACACCGC GGCCAAGGAA ACGCTGAACC GCTGGATCGC GCACGAGACG
GTGCGGGCGG TGCGTGAGGT GACCGAAGCG ATCGAATCCT ATCGCTTCAA CGACGCCGCC
GAGGCTGCGT ATCGCTTCGT CTGGAACGTG TATTGCGACT GGTATCTCGA ACTCGCCAAG
CCGGTGCTGA TGGGCGAGGA GGGCGCTGCC AAGACCGAGA CCCGTGCGAT GGTGGCGTGG
GCGCGCGACG AGATCCTGAA GATCCTGCAT CCCTTCATGC CGTTCATCAC CGAAGAGCTG
TGGGCGGTGA CGGCTCCGCG CGACGGACTG CTGGCGCTGG CGCCGTGGTC GCGCAAGGCG
GGTATCTCGG ACGAAGAGGT GTCGGTGCTG GCGGCCTCCG CCGCGACCGA CCCGATGGCC
GGGCCGGCGA TGCTGGCGAT TCCGGAGCCG CAGGAGCCGG ACTTCACCGA CGATGCCGCT
GAAGCGGAAA TCGGCTGGGT GGTCGATCTC GTCACTGCGA TCCGCTCGGT GCGCGCCGAA
ATGAACATCG TGCCCTCGAC CCTCACGCCG CTGGTGCTGG CCGGCGCTTC TGCCGACACC
AATGCGCGGG CGAGCCGCTG GAGCGACGTG ATCAAGCGGC TGGCTCGGGT CGGCGAGATC
TCGTTCGCTG ACGCCGCTCC GCAAGGCGCC GTGCAGCTCC TGGTGCGCGG CGAGGTCGCG
GCGCTGCCGC TGAAGGGCGT GGTCGATTTC GCCGCCGAGC AGGCACGGCT CGAGAAGGAG
CTCGGCAAGG CCGAAGCCGA CATCAAGCGC GCCGAGGCCA AGCTGGCGAA CGAGAAGTTC
GTCGCCAACG CCGCCGAAGA AGTCGTCGAG GAAGAGCGCG AAAAGCGCGA GGCTGCGGTC
GCGCGCAAGG TCAAGATCCT CGAGGCGCTG CTTCGGCTGA AGAACGCGAG CTGA
 
Protein sequence
MIEKTYQPAD IEARISRAWE DAEAFKAGRP ERRDAVPYSI VIPPPNVTGS LHMGHALNNT 
LQDILCRFER MRGRDVLWQP GTDHAGIATQ MVVERQLMER QEPSRRDMGR AKFLGRVWQW
KAESGGVIVN QLKRLGASCD WSRERFTMDE GLSRAVAKVF VELHRQGLIY KDKRLVNWDP
KLLTAISDLE VQQIEVKGNL WHLRYPIEGK TFDPADPSSF IVVATTRPET MLGDTAVAVN
PEDERYTHLV GKHVILPLVG RRIPIVADEY SDPEKGSGAV KITPAHDFND FEVGKRHYLP
QINVLDIEGK ISVADNSAYL EGLPEGAREF AGEIDGTDRF VARKIIVARL DDFGFLEKIE
PNVHMVPHGD RSGVVIEPFL TDQWYVDAKT LAQPAIAAVR SGETTFVPKN WEKTYFEWME
NIQPWCISRQ LWWGHQIPAW YGPDGKVFVA ETEEEAVGNA LGYYVEQEVI TPAQAHDMAE
DPAKREGFIT RDEDVLDTWF SSALWPFSTL GWPDETPELD RYYPTNVLVT GFDIIFFWVA
RMMMMGLHFM DDVPFPTVYI HALVRDEKGA KMSKSKGNVI DPLNLIDEYG ADALRFTLAA
MAAQGRDIKL ATSRVEGYRN FATKLWNACR FAEMNGCVAP VGFDYTAAKE TLNRWIAHET
VRAVREVTEA IESYRFNDAA EAAYRFVWNV YCDWYLELAK PVLMGEEGAA KTETRAMVAW
ARDEILKILH PFMPFITEEL WAVTAPRDGL LALAPWSRKA GISDEEVSVL AASAATDPMA
GPAMLAIPEP QEPDFTDDAA EAEIGWVVDL VTAIRSVRAE MNIVPSTLTP LVLAGASADT
NARASRWSDV IKRLARVGEI SFADAAPQGA VQLLVRGEVA ALPLKGVVDF AAEQARLEKE
LGKAEADIKR AEAKLANEKF VANAAEEVVE EEREKREAAV ARKVKILEAL LRLKNAS