Gene RPD_2579 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2579 
SymbolvalS 
ID4023075 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2890251 
End bp2893157 
Gene Length2907 bp 
Protein Length968 aa 
Translation table11 
GC content64% 
IMG OID637962777 
Productvalyl-tRNA synthetase 
Protein accessionYP_569710 
Protein GI91977051 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0215604 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGCGT TCGCCTTTGG CGAAGCGCCC TTTTTTGATT GCATGGCCAT GATCGAGAAA 
ACCTACCAGC CCGCCGAGAT CGAAGGCCGC ATCGCGCGCG CCTGGGAGGA CGCCGACGCC
TTCAAGGCCG GACGCCCCGA CCGCCGCGAC GCCGAACCCT ATTCGATCGT GATCCCGCCG
CCGAACGTGA CCGGTTCGCT GCATATGGGA CACGCGCTCA ACAACACACT GCAGGACATT
CTGTGCCGGT TCGAGCGGAT GCGCGGCCGT GACGTGCTGT GGCAACCAGG CACGGACCAC
GCCGGCATCG CGACTCAGAT GGTCGTCGAG CGGCAGTTGA TGGAGCGGCA GGAGCCGGGC
CGTCGCGAGA TGGGCCGCGC CAAGTTTTTG GAGCGGGTCT GGCAGTGGAA GGCCGAGAGC
GGCGGCGTCA TCGTCAACCA GCTGAAGCGG CTCGGGGCCT CCTGCGACTG GTCGCGCGAG
CGCTTCACGA TGGACGAGGG GCTGTCCCGC GCCGTCGTCA AGGTGTTCGT CGAGCTGCAC
CGGCAGGGGC TGATCTATAA GGACAAGCGG CTGGTCAATT GGGACCCGAA GCTGCTCACC
GCGATCTCCG ATCTCGAAGT GCAGCAGATT GAGGTCAAGG GCAATCTCTG GCACCTGCGC
TATCCGATCG AGGGCGTCAG TTTCGATCCG GAGAATCCGG CGAGCTACAT CGTGGTGGCG
ACGACCCGGC CGGAGACGAT GCTCGGCGAC ACCGCGGTGG CGGTGAATCC GGACGATGAG
CGCTATGTCG ATCTGGTCGG CAAGCACGTC ATTCTGCCGC TGGTCGGCCG GCGGATTCCG
ATCGTCGCTG ACGAATATTC CGATCCGGAG AAGGGCTCCG GCGCGGTGAA GATCACGCCG
GCGCACGATT TCAACGACTT CGAGGTCGGA AAGCGGCATC ATTTGCCGCA GATCAATGTG
CTCGACATCG AGGGCAAGAT CGCGATCGCC GACAACAGCG CCTATCTCGA AGGCCTACCC
GAGGGCGCGC GGGAATTCGC CGAAGAGATC GATGGCACCG ACCGCTTCGC CGCGCGCAAA
CTGATCGTGG GGCGGCTCGA CGAATTCGGA TTCCTCGAAA AGATCGAGCC CAACGTCCAC
ATGGTTCCGC ATGGCGACCG CTCCGGCGTG GTGATCGAAC CCTATCTGAC CGACCAGTGG
TACGTCGACG CCAAGACGAT GGCGCAGCCG GCGATCGCCG CGGTGCGCTC CGGCGCCACC
ACCTTCGTGC CGAAGAACTG GGAGAAGACC TATTACGAGT GGATGGATAA CATCCAGCCG
TGGTGCATCT CGCGGCAATT GTGGTGGGGC CACCAAATTC CGGCCTGGTA CGGCCCCGAT
GGCAAGGTGT TCGTCGCCGA GACCGAGGAA GAGGCAATCG GCAACGCGCT CGGCTATTAC
GTCGAGCAGG AAGTCATCAC GCCCGCGCAG GCGCACGAGA TGACGCAGGA CGCCAGCAAG
CGTGACGGCT TCATCACCCG CGATGAAGAC GTGCTCGACA CCTGGTTCTC CTCGGCGCTG
TGGCCGTTCT CGACACTCGG CTGGCCCGAC GAGACGTCCG AACTCGAGCG CTACTACCCG
ACAAACGTTC TCGTCACCGG CTTCGACATC ATCTTCTTCT GGGTCGCCCG GATGATGATG
ATGGGCCTGC ATTTCAAGAA CGACGTGCCT TTCCCGACGG TCTACATCCA CGCCCTCGTC
CGCGACGAGA AGGGCGCCAA GATGTCGAAG TCGAAAGGCA ACGTCATCGA TCCGCTGCAT
CTGATCGACG ACTACGGCGC CGATGCGCTG CGCTTCACGC TGGCGGCGAT GGCGGCACAG
GGCCGTGACA TCAAACTGGC GTCGAGCCGG GTCGAGGGCT ATCGCAATTT CGCGACCAAG
CTATGGAATG CCTGCCGCTT CGCCGAGATG AACGCCTGCG CCGCGCCGAT CGGCTTCGAT
TACACGACGG CGAAAGAGAC CCTGAACCGC TGGATCGCGC ACGAGACCGT GCGGGCGGTC
CGCGACGTCA CCGAGGCGAT CGAGTCGTAT CGCTTCAACG ATGCAGCGGG CGCCGCCTAT
CGCTTCGTCT GGAACGTGTA TTGCGATTGG TATCTCGAAC TCGCCAAGCC CGTCTTGATG
GGCGAGGACA GCACGGCCAA GACCGAGACC CGGGCGATGG TGGCGTGGGC GCGCGACGAG
ATTCTGAAGC TGCTGCATCC GTTCATGCCG TTCATCACCG AGGAGTTGTG GGCGGTGACG
GGCCAGCGCG ACGGCCTTCT GGTGCTGGCG CCGTGGTCCC GCAAGGCCGA ACTCAGCGTC
GAGGTTTTTG CGAGTCCGGT GCTCACCGAC CCGATGGTGC CGCCGGTGAT CCTGCCGGCG
CGGCACGACG CCGAATTCAG CGATCCGGCT GCGGAAGCCG AGATCGGCTG GGTGGTCGAT
CTGGTGACCG CGATCCGCTC GGTGCGCGCC GAGATGAACA TCGTGCCGTC GACACTGACG
CCGTTGGTGC TGGCCGGCGC GTCTGCGGAG ACCCGCGCCC GTGCCGAACG CTGGAGTGAC
GTCATCAAGC GGCTGGCGCG GGTCGGCGAG ATTTCGTTCA CCGACGCCGC GCCGCAGGGC
GCGGTACAGC TCCTGGTGCG CGGCGAGGTG GCGGCGCTGC CGCTGAAGGG CGTGATCGAT
CTTGCCGCCG AGCAGGCGCG GCTGGACAAG GAACTGGCCA AGGCGGAAGC CGACATCAAG
CGCGTCGACG CCAAGCTCTC GAACGAGAAG TTCGTCGCCA ACGCGCCGGA GGAGATCGTC
GAGGAAGAGA AGGAAAAGCG CGAGGCCGCC GTCGCGCGCA AGGTCAAGAT CCTCGAAGCC
TTGCTGCGGC TGAAGAACGC GACGTAA
 
Protein sequence
MRAFAFGEAP FFDCMAMIEK TYQPAEIEGR IARAWEDADA FKAGRPDRRD AEPYSIVIPP 
PNVTGSLHMG HALNNTLQDI LCRFERMRGR DVLWQPGTDH AGIATQMVVE RQLMERQEPG
RREMGRAKFL ERVWQWKAES GGVIVNQLKR LGASCDWSRE RFTMDEGLSR AVVKVFVELH
RQGLIYKDKR LVNWDPKLLT AISDLEVQQI EVKGNLWHLR YPIEGVSFDP ENPASYIVVA
TTRPETMLGD TAVAVNPDDE RYVDLVGKHV ILPLVGRRIP IVADEYSDPE KGSGAVKITP
AHDFNDFEVG KRHHLPQINV LDIEGKIAIA DNSAYLEGLP EGAREFAEEI DGTDRFAARK
LIVGRLDEFG FLEKIEPNVH MVPHGDRSGV VIEPYLTDQW YVDAKTMAQP AIAAVRSGAT
TFVPKNWEKT YYEWMDNIQP WCISRQLWWG HQIPAWYGPD GKVFVAETEE EAIGNALGYY
VEQEVITPAQ AHEMTQDASK RDGFITRDED VLDTWFSSAL WPFSTLGWPD ETSELERYYP
TNVLVTGFDI IFFWVARMMM MGLHFKNDVP FPTVYIHALV RDEKGAKMSK SKGNVIDPLH
LIDDYGADAL RFTLAAMAAQ GRDIKLASSR VEGYRNFATK LWNACRFAEM NACAAPIGFD
YTTAKETLNR WIAHETVRAV RDVTEAIESY RFNDAAGAAY RFVWNVYCDW YLELAKPVLM
GEDSTAKTET RAMVAWARDE ILKLLHPFMP FITEELWAVT GQRDGLLVLA PWSRKAELSV
EVFASPVLTD PMVPPVILPA RHDAEFSDPA AEAEIGWVVD LVTAIRSVRA EMNIVPSTLT
PLVLAGASAE TRARAERWSD VIKRLARVGE ISFTDAAPQG AVQLLVRGEV AALPLKGVID
LAAEQARLDK ELAKAEADIK RVDAKLSNEK FVANAPEEIV EEEKEKREAA VARKVKILEA
LLRLKNAT