Gene RPB_2892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2892 
SymbolvalS 
ID3910686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3291322 
End bp3294240 
Gene Length2919 bp 
Protein Length972 aa 
Translation table11 
GC content66% 
IMG OID637884793 
Productvalyl-tRNA synthetase 
Protein accessionYP_486505 
Protein GI86750009 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCGC TCGCCCGCCC GGCAAGCCCC CATTTTGATT GCATGGCCAT GATCGAGAAA 
ACCTACCAGC CCGCCGACAT CGAAAGCCGC ATCGCGCGCG CCTGGGAAGA CGCCGAGGCG
TTCAAGGCCG GCCGCGCCGA CCGCCGCGAC GCCGAGCCGT ATTCGATCGT GATCCCGCCG
CCGAACGTCA CCGGCTCGCT GCATATGGGC CACGCGCTCA ACAATACGCT CCAGGACATC
CTGTGCCGGT TCGAGCGGAT GCGCGGCCGC GACGTGCTGT GGCAGCCCGG CACCGACCAC
GCCGGCATCG CCACCCAGAT GGTCGTCGAG CGGCAACTGA TGGAGCGGCA GGAGCCGGGC
CGGCGTGAGA TGGGCCGGGC GAAATTCCTG GAGCGGGTGT GGCAGTGGAA GGCCGAGAGC
GGTGGCGTCA TCGTCAACCA GCTCAAGCGG CTCGGCGCCT CCTGCGACTG GTCGCGCGAG
CGCTTCACCA TGGACGAGGG CCTGTCCCGC GCGGTGGTGA AGGTGTTCGT CGAACTGCAC
CGGCAGGGGC TGATCTACAA GGACAAGCGG CTGGTCAATT GGGACCCGAA GCTGCTCACC
GCGATCTCCG ATCTCGAAGT GCAGCAGATC GAAGTGAAGG GCCATCTCTG GCACCTGCGC
TACCCGCTCG AAGGCGTGCC GTTCGATCCC GAGAATCCGT CGAGCTACAT CGTCGTCGCC
ACCACGCGGC CGGAAACCAT GCTCGGCGAT ACCGCGGTCG CGGTGAATCC GGACGATGAT
CGCTATGTCG ATCTGATCGG CAAGCACGTC GTCCTGCCGC TGGTCGGCCG GCGGATTCCG
ATCGTCGCCG ACGAATATTC CGATCCCGAG AAGGGCTCCG GCGCGGTGAA GATCACGCCG
GCGCACGACT TCAACGATTT CGAGGTCGGC CGCCGGCACC ACCTGCCGCA GATCAACGTG
CTCGACATCG AGGGCCGCAT CGCGGTCGCC GACAACAGCG CCTATCTCGA CGGCCTGCCG
GAAGGCGCGC GCGACTTCGC CGCCGAGATG GAAGGCGTCG ATCGATTCGT CGCGCGAAAG
ATCATCGTGC AGCGGCTCGA CGATTTCGGC TTCCTGGAGA AGATCGAGCC CAACGTCCAC
ATGGTGCCGC ATGGCGACCG ATCGGGCGTG GTGATCGAGC CGTTCCTGAC CGACCAATGG
TACGTCGACG CCAAGACCAT GGCGCAGCCG GCGATCGCCG CGGTGCGCTC GGGTGCGACG
AATTTCGTGC CGAAGAACTG GGAGAAGACC TACTACGAGT GGATGGAGAA CATCCAGCCC
TGGTGCATTT CACGGCAATT GTGGTGGGGC CATCAGATCC CGGCCTGGTA CGGGCCGGAC
GGCCGTGTGT TCGTCGCCGA GACCGAGGAA GAGGCGGTGG GCAATGCGCT CGGCTATTAT
GTCGAGCAGG AGGTGCTGAC ACCCGAGCAG GCGCACGACA TGGCGGAAGA TCCGCTCAAG
CGCGAGGGCT TCATCACCCG CGATGAAGAC GTGCTCGACA CCTGGTTCTC CTCGGCGCTG
TGGCCGTTCT CGACGCTGGG CTGGCCGGAC GAGACGCCGG AGCTCGATCG CTACTACCCG
ACCAACGTGC TGGTCACCGG CTTCGACATC ATCTTCTTCT GGGTCGCCCG GATGATGATG
ATGGGGCTGC ACTTCAAGAG CGATGTGCCG TTCCCGACGG TCTACATCCA CGCCCTCGTC
CGCGATGAGA AGGGCGCCAA GATGTCGAAG TCGAAGGGCA ACGTCATCGA TCCGCTCAAC
CTGATCGACC AATACGGCGC CGACGCGCTG CGCTTCACAC TGGCGGCGAT GGCGGCGCAG
GGCCGCGACA TCAAGCTGGC GTCGAGCCGC GTCGAAGGCT ATCGCAACTT CGCCACCAAG
CTGTGGAACG CCTGCCGCTT CGCCGAAATG AACGGCTGCG CGGCGCCGAA GGAATTCGAC
TACACCGCGG CGAAGGAAAC GCTGAACCGC TGGATCGCGC ACGAGGCGGT GCGCGCGACC
CGCGAGGTCA CCGAGGCGAT CGAGAGCTAT CGCTTCAACG ACGCGGCGGG CGCGGCCTAT
CGCTTCGTCT GGAACGTGTA TTGCGACTGG TATCTCGAAC TCGCCAAACC CGTGCTGATG
GGCGAGGAGA GCGCCGCCAA GGCCGAGACC CGCGCGATGG TGGCGTGGGC GCGCGACGAG
ATCCTCAAAC TGCTGCATCC GTTCATGCCG TTCATCACCG AGGAGCTGTG GGCGGTGACG
GCGTCGCGTG ACGGCCTGCT GGTGCTGGCG CCGTGGTCGC GCAAATCCGG GATGAGTGAC
GAGCAGATCG CGGTGATGAG CGCGTCGGCG GCGAGCGATC CGATGGTCGG GGCGGCGCTG
CTCAGCATTC CGGCCGCTGC GCCGGACTTC ACCGACGACG CGGCCGAAGC CGAGATCGGC
TGGGTGGTCG ATCTGATCAC CGCGATGCGC TCGGTGCGCG CCGAGATGAA CATCACGCCG
GCGACGCTAA CGCCGCTGGT GCTGGTCGGC GCCTCCGCCG CCACGCGCGG CCGCGCCGAG
CGCTGGAGCG ACGTGATCAA GCGGCTGGCG CGCGTCGGCG AGATTTCGTT CGCCGACAGC
GCACCGCAGG GCGCCGTGCA GTTGCTGGTG CGCGGCGACG TCGCCGCCTT GCCGCTGAAA
GGGCTGATCG ACTTCGCCGC CGAGCAGGCG CGGCTGGAGA AGGAGCTCGG CAAGGCCGAA
GCCGACATCA AGCGCGCGGA AGCCAAGCTG GCCAACGAGA AATTCGTCGC CAACGCCGCC
GAAGAAGTCG TCGAGGAAGA GAAGGAAAAG CGCGAGGCCG CGGTCGCCCG CAAGATGAAG
ATCCTCGAAG CGCTGCTGCG GATCAAGAAC GCGTCGTAA
 
Protein sequence
MRALARPASP HFDCMAMIEK TYQPADIESR IARAWEDAEA FKAGRADRRD AEPYSIVIPP 
PNVTGSLHMG HALNNTLQDI LCRFERMRGR DVLWQPGTDH AGIATQMVVE RQLMERQEPG
RREMGRAKFL ERVWQWKAES GGVIVNQLKR LGASCDWSRE RFTMDEGLSR AVVKVFVELH
RQGLIYKDKR LVNWDPKLLT AISDLEVQQI EVKGHLWHLR YPLEGVPFDP ENPSSYIVVA
TTRPETMLGD TAVAVNPDDD RYVDLIGKHV VLPLVGRRIP IVADEYSDPE KGSGAVKITP
AHDFNDFEVG RRHHLPQINV LDIEGRIAVA DNSAYLDGLP EGARDFAAEM EGVDRFVARK
IIVQRLDDFG FLEKIEPNVH MVPHGDRSGV VIEPFLTDQW YVDAKTMAQP AIAAVRSGAT
NFVPKNWEKT YYEWMENIQP WCISRQLWWG HQIPAWYGPD GRVFVAETEE EAVGNALGYY
VEQEVLTPEQ AHDMAEDPLK REGFITRDED VLDTWFSSAL WPFSTLGWPD ETPELDRYYP
TNVLVTGFDI IFFWVARMMM MGLHFKSDVP FPTVYIHALV RDEKGAKMSK SKGNVIDPLN
LIDQYGADAL RFTLAAMAAQ GRDIKLASSR VEGYRNFATK LWNACRFAEM NGCAAPKEFD
YTAAKETLNR WIAHEAVRAT REVTEAIESY RFNDAAGAAY RFVWNVYCDW YLELAKPVLM
GEESAAKAET RAMVAWARDE ILKLLHPFMP FITEELWAVT ASRDGLLVLA PWSRKSGMSD
EQIAVMSASA ASDPMVGAAL LSIPAAAPDF TDDAAEAEIG WVVDLITAMR SVRAEMNITP
ATLTPLVLVG ASAATRGRAE RWSDVIKRLA RVGEISFADS APQGAVQLLV RGDVAALPLK
GLIDFAAEQA RLEKELGKAE ADIKRAEAKL ANEKFVANAA EEVVEEEKEK REAAVARKMK
ILEALLRIKN AS