Gene Rsph17025_3087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3087 
SymbolvalS 
ID5083174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp3160028 
End bp3162991 
Gene Length2964 bp 
Protein Length987 aa 
Translation table11 
GC content68% 
IMG OID640484659 
Productvalyl-tRNA synthetase 
Protein accessionYP_001169276 
Protein GI146279117 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.705018 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.448197 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATGG ACAAGACCTT CAACGCCGCC GAGGCCGAGG CCCGGCTTTA CGACACGTGG 
GAGAAGGCGG GCGCCTTCCG CGCGGGCGCC AATGCCTCGC GGCCCGAGAC CTTCTGCATC
ATGATCCCGC CGCCGAACGT GACGGGGTCG CTGCACATGG GTCACGCCTT CAACAACACG
CTGCAGGACA TCCTGACGCG CTGGCACCGG ATGCGTGGCT TCGACACGCT CTGGCAACCG
GGGCAGGACC ATGCCGGCAT CGCCACCCAG ATGGTGGTCG AACGGGAACT CGCGAAAGCC
GGCCAGCCGG GCCGCCGCGA GATGGGCCGT GAGGCGTTCC TGTCGAAGGT CTGGGAGTGG
AAGGAGCAGT CGGGCGGCAC GATCGTCAAC CAGCTCAAGC GCCTCGGCGC CTCGTGCGAC
TGGTCGCGCA ACGCCTTCAC GATGGATGCC AACTTCCAGC GCGCCGTGCT GAAGGTGTTC
GTGGACCTCT ACGAAAAGGG CTTCATCTAC CGCGGCAAGC GGCTGGTGAA CTGGGATCCC
CACTTCGAGA CGGCGATTTC GGATCTGGAA GTCGAGCAGG TCGAGGTGAA CGGCAGCATG
TGGCGCCTGC GCTACCCGCT GGCCGATGGC GCGACCTACC GCCACCCGGT CGCCTTCGAC
GAGGAGGGCC GCGCGACCGA GTGGGAGGAG CGCGACTATC TGACCGTCGC CACCACCCGC
CCCGAGACGA TGCTGGGCGA CACCGGCATC GCGGTGAACC CCGCCGACGA ACGCTACGCC
CACCTGATCG GCAAGGAGGT CATCCTGCCG CTGGTCGGCC GCCGCATCCC GATCGTGGCT
GATGAGTATG CCGACCCGAC CAAGGGCACC GGCGCGGTAA AGATCACGCC CGCGCACGAC
TTCAACGACT GGGGCGTGGG CCAGCGCACC GGGCTGCGCG CGATCAACGT GATGAGCGGC
CGCGCGGCCA TGTTCCTCGC CGAGAACGCC GACTTCCTCG AAGGCTGCAC GCCGTCCGAG
GCGGCGCTGG CGCTGGATGG CCTCGACCGT TACGAGGCGC GCAAGCGCAT CGTGGCGCTG
GCCGAGGAGC AGGGCTGGCT CGACGGCATC GACCAGGACA AGCACATGGT CCCGCACGGC
GACCGATCGA AGGTCGCGAT CGAGCCGATG CTGACCGACC AGTGGTTCGT GGACACGGCC
AAGATCGTCC AGCCCGCCAT CGACGCCGTG CGCGACGGCC GGACCGAGAT CCTGCCCGAG
CGCGACGCCA AGACCTACTT CCACTGGCTC GAGAACATCG AGCCCTGGTG CATCTCGCGC
CAGCTCTGGT GGGGCCACCA GATCCCGGTC TGGTATGGCC TCGACATCTG GCCCGCGCGC
TTCGACGACG ACGGCGACGA CACGCTCGAC GAGGTCGAGA TCTTCGAGTT GCTGGAAGAC
GGCGCCTTCA ACCATGCCGA GCCTACGCAT CATTGCGCCT TCGACTTCGA CGCGGTGTCG
GAGAAGTTCC TCGACGATCT CGCCTCGCTG CCGGCGCCGC TGAACCATGC GCGCGTGGTC
GAGGTGGCAA ACCGCGCCGA GGCCGTCGAC CGGCTGGCGC AGGCGCTCGC CGACTACAAC
GTCAGCGAGG ACCCGACCCA TCTCGTCTAT CCGGTCTGGC GCGACCCGGA CGTGCTCGAC
ACCTGGTTCT CCTCCGGCCT CTGGCCGATC GGCACGCTGG GCTGGCCCGA GGATACGGAC
GAGCTTCGCC GCTATTTCCC GACCAACGTG CTGATCACCG GCTTTGACAT CATCTTCTTC
TGGGTCGCCC GGATGATGAT GATGCAGCTT GCCGTGGTGA ACGAGGTCCC CTTCAAGACC
GTCTATGTCC ACGCCCTCGT GCGGGACGAG AAGGGCAAGA AGATGTCGAA GTCGCTCGGC
AACGTGCTTG ACCCGCTGGA ACTGATCGAC GAGTTCGGGG CCGATGCCGT GCGCTTCACC
CTGACGGCGA TGGCCGCCAT GGGCCGCGAC CTCAAGCTGT CCACCGCGCG CATCCAGGGC
TATCGCAACT TCGGCACCAA GCTCTGGAAC GCCTGCCGCT TCGCCGAGAT GAACGGCGTG
TGGGAGGGTC ACGCAACCCA GGCCGCCCCG CCGCAGGCCA CCGCCACCGT GAACCGCTGG
ATCATCGGCG AAACGGGGCG CGTGCGCGAA GACGTGGACG CGGCACTGGC GGCCTATCGG
TTCGACGTGG CGGCCAATGC GCTTTATGCC TTCGTCTGGG GCAAGGTCTG CGACTGGTAT
GTCGAGTTTT CCAAGCCGCT CTTCGACACC GAGGCGGCCG CCGAGACACG CGCCACGATG
GCCTGGGTGC TCGACCAGTG CATGATCCTG CTGCACCCGA TCATGCCCTT CGTGACCGAG
GAGCTGTGGG CCACCACCGG CAGCCGCTCG AAGATGCTGG TCCACACCGA CTGGCCGTCC
TTCGGCGCCG AACTGGTCGA TCCCGCGGCC GATCGCGAGA TGAGCTGGGT GATCTCGCTC
ATCGAAGAGA TCCGCTCGGC CCGGGCCCAG GTGCGCGTGC CGGCGGGGCT CAAGCTGCCG
GTGCTGCAAC TGGCGCTGGA CGAGGCCGGA CGTGCGGCCC TGGCCCGCAA CGAGGCCCTC
ATCCTGCGCC TCGCCCGCCT CGAGGGTTTC ACCGAGGCCG CGACGGCACC CAAAGGGGCC
CTGACCATTG CCGTCGAAGG CGGCAGCTTC GCCATCCCGC TCGAGGGCGT GATCGACATC
GCCGCCGAAC GCACCCGCCT CGCCAGGACG CTCGAAAAGC TCGAGAAGGA TCTCGGCGGC
CTGCGTGGGC GCCTCAACAA TCCCGCCTTC GTCGCCTCCG CGCCGGAAGA GGTGGTGGAC
GAGGCCCGCA CCCGGCTTGA ACAGGGCGAG GAAGAGGCGG CGAAACTGTC CGCGGCCCTC
GCCCGCCTGT CCGAAATCGA CTGA
 
Protein sequence
MPMDKTFNAA EAEARLYDTW EKAGAFRAGA NASRPETFCI MIPPPNVTGS LHMGHAFNNT 
LQDILTRWHR MRGFDTLWQP GQDHAGIATQ MVVERELAKA GQPGRREMGR EAFLSKVWEW
KEQSGGTIVN QLKRLGASCD WSRNAFTMDA NFQRAVLKVF VDLYEKGFIY RGKRLVNWDP
HFETAISDLE VEQVEVNGSM WRLRYPLADG ATYRHPVAFD EEGRATEWEE RDYLTVATTR
PETMLGDTGI AVNPADERYA HLIGKEVILP LVGRRIPIVA DEYADPTKGT GAVKITPAHD
FNDWGVGQRT GLRAINVMSG RAAMFLAENA DFLEGCTPSE AALALDGLDR YEARKRIVAL
AEEQGWLDGI DQDKHMVPHG DRSKVAIEPM LTDQWFVDTA KIVQPAIDAV RDGRTEILPE
RDAKTYFHWL ENIEPWCISR QLWWGHQIPV WYGLDIWPAR FDDDGDDTLD EVEIFELLED
GAFNHAEPTH HCAFDFDAVS EKFLDDLASL PAPLNHARVV EVANRAEAVD RLAQALADYN
VSEDPTHLVY PVWRDPDVLD TWFSSGLWPI GTLGWPEDTD ELRRYFPTNV LITGFDIIFF
WVARMMMMQL AVVNEVPFKT VYVHALVRDE KGKKMSKSLG NVLDPLELID EFGADAVRFT
LTAMAAMGRD LKLSTARIQG YRNFGTKLWN ACRFAEMNGV WEGHATQAAP PQATATVNRW
IIGETGRVRE DVDAALAAYR FDVAANALYA FVWGKVCDWY VEFSKPLFDT EAAAETRATM
AWVLDQCMIL LHPIMPFVTE ELWATTGSRS KMLVHTDWPS FGAELVDPAA DREMSWVISL
IEEIRSARAQ VRVPAGLKLP VLQLALDEAG RAALARNEAL ILRLARLEGF TEAATAPKGA
LTIAVEGGSF AIPLEGVIDI AAERTRLART LEKLEKDLGG LRGRLNNPAF VASAPEEVVD
EARTRLEQGE EEAAKLSAAL ARLSEID