Gene Rsph17029_0699 details

Gene Information

Locus tag: Rsph17029_0699
Symbol: valS
ID: 4895341
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 705356
End bp: 708319
Gene Length: 2964 bp
Protein Length: 987 aa
Translation table: 11
GC content: 68%
IMG OID: 640111283
Product: valyl-tRNA synthetase
Protein accession: YP_001042584
Protein GI: 126461470
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0525] Valyl-tRNA synthetase
TIGRFAM ID: [TIGR00422] valyl-tRNA synthetase
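
The coordinates and lengths listed above are consistent with each other, which a little arithmetic can confirm. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(705356, 708319)
residues = protein_length_aa(length)
print(length, residues)  # prints: 2964 987
```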


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGATGG ACAAGACCTT CAACGCCGCC GAGGCCGAGG CCCGGCTCTA CGACGCCTGG 
GAGAAGGCAG GCGCCTTCCG CGCCGGGGCC AATGCCTCGC GTCCCGAGAC CTTCTGCATC
ATGATCCCGC CGCCGAACGT GACGGGCTCG CTCCACATGG GGCATGCCTT CAACAACACG
TTGCAGGATA TCCTGACCCG CTGGCACCGG ATGCGCGGCT TCGACACGCT CTGGCAGCCG
GGGCAGGACC ACGCCGGGAT CGCCACGCAG ATGGTGGTGG AGCGCGAACT GGCCAGGGCC
GGGAACCCGG GCCGCCGCGA GATGGGCCGC GAGGCCTTCC TCGAGAAGGT CTGGGAGTGG
AAGGAACAGT CGGGCGGCAC CATCGTCAAC CAGCTGAAGC GGCTCGGCGC CTCCTGCGAC
TGGTCGCGCA ACGCCTTCAC CATGGACCCG AATTTCCAGC GCGCGGTGCT GAAGGTCTTC
GTGGATCTCT ATGAGAAGGG CTTCATCTAC CGCGGCAAGC GGCTGGTGAA CTGGGACCCC
CATTTCGAGA CCGCGATCTC GGACCTCGAG GTGGAGCAGG TCGAGGTCAA CGGCAACATG
TGGCGCCTGC GCTACCAGCT GGCCGATGGC GCGACCTACC GGCATCCGGT GGCCTTCGAC
GAGGAGGGCC GCCCGACCGA GTGGGAAGAG CGCGACTATC TGACCGTCGC CACCACCCGC
CCCGAGACCA TGCTGGGCGA CACCGGTATC GCGGTGAACC CCTCGGACGA ACGCTATGCC
CATCTGATCG GGAAAGAGGT GGTCCTGCCG CTGGTCGGCC GCCGCATCCC GATCGTGGCC
GACGACTATG CCGATCCCTC GAAGGGCACC GGCGCCGTGA AGATCACGCC CGCGCACGAT
TTCAACGACT GGGGGGTGGG TCAGCGCACC GGCCTCCGCG CGATCAACGT CATGTCCGGG
CGTGCGACGA TGTTCCTCAT CGAGAACCCG GACTTCACCG AGGGCTGTGC GCCCTCGGAA
GAGGCGCTGG CCCTCGACGG GCTCGACCGC TACGAGGCCC GCAAGCGCGT CGTGGCGCTG
GCCGAGGAGC AGGGCTGGCT CGACGGGATC GATCAGGACA GGCACATGGT GCCGCACGGC
GACCGCTCGA AGGTCGCCAT CGAGCCGATG CTGACCGACC AGTGGTTCGT GGATACGGCC
CAGATCGTCC AGCCCGCCAT CGACGCGGTG CGAACCGGCC GGACCGAGAT CCTGCCCGAG
CGCGACGCCA AGACCTATTT CCACTGGCTC GAGAACATCG AGCCTTGGTG CATCTCGCGC
CAGCTCTGGT GGGGCCACCA GATCCCGGTC TGGTACGGGC TCGACATCTG GCCTGCCCGC
TTCGAGGATG ACGGCGACGA CACGCTCGAC GAAGTCGAGA TCTTCGAGCT GCTCGAGGAC
GGCGCGTTCA ACCACGCCGA TCCCACGCAT CACTGCGCCT TCGACTTCGA GGGCGTGTCC
GAGAAGTTCC TCGACGATCT CGCCTCGCTG CCCCATCCGC TGAACAATGC GCGCGTAGTC
GAGGTGGCGA GCCGCGCCGA GGCCATCGAC CGGCTGGCGC AGGCGCTGGC CGACTACAAC
CTGAACGAGG ATCCGACCCA TCTGGTCTAC CCCGTCTGGC GCGATCCGGA CGTGCTCGAC
ACCTGGTTCT CGTCGGGACT CTGGCCCATC GGCACGCTGG GCTGGCCCGA GGAGACGCCC
GAGCTCGCCC GCTACTTCCC GACGAACGTG CTCATCACCG GCTTCGACAT CATCTTCTTC
TGGGTCGCCC GGATGATGAT GATGCAGCTC GCCGTGGTGA ACGAGGTGCC CTTCAAGACC
GTCTATGTCC ATGCGCTCGT GCGGGACGAG AAGGGCAAGA AGATGTCGAA GTCGCTCGGC
AACGTGCTCG ACCCGCTGGA GCTGATCGAC GAATTCGGCG CGGACGCCGT GCGCTTCACC
CTGACGGCCA TGGCCGCGAT GGGACGCGAC CTCAAGCTCT CGACGGCGCG CATCCAGGGC
TACCGCAACT TCGGCACCAA GCTCTGGAAC GCCTGCCGCT TCGCCGAGAT GAACGGCGTC
TGGGAGGGCC ACGGCACGCA GGCCGCCCCT CCCGCCGCGA CCGCCACGGT CAACCGCTGG
ATCATCGGCG AGACGGGACG CGTGCGCGAG GAAGTGGATG CGGCGCTGGC GGCCTACCGG
TTCGATTCGG CGGCCAACGC CCTCTATGCC TTCGTCTGGG GCAAGGTCTG CGACTGGTAT
GTGGAATTCT CGAAGCCGCT CTTCGATACG GAGGCGGCCG CCGAGACCCG CGCCACCATG
GGCTGGGTGC TCGACCAGTG CATGGTCCTG CTCCACCCGA TCATGCCCTT CATCACCGAA
GACCTCTGGG CCACCACGGG CAGCCGGACC AAGATGCTGG TTCACACCGA CTGGCCGTCC
TTCGGGGCCG AACTGGTCGA TCCCGCGGCC GACCGCGAGA TGAGCTGGGT GATCTCGCTC
ATCGAGGAGA TCCGCTCGGC CCGCGCTCAG GTCCATGTGC CGGCGGGGCT GAAACTGCCG
GTCGTGCAGC TCGCGCTGGA TGCGGCCGGG CGCGAGGCGC TGGCGCGGAA CGAGGCGCTC
ATCCTGCGTC TGGCGCGGCT CGAGGGCTTC ACCGAGGCGG CGAGCGCGCC GAAGGGGGCG
CTCACCATCG CGGTCGAGGG CGGCAGCTTC GCCATCCCGC TCGAAGGCGT GATCGACATC
GGCGCCGAGA AGGCGCGCCT CGCCAAGACC CTCGAGAAGC TCGAGAAGGA CATGGCGGGC
CTCCGCGGGC GGCTCGGCAA CCCGAACTTC GTGGCCTCGG CCCCGGAAGA GGTGGTGGAC
GAGGCGCGCA CCCGGCTCGA ACAGGGGGAA GAAGAAGGGG CCAAGCTCTC GGCCGCCCTC
GCCCGGCTGT CCGAGATTGC CTGA
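
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. As a minimal sketch, the helpers below (hypothetical names) strip the whitespace and compute the GC fraction, the quantity reported as "GC content" in the gene information. Only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above; paste the full sequence to
# compute the value for the whole gene.
first_line = "ATGCCGATGG ACAAGACCTT CAACGCCGCC GAGGCCGAGG CCCGGCTCTA CGACGCCTGG"
print(len(clean_sequence(first_line)))
print(f"{gc_fraction(first_line):.0%}")
```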
 
Protein sequence
MPMDKTFNAA EAEARLYDAW EKAGAFRAGA NASRPETFCI MIPPPNVTGS LHMGHAFNNT 
LQDILTRWHR MRGFDTLWQP GQDHAGIATQ MVVERELARA GNPGRREMGR EAFLEKVWEW
KEQSGGTIVN QLKRLGASCD WSRNAFTMDP NFQRAVLKVF VDLYEKGFIY RGKRLVNWDP
HFETAISDLE VEQVEVNGNM WRLRYQLADG ATYRHPVAFD EEGRPTEWEE RDYLTVATTR
PETMLGDTGI AVNPSDERYA HLIGKEVVLP LVGRRIPIVA DDYADPSKGT GAVKITPAHD
FNDWGVGQRT GLRAINVMSG RATMFLIENP DFTEGCAPSE EALALDGLDR YEARKRVVAL
AEEQGWLDGI DQDRHMVPHG DRSKVAIEPM LTDQWFVDTA QIVQPAIDAV RTGRTEILPE
RDAKTYFHWL ENIEPWCISR QLWWGHQIPV WYGLDIWPAR FEDDGDDTLD EVEIFELLED
GAFNHADPTH HCAFDFEGVS EKFLDDLASL PHPLNNARVV EVASRAEAID RLAQALADYN
LNEDPTHLVY PVWRDPDVLD TWFSSGLWPI GTLGWPEETP ELARYFPTNV LITGFDIIFF
WVARMMMMQL AVVNEVPFKT VYVHALVRDE KGKKMSKSLG NVLDPLELID EFGADAVRFT
LTAMAAMGRD LKLSTARIQG YRNFGTKLWN ACRFAEMNGV WEGHGTQAAP PAATATVNRW
IIGETGRVRE EVDAALAAYR FDSAANALYA FVWGKVCDWY VEFSKPLFDT EAAAETRATM
GWVLDQCMVL LHPIMPFITE DLWATTGSRT KMLVHTDWPS FGAELVDPAA DREMSWVISL
IEEIRSARAQ VHVPAGLKLP VVQLALDAAG REALARNEAL ILRLARLEGF TEAASAPKGA
LTIAVEGGSF AIPLEGVIDI GAEKARLAKT LEKLEKDMAG LRGRLGNPNF VASAPEEVVD
EARTRLEQGE EEGAKLSAAL ARLSEIA
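
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is one way to reproduce that by hand, assuming the standard codon assignments (translation table 11 uses the same assignments as the standard code for elongation and differs in which codons may act as starts, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate block-formatted DNA, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCCGATGG ACAAGACCTT CAACGCCGCC"))  # prints: MPMDKTFNAA
```

Passing the full gene sequence to `translate` should yield a string that can be compared with the protein sequence after removing its spaces.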