Gene RSP_1989 details


Gene Information

Locus tag: RSP_1989
Symbol: valS
ID: 3719322
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007493
Strand:
Start bp: 583987
End bp: 586950
Gene Length: 2964 bp
Protein Length: 987 aa
Translation table: 11
GC content: 67%
IMG OID: 640070152
Product: valyl-tRNA synthetase
Protein accession: YP_352040
Protein GI: 77462536
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0525] Valyl-tRNA synthetase
TIGRFAM ID: [TIGR00422] valyl-tRNA synthetase
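
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the Start and End positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 583987
END_BP = 586950
GENE_LENGTH_BP = 2964
PROTEIN_LENGTH_AA = 987


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 586950 - 583987 + 1 = 2964 bp, and 2964 / 3 - 1 = 987 aa, which agrees with the Gene Length and Protein Length fields.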


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGATGG ACAAGACCTT CAACGCCGCC GAGGCCGAGG CCCGGCTCTA CGACGCCTGG 
GAGAAGGCAG GCGCCTTCCG CGCCGGGGCC AATGCCTCGC GTCCCGAGAC CTTCTGCATC
ATGATCCCGC CGCCGAACGT GACGGGCTCG CTCCACATGG GGCATGCGTT CAACAACACG
TTGCAGGACA TCCTGACCCG CTGGCACCGG ATGCGCGGCT TCGACACGCT CTGGCAGCCG
GGGCAGGACC ATGCCGGGAT CGCCACGCAG ATGGTGGTGG AGCGGGAGCT TGCGAAGTCC
GGCCAGCCGG GCCGCCGCGA GATGGGCCGC GAGGCCTTCC TCGAGAAGGT CTGGGAGTGG
AAGGAGCAGT CGGGCGGCAC CATCGTCAAC CAGCTGAAGC GGCTCGGCGC CTCCTGCGAC
TGGTCGCGCA ATGCCTTCAC CATGGACCCG AATTTCCAGC GCGCAGTGCT GAAGGTCTTC
GTGGACCTCT ATGAGAAGGG CTTCATCTAC CGCGGCAAGC GGCTGGTGAA CTGGGACCCC
CATTTCGAGA CCGCGATCTC GGACCTCGAG GTGGAGCAGG TAGAGGTCAA CGGCAACATG
TGGCGCCTGC GCTACCAGCT GGCCGATGGC GCGACCTACC GGCATCCGGT GGCCTTCGAC
GAGGAGGGCC GCCCGACAGA GTGGGAAGAG CGCGACTACC TGACCGTCGC CACCACCCGC
CCCGAGACCA TGCTGGGCGA CACCGGCATC GCGGTGAATC CGGCGGACGA GCGCTATGCC
CACCTGATCG GGAAAGAGGT GGTCCTGCCG CTGGTCGGCC GCCGCATCCC GATCGTGGCC
GACGACTATG CCGATCCCTC GAAGGGCACC GGCGCCGTGA AGATCACGCC CGCGCACGAT
TTCAACGACT GGGGGGTGGG TCAGCGCACC GGCCTCCGCG CCATCAACGT CATGTCCGGG
CGCGCGACGA TGTTCCTCAT CGAGAACCCG GACTTCACCG AGGGCTGTGC GCCCTCGGAA
GAGGCGCTGG CCCTCGACGG GCTCGACCGC TACGAGGCCC GCAAGCGCGT CGTGGCGCTG
GCCGAGGAGC AGGGCTGGCT CGACGGGATC GATCAGGACA GGCACATGGT GCCGCACGGC
GATCGCTCGA AGGTCGCCAT CGAGCCGATG CTGACCGATC AGTGGTTCGT GGATACGGCC
CAGATCGTCC AGCCCGCCAT CGATGCGGTG CGGACCGGCC GGACCGAGAT CCTGCCCGAG
CGCGACGCCA AAACCTATTT CCACTGGCTC GAGAACATTG AGCCCTGGTG CATCTCGCGC
CAGCTCTGGT GGGGTCACCA GATCCCGGTC TGGTACGGGC TCGACATCTG GCCTGCCCGC
TTCGAGGATG ACGGCGACGA CACGCTCGAC GAGGTCGAGA TCTTCGAGCT GCTCGAGGAC
GGCGCGTTCA ACCATGCCGA TCCCACGCAT CACTGCGCCT TCGACTTCGA GGGGGTGTCC
GAGAAGTTCC TCGACGATCT AGCCTCGCTG CCCCATCCGC TGAACAATGC GCGCGTGGTC
GAGGTGGCGA GCCGCGCCGA GGCCATCGAC CGGCTGGCAC AGGCGCTGGC CGACTACAAC
CTCAACGAGG ATCCGACCCA TCTGGTCTAC CCCGTCTGGC GCGATCCGGA CGTGCTCGAC
ACCTGGTTCT CGTCGGGGCT CTGGCCCATC GGCACGCTGG GCTGGCCCGA GGAGACGCCC
GAGCTCGCCC GCTACTTCCC GACGAACGTG CTTATCACCG GCTTCGACAT CATCTTCTTC
TGGGTCGCCC GGATGATGAT GATGCAGCTT GCCGTGGTGA ACGAGGTGCC CTTCAAGACC
GTCTATGTCC ATGCGCTCGT GCGGGACGAG AAGGGCAAGA AGATGTCGAA GTCGCTCGGC
AATGTGCTCG ACCCGCTGGA GCTGATCGAC GAATTCGGCG CGGATGCCGT GCGCTTCACC
CTGACGGCCA TGGCCGCGAT GGGACGCGAC CTGAAGCTTT CGACGGCGCG CATCCAGGGC
TACCGCAACT TCGGCACCAA GCTCTGGAAC GCTTGCCGCT TCGCCGAGAT GAACGGCGTC
TGGGAAGGCC ACGCCACGCA GGCCGCCCCG CCCGCCGCGA CCGCCACGGT CAACCGCTGG
ATCATCGGCG AGACGGGACG CGTGCGCGAG GAAGTGGATG CGGCGCTGGC GGCCTACCGG
TTCGATTCGG CGGCCAACGC CCTCTATGCC TTCGTCTGGG GCAAGGTCTG CGACTGGTAT
GTGGAATTCT CGAAGCCGCT CTTCGATACG GAGGCGGCCG CCGAGACCCG CGCCACCATG
GGCTGGGTGC TCGACCAGTG CATGGTCCTG CTCCACCCGA TCATGCCCTT CATCACGGAA
GACCTCTGGG CCACCACGGG CAGCCGGACC AAGATGCTGG TTCACTCCGA CTGGCCGTCC
TTCGGGGCCG AACTGGTCGA TCCCGCGGCG GATCGCGAGA TGAGCTGGGT GATCTCGCTC
ATCGAGGAGA TCCGCTCGGC CCGCGCTCAG GTCCATGTGC CGGCGGGGCT GAAACTACCG
GTCGTGCAGC TCGCGCTGGA TGCGGCCGGG CGCGAGGCGC TGGCGCGGAA CGAGGCGCTC
ATCCTGCGTC TGGCGCGGCT CGAGGGCTTC ACCGAGGCGG CGAGCGCGCC GAAGGGGGCG
CTCACCATCG CGGTCGAGGG CGGCAGCTTC GCCATTCCGC TCGAGGGCGT GATCGACATC
GGCGCCGAGA AGGCGCGCCT CGCCAAGACC CTCGAGAAGC TCGAGAAGGA CATGGCGGGC
CTCCGCGGGC GGCTCGGCAA CCCGAACTTC GTGGCCTCGG CCCCGGAAGA AGTGGTGGAC
GAGGCGCGCA CCCGGCTCGA ACAGGGGGAA GAGGAAGGGG CCAAGCTCTC GGCCGCCCTC
GCCCGGCTGT CCGAGATCGC CTGA
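
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. As a rough sketch (the helper names `clean_sequence` and `gc_percent` are hypothetical, not part of the record), the spacing can be stripped and the GC percentage computed like this:

```python
def clean_sequence(text: str) -> str:
    """Drop block spacing and line breaks, and upper-case the residues."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence, as printed in the record.
    fragment = clean_sequence("ATGCCGATGG ACAAGACCTT")
    print(len(fragment), gc_percent(fragment))
```

Running the same two functions over the full 2964 bp sequence gives a value that can be compared with the GC content field of the record.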
 
Protein sequence
MPMDKTFNAA EAEARLYDAW EKAGAFRAGA NASRPETFCI MIPPPNVTGS LHMGHAFNNT 
LQDILTRWHR MRGFDTLWQP GQDHAGIATQ MVVERELAKS GQPGRREMGR EAFLEKVWEW
KEQSGGTIVN QLKRLGASCD WSRNAFTMDP NFQRAVLKVF VDLYEKGFIY RGKRLVNWDP
HFETAISDLE VEQVEVNGNM WRLRYQLADG ATYRHPVAFD EEGRPTEWEE RDYLTVATTR
PETMLGDTGI AVNPADERYA HLIGKEVVLP LVGRRIPIVA DDYADPSKGT GAVKITPAHD
FNDWGVGQRT GLRAINVMSG RATMFLIENP DFTEGCAPSE EALALDGLDR YEARKRVVAL
AEEQGWLDGI DQDRHMVPHG DRSKVAIEPM LTDQWFVDTA QIVQPAIDAV RTGRTEILPE
RDAKTYFHWL ENIEPWCISR QLWWGHQIPV WYGLDIWPAR FEDDGDDTLD EVEIFELLED
GAFNHADPTH HCAFDFEGVS EKFLDDLASL PHPLNNARVV EVASRAEAID RLAQALADYN
LNEDPTHLVY PVWRDPDVLD TWFSSGLWPI GTLGWPEETP ELARYFPTNV LITGFDIIFF
WVARMMMMQL AVVNEVPFKT VYVHALVRDE KGKKMSKSLG NVLDPLELID EFGADAVRFT
LTAMAAMGRD LKLSTARIQG YRNFGTKLWN ACRFAEMNGV WEGHATQAAP PAATATVNRW
IIGETGRVRE EVDAALAAYR FDSAANALYA FVWGKVCDWY VEFSKPLFDT EAAAETRATM
GWVLDQCMVL LHPIMPFITE DLWATTGSRT KMLVHSDWPS FGAELVDPAA DREMSWVISL
IEEIRSARAQ VHVPAGLKLP VVQLALDAAG REALARNEAL ILRLARLEGF TEAASAPKGA
LTIAVEGGSF AIPLEGVIDI GAEKARLAKT LEKLEKDMAG LRGRLGNPNF VASAPEEVVD
EARTRLEQGE EEGAKLSAAL ARLSEIA
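
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions, not the method used to generate this record: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), and it ignores alternative start codons because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, listed in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace (block spacing, line breaks) is removed first. A trailing
    partial codon is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence: matches the first 10 residues above.
    print(translate("ATGCCGATGG ACAAGACCTT CAACGCCGCC"))
    # Final line of the gene sequence (in frame): matches the last 7 residues.
    print(translate("GCCCGGCTGT CCGAGATCGC CTGA"))
```

The first call yields MPMDKTFNAA and the second ARLSEIA, which match the start and end of the protein sequence listed above.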