Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_0699 |
Symbol | valS |
ID | 4895341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 705356 |
End bp | 708319 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640111283 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001042584 |
Protein GI | 126461470 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGATGG ACAAGACCTT CAACGCCGCC GAGGCCGAGG CCCGGCTCTA CGACGCCTGG GAGAAGGCAG GCGCCTTCCG CGCCGGGGCC AATGCCTCGC GTCCCGAGAC CTTCTGCATC ATGATCCCGC CGCCGAACGT GACGGGCTCG CTCCACATGG GGCATGCCTT CAACAACACG TTGCAGGATA TCCTGACCCG CTGGCACCGG ATGCGCGGCT TCGACACGCT CTGGCAGCCG GGGCAGGACC ACGCCGGGAT CGCCACGCAG ATGGTGGTGG AGCGCGAACT GGCCAGGGCC GGGAACCCGG GCCGCCGCGA GATGGGCCGC GAGGCCTTCC TCGAGAAGGT CTGGGAGTGG AAGGAACAGT CGGGCGGCAC CATCGTCAAC CAGCTGAAGC GGCTCGGCGC CTCCTGCGAC TGGTCGCGCA ACGCCTTCAC CATGGACCCG AATTTCCAGC GCGCGGTGCT GAAGGTCTTC GTGGATCTCT ATGAGAAGGG CTTCATCTAC CGCGGCAAGC GGCTGGTGAA CTGGGACCCC CATTTCGAGA CCGCGATCTC GGACCTCGAG GTGGAGCAGG TCGAGGTCAA CGGCAACATG TGGCGCCTGC GCTACCAGCT GGCCGATGGC GCGACCTACC GGCATCCGGT GGCCTTCGAC GAGGAGGGCC GCCCGACCGA GTGGGAAGAG CGCGACTATC TGACCGTCGC CACCACCCGC CCCGAGACCA TGCTGGGCGA CACCGGTATC GCGGTGAACC CCTCGGACGA ACGCTATGCC CATCTGATCG GGAAAGAGGT GGTCCTGCCG CTGGTCGGCC GCCGCATCCC GATCGTGGCC GACGACTATG CCGATCCCTC GAAGGGCACC GGCGCCGTGA AGATCACGCC CGCGCACGAT TTCAACGACT GGGGGGTGGG TCAGCGCACC GGCCTCCGCG CGATCAACGT CATGTCCGGG CGTGCGACGA TGTTCCTCAT CGAGAACCCG GACTTCACCG AGGGCTGTGC GCCCTCGGAA GAGGCGCTGG CCCTCGACGG GCTCGACCGC TACGAGGCCC GCAAGCGCGT CGTGGCGCTG GCCGAGGAGC AGGGCTGGCT CGACGGGATC GATCAGGACA GGCACATGGT GCCGCACGGC GACCGCTCGA AGGTCGCCAT CGAGCCGATG CTGACCGACC AGTGGTTCGT GGATACGGCC CAGATCGTCC AGCCCGCCAT CGACGCGGTG CGAACCGGCC GGACCGAGAT CCTGCCCGAG CGCGACGCCA AGACCTATTT CCACTGGCTC GAGAACATCG AGCCTTGGTG CATCTCGCGC CAGCTCTGGT GGGGCCACCA GATCCCGGTC TGGTACGGGC TCGACATCTG GCCTGCCCGC TTCGAGGATG ACGGCGACGA CACGCTCGAC GAAGTCGAGA TCTTCGAGCT GCTCGAGGAC GGCGCGTTCA ACCACGCCGA TCCCACGCAT CACTGCGCCT TCGACTTCGA GGGCGTGTCC GAGAAGTTCC TCGACGATCT CGCCTCGCTG CCCCATCCGC TGAACAATGC GCGCGTAGTC GAGGTGGCGA GCCGCGCCGA GGCCATCGAC CGGCTGGCGC AGGCGCTGGC CGACTACAAC CTGAACGAGG ATCCGACCCA TCTGGTCTAC CCCGTCTGGC GCGATCCGGA CGTGCTCGAC ACCTGGTTCT CGTCGGGACT CTGGCCCATC GGCACGCTGG GCTGGCCCGA GGAGACGCCC GAGCTCGCCC GCTACTTCCC GACGAACGTG CTCATCACCG GCTTCGACAT CATCTTCTTC TGGGTCGCCC GGATGATGAT GATGCAGCTC GCCGTGGTGA ACGAGGTGCC CTTCAAGACC GTCTATGTCC ATGCGCTCGT GCGGGACGAG AAGGGCAAGA AGATGTCGAA GTCGCTCGGC AACGTGCTCG ACCCGCTGGA GCTGATCGAC GAATTCGGCG CGGACGCCGT GCGCTTCACC CTGACGGCCA TGGCCGCGAT GGGACGCGAC CTCAAGCTCT CGACGGCGCG CATCCAGGGC TACCGCAACT TCGGCACCAA GCTCTGGAAC GCCTGCCGCT TCGCCGAGAT GAACGGCGTC TGGGAGGGCC ACGGCACGCA GGCCGCCCCT CCCGCCGCGA CCGCCACGGT CAACCGCTGG ATCATCGGCG AGACGGGACG CGTGCGCGAG GAAGTGGATG CGGCGCTGGC GGCCTACCGG TTCGATTCGG CGGCCAACGC CCTCTATGCC TTCGTCTGGG GCAAGGTCTG CGACTGGTAT GTGGAATTCT CGAAGCCGCT CTTCGATACG GAGGCGGCCG CCGAGACCCG CGCCACCATG GGCTGGGTGC TCGACCAGTG CATGGTCCTG CTCCACCCGA TCATGCCCTT CATCACCGAA GACCTCTGGG CCACCACGGG CAGCCGGACC AAGATGCTGG TTCACACCGA CTGGCCGTCC TTCGGGGCCG AACTGGTCGA TCCCGCGGCC GACCGCGAGA TGAGCTGGGT GATCTCGCTC ATCGAGGAGA TCCGCTCGGC CCGCGCTCAG GTCCATGTGC CGGCGGGGCT GAAACTGCCG GTCGTGCAGC TCGCGCTGGA TGCGGCCGGG CGCGAGGCGC TGGCGCGGAA CGAGGCGCTC ATCCTGCGTC TGGCGCGGCT CGAGGGCTTC ACCGAGGCGG CGAGCGCGCC GAAGGGGGCG CTCACCATCG CGGTCGAGGG CGGCAGCTTC GCCATCCCGC TCGAAGGCGT GATCGACATC GGCGCCGAGA AGGCGCGCCT CGCCAAGACC CTCGAGAAGC TCGAGAAGGA CATGGCGGGC CTCCGCGGGC GGCTCGGCAA CCCGAACTTC GTGGCCTCGG CCCCGGAAGA GGTGGTGGAC GAGGCGCGCA CCCGGCTCGA ACAGGGGGAA GAAGAAGGGG CCAAGCTCTC GGCCGCCCTC GCCCGGCTGT CCGAGATTGC CTGA
|
Protein sequence | MPMDKTFNAA EAEARLYDAW EKAGAFRAGA NASRPETFCI MIPPPNVTGS LHMGHAFNNT LQDILTRWHR MRGFDTLWQP GQDHAGIATQ MVVERELARA GNPGRREMGR EAFLEKVWEW KEQSGGTIVN QLKRLGASCD WSRNAFTMDP NFQRAVLKVF VDLYEKGFIY RGKRLVNWDP HFETAISDLE VEQVEVNGNM WRLRYQLADG ATYRHPVAFD EEGRPTEWEE RDYLTVATTR PETMLGDTGI AVNPSDERYA HLIGKEVVLP LVGRRIPIVA DDYADPSKGT GAVKITPAHD FNDWGVGQRT GLRAINVMSG RATMFLIENP DFTEGCAPSE EALALDGLDR YEARKRVVAL AEEQGWLDGI DQDRHMVPHG DRSKVAIEPM LTDQWFVDTA QIVQPAIDAV RTGRTEILPE RDAKTYFHWL ENIEPWCISR QLWWGHQIPV WYGLDIWPAR FEDDGDDTLD EVEIFELLED GAFNHADPTH HCAFDFEGVS EKFLDDLASL PHPLNNARVV EVASRAEAID RLAQALADYN LNEDPTHLVY PVWRDPDVLD TWFSSGLWPI GTLGWPEETP ELARYFPTNV LITGFDIIFF WVARMMMMQL AVVNEVPFKT VYVHALVRDE KGKKMSKSLG NVLDPLELID EFGADAVRFT LTAMAAMGRD LKLSTARIQG YRNFGTKLWN ACRFAEMNGV WEGHGTQAAP PAATATVNRW IIGETGRVRE EVDAALAAYR FDSAANALYA FVWGKVCDWY VEFSKPLFDT EAAAETRATM GWVLDQCMVL LHPIMPFITE DLWATTGSRT KMLVHTDWPS FGAELVDPAA DREMSWVISL IEEIRSARAQ VHVPAGLKLP VVQLALDAAG REALARNEAL ILRLARLEGF TEAASAPKGA LTIAVEGGSF AIPLEGVIDI GAEKARLAKT LEKLEKDMAG LRGRLGNPNF VASAPEEVVD EARTRLEQGE EEGAKLSAAL ARLSEIA
|
| |