Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1475 |
Symbol | valS |
ID | 6980205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1502676 |
End bp | 1505519 |
Gene Length | 2844 bp |
Protein Length | 947 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643396196 |
Product | valyl-tRNA synthetase |
Protein accession | YP_002280993 |
Protein GI | 209549076 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.515714 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.107806 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGACA AGACCTATGA TTCTGCTGCC GTCGAACCCA AAATCGCCGC GAAATGGGAT GAGGAGGACG CTTTCCGCGC CGGCGCGAAC GCCAAGCCCG GGGCTGAGAC CTTCACCATC GTGATCCCGC CGCCGAATGT CACCGGTTCG CTGCACATGG GCCATGCGCT GAATAACACG CTGCAGGACA TCATGGTGCG CTTCGAGCGC ATGCGCGGCA AGGATGTGCT CTGGCAGCCG GGCATGGACC ATGCCGGCAT CGCCACGCAG ATGGTCGTCG AGCGCAAGCT GATGGAACAG CAGCTGCCCG GCCGCCGCGA AATGGGCCGT GAAGCCTTCA TCGACAAGAT CTGGGAATGG AAGGCTGAAT CGGGTGGCCT GATCTTCAAC CAGCTGAAAC GCCTGGGCGC CTCATGCGAC TGGTCGCGCG AACGCTTCAC CATGGACGAG GGGCTGTCGC AGGCGGTTCT CGAGGTTTTC GTCACGCTCT ACAAGGAAGG CCTGATCTAC AAGGACAAGC GCCTGGTCAA TTGGGACCCC AAACTCCTGA CTGCTATCTC CGACATGGAA GTCGAGCAGC ACGAGATGAA GGGCCATCTC TGGCATCTGC GTTATCCCCT CGAGCCGGGC GTCACCTATC AATATCCGAT CGCCTTCGAT GAGGAGGGTA AGCCAACCGA ATTCGAGACG CGCGACTATA TCGTCGTCGC GACGACGCGG CCGGAGACGA TGCTGGGCGA TACCGGTGTT GCCGTGAACC CCGAAGACGA GCGTTACAAA CCGATCGTCG GCAAACATGT CATTCTGCCG ATCGTCGGCC GCAAGATTCC GATCGTCGCC GACGATTATG CCGACCCGAC GGCCGGCACC GGCGCAGTGA AGATCACGCC TGCGCATGAT TTCAACGACT TCGAGGTCGG CAAGCGCGCG AAGCTGCGGG CCATCAACGT CATGAACGTC GATGCCACGA TTACCATCAA GGAGAACGAG GATTTCCTCG AAGGCCTCGA CAATCCGGCC GCGCTGCATG GCGCCTGGGA TCGTCTGGAA GGGCAGGACC GTTTCTATGC GCGCAAGGTG ATCGTCGAGA TCTTCGAAGA GGCCGGGCTC CTCGACAAGA TCGAACCGCA CAAGAATATG GTGCCGCACG GCGACCGCGG CGGCGTGCCG ATCGAGCCGC GGCTGACCGA ACAATGGTTT GTCGACAACA AGACATTGGG ACAACCCGCG CTGGAATCCG TTCGCGAGGG AAAAACCAGG TTCATTCCCA GGAACTGGGA AAACACCTAT TTCAACTGGC TGGAAAACAT CGAGCCGTGG TGCATTTCCC GCCAGCTCTG GTGGGGACAT CAGATTCCCG CCTGGTATGG CCCGGACGGT CAGGTCTTCG TCGAAAAGAC CGAGGAAGAA GCGCTGCAGG CGGCGATCCA GCACTACCTC TCGCATGAGG GGCCGATGAA GGCCTATGTC GAGGACCTGC TCGAAAACTT CAAGCCGGGC GAGATCCTGA CGCGGGACGA GGACGTGCTC GACACTTGGT TCTCCTCCGC GCTCTGGCCC TTCTCGACGC TCGGCTGGCC CGACCAGACG CCGGAACTCG CGCGTTATTA TCCGACGAGT GTTCTGGTCA CCGGTTTCGA TATCATCCCG TTCTGGGTCG TCCGCATGAT GCAGATGGGC CTGCATTTCA TGAAGGACGA GAACGGCGAT CCCGTCGAAC CCTTCCACAC GATCTACATT CACGCGCTGG TGCGCGACAA GAATGGGCAG AAGATGTCGA AGTCGAAGGG CAACGTCATC GATCCCCTGG AACTGATCGA CGAATACGGC GCCGACGCGC TGCGCTTCAC GCTGGCGATC ATGGCGGCGC AGGGCCGTGA CGTGAAGCTC GATCCGGCCC GCATCGCCGG CTACCGCAAT TTCGGCACGA AGCTCTGGAA CGCCACGCGT TTCGCCGAGA TGAACGGTGC CAAGAGCGAT CCGCATTTCG TGCCGGAGGC AGCCGAACTC ACCATCAACC GCTGGATCCT GACAGAACTT GCCCGTACGG AACGTGACGT TACGGAAGCA CTCGAAGCCT TCCGTTTCAA CGATGCCGCC GGCGCGCTCT ATCGCTTCGT CTGGAACGAG GTCTGCGACT GGTATCTCGA GCTTTTGAAG CCGGTCTTCA ACGGCGAGGA CGAGGGCGCC AAGGCCGAGG CCCAGGCCTG CAGCGCCTAT ATCCTCGAAG AGATCTACAA ACTGCTGCAT CCCTTCATGC CTTTCATGAC CGAAGAGCTC TGGGCGCATA CGGCCGGCGA GGGCAAAGAG CGCGATACGC TGGTCTGCCA CGCCGAATGG CCTTCGCCCT CCTATGCCGA TAACGGCGCC GCGGACGAGA TCAACTGGCT GATCGACCTC GTCTCCGGCA TCCGCTCGGT GCGCGCCGAA ATGAATGTGC CGCCGTCGGC CACCGCTCCG CTCGTCGTCG TCAAGGCCAA CAACCTGACC CGCGAAAGGC TGTTCCGCCA CGACGCCGCC ATCAAGCGGC TTGCGCGTGT CGAGGCGATA TCGCAGGCCG ACGATGCGCC GAAGGGGGCT GCTCAAATCG TCGTCGACGA GGCCACCATC TGCCTTCCGC TCGGCAATCT GATCGACCTT TCCACTGAAA AGGCTCGGCT TGAAAAGGCG ATCGGAAAAA TGGAAGGGGA GATCTCGCGC ATCGACGGCA AGCTCTCCAA CGAGAAGTTC GTCGCCAATG CCAATCCGGA GGTGGTCGAG GCCGAGCGCG AGCGCCTCGA GGAACTGAAG GGGCAGATCG CCAGCTTGAA GACCGCCCTC TCCAGGGTGA GCGAAGCCGG ATAA
|
Protein sequence | MLDKTYDSAA VEPKIAAKWD EEDAFRAGAN AKPGAETFTI VIPPPNVTGS LHMGHALNNT LQDIMVRFER MRGKDVLWQP GMDHAGIATQ MVVERKLMEQ QLPGRREMGR EAFIDKIWEW KAESGGLIFN QLKRLGASCD WSRERFTMDE GLSQAVLEVF VTLYKEGLIY KDKRLVNWDP KLLTAISDME VEQHEMKGHL WHLRYPLEPG VTYQYPIAFD EEGKPTEFET RDYIVVATTR PETMLGDTGV AVNPEDERYK PIVGKHVILP IVGRKIPIVA DDYADPTAGT GAVKITPAHD FNDFEVGKRA KLRAINVMNV DATITIKENE DFLEGLDNPA ALHGAWDRLE GQDRFYARKV IVEIFEEAGL LDKIEPHKNM VPHGDRGGVP IEPRLTEQWF VDNKTLGQPA LESVREGKTR FIPRNWENTY FNWLENIEPW CISRQLWWGH QIPAWYGPDG QVFVEKTEEE ALQAAIQHYL SHEGPMKAYV EDLLENFKPG EILTRDEDVL DTWFSSALWP FSTLGWPDQT PELARYYPTS VLVTGFDIIP FWVVRMMQMG LHFMKDENGD PVEPFHTIYI HALVRDKNGQ KMSKSKGNVI DPLELIDEYG ADALRFTLAI MAAQGRDVKL DPARIAGYRN FGTKLWNATR FAEMNGAKSD PHFVPEAAEL TINRWILTEL ARTERDVTEA LEAFRFNDAA GALYRFVWNE VCDWYLELLK PVFNGEDEGA KAEAQACSAY ILEEIYKLLH PFMPFMTEEL WAHTAGEGKE RDTLVCHAEW PSPSYADNGA ADEINWLIDL VSGIRSVRAE MNVPPSATAP LVVVKANNLT RERLFRHDAA IKRLARVEAI SQADDAPKGA AQIVVDEATI CLPLGNLIDL STEKARLEKA IGKMEGEISR IDGKLSNEKF VANANPEVVE AERERLEELK GQIASLKTAL SRVSEAG
|
| |