Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1674 |
Symbol | valS |
ID | 8012743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 1665401 |
End bp | 1668244 |
Gene Length | 2844 bp |
Protein Length | 947 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644824261 |
Product | valyl-tRNA synthetase |
Protein accession | YP_002975500 |
Protein GI | 241204404 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGACA AGACCTATGA TTCCGCCGCC GTCGAACCGA AAATCGCCGC GAAATGGGAT GAGGCGGACG CTTTCCGCGC CGGCGCGAAC GCCAGACCGG GGGCCGAGAC CTTCGCCATC GTGATCCCGC CGCCGAATGT CACCGGTTCG CTGCACATGG GGCACGCACT GAACAACACG CTGCAGGACA TCCTGGTGCG CTTCGAGCGC ATGCGCGGCA AGGACGTGCT CTGGCAGCCG GGCATGGACC ACGCCGGCAT CGCCACGCAG ATGGTCGTCG AGCGCAAGCT GATGGAACAG CAACTGCCGG GCCGCCGCGA CATGGGCCGC GAAGCCTTCA TCGACAAGAT TTGGGAGTGG AAGGCGGAAT CGGGCGGCCT GATCTTCAAC CAGCTGAAGC GTCTTGGTGC GTCATGCGAT TGGTCGCGCG AACGTTTCAC CATGGACGAG GGCCTGTCGA AGGCGGTTAT CGAAGTTTTC GTCACGCTCT ACAAGGAAGG TCTGATCTAC AAGGACAATC GCCTGGTCAA TTGGGATCCG AAGCTGCTGA CGGCGATTTC CGATATCGAA GTCGAGCAGC ACGAGGTCAA GGGCAATCTC TGGCACCTGC GCTATCCATT GGAAAAGGGC GTAACCTACC AATATCCGAT TGCATTTGAT GAGGAAGGCA AGCCGACCGA ATTCGAGACG CGCGATTATG TGGTCGTCGC AACGACACGA CCTGAGACCA TGCTGGGCGA TACCGGTGTT GCCGTCAATC CGAAGGACGA ACGCTATCAG GGCATTGTCG GCAAGCATGT CATTCTGCCG ATCGTCGGTC GCAGAATTCC GATCGTTGCC GACGATTATG CCGATCCGGC TGCCGGCACC GGCGCGGTGA AGATAACGCC CGCGCACGAT TTCAACGACT TCGACGTCGG CAAGCGTGCA GGTCTTCGCG TCATCAATAT CATGACCGGC GACGGCACGA TCACCATCAA GGACAATGAG GACTTTCTCG AAGGTCTCGA CAATCCGGCG GCGCTGCACG GCGCCTGGGA CCGCCTGGAA GGGCAGGACC GCTTCTATGC GCGCAAGGTG ATCGTCGAGA TTTTCGAAGA GGCGGGCCTC GTCGACAAGA TCGAGCCGCA CAAGCATATG GTCCCGCACG GCGATCGCGG TGGCGTGCCG ATCGAGCCGC GGCTGACCGA ACAATGGTAT GTCGATGCCA AGACGCTCGC CGAGCCGGCG ATCGCCTCGG TCCGCGAGGG CCGCACCAAG ATGGTGCCGA AGAGCTGGGA CAAGACCTAT TACGAATGGA TGGAAAATAT CCAGCCCTGG TGCGTCTCCC GCCAGCTCTG GTGGGGGCAT CAGATTCCCG CCTGGTACGG CCCGGACGGC CAGGTCTTCG TCGAAAAGAC CGAGGAAGAG GCGCTGCAGG CGGCGATCCA GCACTACCTC TCGCATGAGG GGCCGATGAA GGCCTATGTC GAGGACCTGC TCGAAAACTT CAAGCCGGGC GAAATCCTGA CGCGTGACGA GGACGTGCTC GACACCTGGT TCTCCTCAGC ACTCTGGCCT TTCTCGACGC TCGGCTGGCC GGACGAGACG CCGGAGCTGG CGCGTTATTA CCCGACCAAC GTTCTCGTCA CCGGCTTCGA CATCATCTTC TTCTGGGTCG CGCGCATGAT GATGATGGGC CTGCACCTTA TGAAGGATGA GGATGGCGAA CCCGTCGAGC CCTTCGAGAC CGTCTATGTC CACGCGCTGG TGCGCGACAA GAATGGGCAG AAGATGTCGA AATCGAAGGG CAACGTCATC GATCCCTTGG AACTGATCGA CGAATACGGC GCCGACGCGC TGCGGTTCAC CCTGGCGATC ATGGCGGCGC AGGGCCGCGA CGTGAAGCTC GATCCGGCCC GCATCGCCGG CTACCGCAAT TTCGGCACCA AGCTCTGGAA CGCCACACGC TTCGCCGAGA TGAACGGCGC GAAGAGCGAT CCGCATTTCG TGCCCGAAGC CGCCGAGCTC ACCATCAACC GCTGGATCCT GACGGAACTT GCCCGTACGG AACGTGACGT TACGGAAGCG CTCGAAGCCT TCCGCTTCAA TGATGCTGCC GGCGCGCTCT ACCGCTTCGT TTGGAACGAG GTCTGCGACT GGTATCTCGA ACTGTTGAAG CCGGTCTTCA ATGGTGAGGA CGAGGGCGCC AAGGCCGAAG CCCAGGCCTG CAGCGCTTAT ATTCTCGAAG AGATCTACAA GCTGCTGCAT CCCTTCATGC CCTTCATGAC CGAAGAGCTT TGGGCCCATA CGGCAGGCGA AGGCAAAGAG CGCGACACAT TGGTCTGCCA CGCCGAATGG CCGGCGCCGT CCTACGCCGA TGACGGGGCC GCCGACGAGA TCAACTGGCT GATCGACCTC GTTTCCGGCA TCCGCTCGGT GCGTGCTGAG ATGAACGTGC CGCCATCGGC GACAGCCCCG CTCGTCGTCG TCAAGGCCAA CAACCTGACG CGTGAAAGGC TGTTCCGCCA CGACGCCGCC ATCAAGCGCC TTGCGCGCGT CGAGGCGATA TCGCTGGCTG ACGATGCGCC GAAGGGTGCC GCTCAGATCG TCATCGCCGA GGCCACCATC TGCCTGCCGC TCGGCAATCT GATCGACCTT TCCGCCGAAA AGGCTCGTTT GGAAAAGGCG ATTGCCAAGA TGGAGGGCGA GATCTCCCGC ATAGACGGCA AACTCTCCAA CGAGAAGTTC GTCGCCAACG CCAATCCTGA AGTGGTCGAG GCCGAGCGCG ATCGTCTCGA GGAACTGAAG GGGCAGATCG CAAGCCTGGG GATCGCTCTT TCCAGGGTAA GCGAAGCCGG GTAA
|
Protein sequence | MLDKTYDSAA VEPKIAAKWD EADAFRAGAN ARPGAETFAI VIPPPNVTGS LHMGHALNNT LQDILVRFER MRGKDVLWQP GMDHAGIATQ MVVERKLMEQ QLPGRRDMGR EAFIDKIWEW KAESGGLIFN QLKRLGASCD WSRERFTMDE GLSKAVIEVF VTLYKEGLIY KDNRLVNWDP KLLTAISDIE VEQHEVKGNL WHLRYPLEKG VTYQYPIAFD EEGKPTEFET RDYVVVATTR PETMLGDTGV AVNPKDERYQ GIVGKHVILP IVGRRIPIVA DDYADPAAGT GAVKITPAHD FNDFDVGKRA GLRVINIMTG DGTITIKDNE DFLEGLDNPA ALHGAWDRLE GQDRFYARKV IVEIFEEAGL VDKIEPHKHM VPHGDRGGVP IEPRLTEQWY VDAKTLAEPA IASVREGRTK MVPKSWDKTY YEWMENIQPW CVSRQLWWGH QIPAWYGPDG QVFVEKTEEE ALQAAIQHYL SHEGPMKAYV EDLLENFKPG EILTRDEDVL DTWFSSALWP FSTLGWPDET PELARYYPTN VLVTGFDIIF FWVARMMMMG LHLMKDEDGE PVEPFETVYV HALVRDKNGQ KMSKSKGNVI DPLELIDEYG ADALRFTLAI MAAQGRDVKL DPARIAGYRN FGTKLWNATR FAEMNGAKSD PHFVPEAAEL TINRWILTEL ARTERDVTEA LEAFRFNDAA GALYRFVWNE VCDWYLELLK PVFNGEDEGA KAEAQACSAY ILEEIYKLLH PFMPFMTEEL WAHTAGEGKE RDTLVCHAEW PAPSYADDGA ADEINWLIDL VSGIRSVRAE MNVPPSATAP LVVVKANNLT RERLFRHDAA IKRLARVEAI SLADDAPKGA AQIVIAEATI CLPLGNLIDL SAEKARLEKA IAKMEGEISR IDGKLSNEKF VANANPEVVE AERDRLEELK GQIASLGIAL SRVSEAG
|
| |