Gene Rleg_1674 details


Gene Information

Locus tag: Rleg_1674
Symbol: valS
ID: 8012743
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1665401
End bp: 1668244
Gene Length: 2844 bp
Protein Length: 947 aa
Translation table: 11
GC content: 61%
IMG OID: 644824261
Product: valyl-tRNA synthetase
Protein accession: YP_002975500
Protein GI: 241204404
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0525] Valyl-tRNA synthetase
TIGRFAM ID: [TIGR00422] valyl-tRNA synthetase
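
The coordinate and length fields above are consistent with one another, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1665401
END_BP = 1668244
GENE_LENGTH_BP = 2844
PROTEIN_LENGTH_AA = 947


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length_bp):
    """Amino acids encoded, assuming the gene ends in one stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("gene length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


print(span_length(START_BP, END_BP))     # 2844
print(coding_length_aa(GENE_LENGTH_BP))  # 947
```

Both results match the listed Gene Length (2844 bp) and Protein Length (947 aa).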


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGACA AGACCTATGA TTCCGCCGCC GTCGAACCGA AAATCGCCGC GAAATGGGAT 
GAGGCGGACG CTTTCCGCGC CGGCGCGAAC GCCAGACCGG GGGCCGAGAC CTTCGCCATC
GTGATCCCGC CGCCGAATGT CACCGGTTCG CTGCACATGG GGCACGCACT GAACAACACG
CTGCAGGACA TCCTGGTGCG CTTCGAGCGC ATGCGCGGCA AGGACGTGCT CTGGCAGCCG
GGCATGGACC ACGCCGGCAT CGCCACGCAG ATGGTCGTCG AGCGCAAGCT GATGGAACAG
CAACTGCCGG GCCGCCGCGA CATGGGCCGC GAAGCCTTCA TCGACAAGAT TTGGGAGTGG
AAGGCGGAAT CGGGCGGCCT GATCTTCAAC CAGCTGAAGC GTCTTGGTGC GTCATGCGAT
TGGTCGCGCG AACGTTTCAC CATGGACGAG GGCCTGTCGA AGGCGGTTAT CGAAGTTTTC
GTCACGCTCT ACAAGGAAGG TCTGATCTAC AAGGACAATC GCCTGGTCAA TTGGGATCCG
AAGCTGCTGA CGGCGATTTC CGATATCGAA GTCGAGCAGC ACGAGGTCAA GGGCAATCTC
TGGCACCTGC GCTATCCATT GGAAAAGGGC GTAACCTACC AATATCCGAT TGCATTTGAT
GAGGAAGGCA AGCCGACCGA ATTCGAGACG CGCGATTATG TGGTCGTCGC AACGACACGA
CCTGAGACCA TGCTGGGCGA TACCGGTGTT GCCGTCAATC CGAAGGACGA ACGCTATCAG
GGCATTGTCG GCAAGCATGT CATTCTGCCG ATCGTCGGTC GCAGAATTCC GATCGTTGCC
GACGATTATG CCGATCCGGC TGCCGGCACC GGCGCGGTGA AGATAACGCC CGCGCACGAT
TTCAACGACT TCGACGTCGG CAAGCGTGCA GGTCTTCGCG TCATCAATAT CATGACCGGC
GACGGCACGA TCACCATCAA GGACAATGAG GACTTTCTCG AAGGTCTCGA CAATCCGGCG
GCGCTGCACG GCGCCTGGGA CCGCCTGGAA GGGCAGGACC GCTTCTATGC GCGCAAGGTG
ATCGTCGAGA TTTTCGAAGA GGCGGGCCTC GTCGACAAGA TCGAGCCGCA CAAGCATATG
GTCCCGCACG GCGATCGCGG TGGCGTGCCG ATCGAGCCGC GGCTGACCGA ACAATGGTAT
GTCGATGCCA AGACGCTCGC CGAGCCGGCG ATCGCCTCGG TCCGCGAGGG CCGCACCAAG
ATGGTGCCGA AGAGCTGGGA CAAGACCTAT TACGAATGGA TGGAAAATAT CCAGCCCTGG
TGCGTCTCCC GCCAGCTCTG GTGGGGGCAT CAGATTCCCG CCTGGTACGG CCCGGACGGC
CAGGTCTTCG TCGAAAAGAC CGAGGAAGAG GCGCTGCAGG CGGCGATCCA GCACTACCTC
TCGCATGAGG GGCCGATGAA GGCCTATGTC GAGGACCTGC TCGAAAACTT CAAGCCGGGC
GAAATCCTGA CGCGTGACGA GGACGTGCTC GACACCTGGT TCTCCTCAGC ACTCTGGCCT
TTCTCGACGC TCGGCTGGCC GGACGAGACG CCGGAGCTGG CGCGTTATTA CCCGACCAAC
GTTCTCGTCA CCGGCTTCGA CATCATCTTC TTCTGGGTCG CGCGCATGAT GATGATGGGC
CTGCACCTTA TGAAGGATGA GGATGGCGAA CCCGTCGAGC CCTTCGAGAC CGTCTATGTC
CACGCGCTGG TGCGCGACAA GAATGGGCAG AAGATGTCGA AATCGAAGGG CAACGTCATC
GATCCCTTGG AACTGATCGA CGAATACGGC GCCGACGCGC TGCGGTTCAC CCTGGCGATC
ATGGCGGCGC AGGGCCGCGA CGTGAAGCTC GATCCGGCCC GCATCGCCGG CTACCGCAAT
TTCGGCACCA AGCTCTGGAA CGCCACACGC TTCGCCGAGA TGAACGGCGC GAAGAGCGAT
CCGCATTTCG TGCCCGAAGC CGCCGAGCTC ACCATCAACC GCTGGATCCT GACGGAACTT
GCCCGTACGG AACGTGACGT TACGGAAGCG CTCGAAGCCT TCCGCTTCAA TGATGCTGCC
GGCGCGCTCT ACCGCTTCGT TTGGAACGAG GTCTGCGACT GGTATCTCGA ACTGTTGAAG
CCGGTCTTCA ATGGTGAGGA CGAGGGCGCC AAGGCCGAAG CCCAGGCCTG CAGCGCTTAT
ATTCTCGAAG AGATCTACAA GCTGCTGCAT CCCTTCATGC CCTTCATGAC CGAAGAGCTT
TGGGCCCATA CGGCAGGCGA AGGCAAAGAG CGCGACACAT TGGTCTGCCA CGCCGAATGG
CCGGCGCCGT CCTACGCCGA TGACGGGGCC GCCGACGAGA TCAACTGGCT GATCGACCTC
GTTTCCGGCA TCCGCTCGGT GCGTGCTGAG ATGAACGTGC CGCCATCGGC GACAGCCCCG
CTCGTCGTCG TCAAGGCCAA CAACCTGACG CGTGAAAGGC TGTTCCGCCA CGACGCCGCC
ATCAAGCGCC TTGCGCGCGT CGAGGCGATA TCGCTGGCTG ACGATGCGCC GAAGGGTGCC
GCTCAGATCG TCATCGCCGA GGCCACCATC TGCCTGCCGC TCGGCAATCT GATCGACCTT
TCCGCCGAAA AGGCTCGTTT GGAAAAGGCG ATTGCCAAGA TGGAGGGCGA GATCTCCCGC
ATAGACGGCA AACTCTCCAA CGAGAAGTTC GTCGCCAACG CCAATCCTGA AGTGGTCGAG
GCCGAGCGCG ATCGTCTCGA GGAACTGAAG GGGCAGATCG CAAGCCTGGG GATCGCTCTT
TCCAGGGTAA GCGAAGCCGG GTAA
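
The GC content reported for this gene is the fraction of G and C bases in the sequence. A minimal sketch of that calculation follows; it assumes the sequence is pasted as plain text in space-separated blocks of ten bases, as shown above, and the function name gc_fraction is hypothetical.

```python
def gc_fraction(dna):
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Small illustrative inputs, not the full gene.
print(gc_fraction("ATGC"))        # 0.5
print(gc_fraction("GGCC GGCC"))   # 1.0
```

Passing the full gene sequence to gc_fraction and rounding to a whole percentage gives a value that can be compared with the GC content field.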
 
Protein sequence
MLDKTYDSAA VEPKIAAKWD EADAFRAGAN ARPGAETFAI VIPPPNVTGS LHMGHALNNT 
LQDILVRFER MRGKDVLWQP GMDHAGIATQ MVVERKLMEQ QLPGRRDMGR EAFIDKIWEW
KAESGGLIFN QLKRLGASCD WSRERFTMDE GLSKAVIEVF VTLYKEGLIY KDNRLVNWDP
KLLTAISDIE VEQHEVKGNL WHLRYPLEKG VTYQYPIAFD EEGKPTEFET RDYVVVATTR
PETMLGDTGV AVNPKDERYQ GIVGKHVILP IVGRRIPIVA DDYADPAAGT GAVKITPAHD
FNDFDVGKRA GLRVINIMTG DGTITIKDNE DFLEGLDNPA ALHGAWDRLE GQDRFYARKV
IVEIFEEAGL VDKIEPHKHM VPHGDRGGVP IEPRLTEQWY VDAKTLAEPA IASVREGRTK
MVPKSWDKTY YEWMENIQPW CVSRQLWWGH QIPAWYGPDG QVFVEKTEEE ALQAAIQHYL
SHEGPMKAYV EDLLENFKPG EILTRDEDVL DTWFSSALWP FSTLGWPDET PELARYYPTN
VLVTGFDIIF FWVARMMMMG LHLMKDEDGE PVEPFETVYV HALVRDKNGQ KMSKSKGNVI
DPLELIDEYG ADALRFTLAI MAAQGRDVKL DPARIAGYRN FGTKLWNATR FAEMNGAKSD
PHFVPEAAEL TINRWILTEL ARTERDVTEA LEAFRFNDAA GALYRFVWNE VCDWYLELLK
PVFNGEDEGA KAEAQACSAY ILEEIYKLLH PFMPFMTEEL WAHTAGEGKE RDTLVCHAEW
PAPSYADDGA ADEINWLIDL VSGIRSVRAE MNVPPSATAP LVVVKANNLT RERLFRHDAA
IKRLARVEAI SLADDAPKGA AQIVIAEATI CLPLGNLIDL SAEKARLEKA IAKMEGEISR
IDGKLSNEKF VANANPEVVE AERDRLEELK GQIASLGIAL SRVSEAG
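
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates this on the first row of the gene sequence. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons; alternative start codons are not handled, which is an assumption that holds here because the gene begins with ATG. The names are illustrative.

```python
from itertools import product

# Codon table in the NCBI ordering: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA up to the first stop codon; whitespace is ignored."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "ATGCTCGACA AGACCTATGA TTCCGCCGCC GTCGAACCGA AAATCGCCGC GAAATGGGAT"
print(translate(first_row))  # MLDKTYDSAAVEPKIAAKWD
```

The output matches the first twenty residues of the protein sequence (MLDKTYDSAA VEPKIAAKWD).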