Gene Rleg2_1475 details


Gene Information

Locus tag: Rleg2_1475
Symbol: valS
ID: 6980205
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 1502676
End bp: 1505519
Gene Length: 2844 bp
Protein Length: 947 aa
Translation table: 11
GC content: 61%
IMG OID: 643396196
Product: valyl-tRNA synthetase
Protein accession: YP_002280993
Protein GI: 209549076
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0525] Valyl-tRNA synthetase
TIGRFAM ID: [TIGR00422] valyl-tRNA synthetase
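
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single untranslated stop codon (neither is stated in the record); the variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 1502676
end_bp = 1505519

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 2844 947
```

Under these assumptions the computed values match the reported gene length of 2844 bp and protein length of 947 aa.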


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.515714
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.107806
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGACA AGACCTATGA TTCTGCTGCC GTCGAACCCA AAATCGCCGC GAAATGGGAT 
GAGGAGGACG CTTTCCGCGC CGGCGCGAAC GCCAAGCCCG GGGCTGAGAC CTTCACCATC
GTGATCCCGC CGCCGAATGT CACCGGTTCG CTGCACATGG GCCATGCGCT GAATAACACG
CTGCAGGACA TCATGGTGCG CTTCGAGCGC ATGCGCGGCA AGGATGTGCT CTGGCAGCCG
GGCATGGACC ATGCCGGCAT CGCCACGCAG ATGGTCGTCG AGCGCAAGCT GATGGAACAG
CAGCTGCCCG GCCGCCGCGA AATGGGCCGT GAAGCCTTCA TCGACAAGAT CTGGGAATGG
AAGGCTGAAT CGGGTGGCCT GATCTTCAAC CAGCTGAAAC GCCTGGGCGC CTCATGCGAC
TGGTCGCGCG AACGCTTCAC CATGGACGAG GGGCTGTCGC AGGCGGTTCT CGAGGTTTTC
GTCACGCTCT ACAAGGAAGG CCTGATCTAC AAGGACAAGC GCCTGGTCAA TTGGGACCCC
AAACTCCTGA CTGCTATCTC CGACATGGAA GTCGAGCAGC ACGAGATGAA GGGCCATCTC
TGGCATCTGC GTTATCCCCT CGAGCCGGGC GTCACCTATC AATATCCGAT CGCCTTCGAT
GAGGAGGGTA AGCCAACCGA ATTCGAGACG CGCGACTATA TCGTCGTCGC GACGACGCGG
CCGGAGACGA TGCTGGGCGA TACCGGTGTT GCCGTGAACC CCGAAGACGA GCGTTACAAA
CCGATCGTCG GCAAACATGT CATTCTGCCG ATCGTCGGCC GCAAGATTCC GATCGTCGCC
GACGATTATG CCGACCCGAC GGCCGGCACC GGCGCAGTGA AGATCACGCC TGCGCATGAT
TTCAACGACT TCGAGGTCGG CAAGCGCGCG AAGCTGCGGG CCATCAACGT CATGAACGTC
GATGCCACGA TTACCATCAA GGAGAACGAG GATTTCCTCG AAGGCCTCGA CAATCCGGCC
GCGCTGCATG GCGCCTGGGA TCGTCTGGAA GGGCAGGACC GTTTCTATGC GCGCAAGGTG
ATCGTCGAGA TCTTCGAAGA GGCCGGGCTC CTCGACAAGA TCGAACCGCA CAAGAATATG
GTGCCGCACG GCGACCGCGG CGGCGTGCCG ATCGAGCCGC GGCTGACCGA ACAATGGTTT
GTCGACAACA AGACATTGGG ACAACCCGCG CTGGAATCCG TTCGCGAGGG AAAAACCAGG
TTCATTCCCA GGAACTGGGA AAACACCTAT TTCAACTGGC TGGAAAACAT CGAGCCGTGG
TGCATTTCCC GCCAGCTCTG GTGGGGACAT CAGATTCCCG CCTGGTATGG CCCGGACGGT
CAGGTCTTCG TCGAAAAGAC CGAGGAAGAA GCGCTGCAGG CGGCGATCCA GCACTACCTC
TCGCATGAGG GGCCGATGAA GGCCTATGTC GAGGACCTGC TCGAAAACTT CAAGCCGGGC
GAGATCCTGA CGCGGGACGA GGACGTGCTC GACACTTGGT TCTCCTCCGC GCTCTGGCCC
TTCTCGACGC TCGGCTGGCC CGACCAGACG CCGGAACTCG CGCGTTATTA TCCGACGAGT
GTTCTGGTCA CCGGTTTCGA TATCATCCCG TTCTGGGTCG TCCGCATGAT GCAGATGGGC
CTGCATTTCA TGAAGGACGA GAACGGCGAT CCCGTCGAAC CCTTCCACAC GATCTACATT
CACGCGCTGG TGCGCGACAA GAATGGGCAG AAGATGTCGA AGTCGAAGGG CAACGTCATC
GATCCCCTGG AACTGATCGA CGAATACGGC GCCGACGCGC TGCGCTTCAC GCTGGCGATC
ATGGCGGCGC AGGGCCGTGA CGTGAAGCTC GATCCGGCCC GCATCGCCGG CTACCGCAAT
TTCGGCACGA AGCTCTGGAA CGCCACGCGT TTCGCCGAGA TGAACGGTGC CAAGAGCGAT
CCGCATTTCG TGCCGGAGGC AGCCGAACTC ACCATCAACC GCTGGATCCT GACAGAACTT
GCCCGTACGG AACGTGACGT TACGGAAGCA CTCGAAGCCT TCCGTTTCAA CGATGCCGCC
GGCGCGCTCT ATCGCTTCGT CTGGAACGAG GTCTGCGACT GGTATCTCGA GCTTTTGAAG
CCGGTCTTCA ACGGCGAGGA CGAGGGCGCC AAGGCCGAGG CCCAGGCCTG CAGCGCCTAT
ATCCTCGAAG AGATCTACAA ACTGCTGCAT CCCTTCATGC CTTTCATGAC CGAAGAGCTC
TGGGCGCATA CGGCCGGCGA GGGCAAAGAG CGCGATACGC TGGTCTGCCA CGCCGAATGG
CCTTCGCCCT CCTATGCCGA TAACGGCGCC GCGGACGAGA TCAACTGGCT GATCGACCTC
GTCTCCGGCA TCCGCTCGGT GCGCGCCGAA ATGAATGTGC CGCCGTCGGC CACCGCTCCG
CTCGTCGTCG TCAAGGCCAA CAACCTGACC CGCGAAAGGC TGTTCCGCCA CGACGCCGCC
ATCAAGCGGC TTGCGCGTGT CGAGGCGATA TCGCAGGCCG ACGATGCGCC GAAGGGGGCT
GCTCAAATCG TCGTCGACGA GGCCACCATC TGCCTTCCGC TCGGCAATCT GATCGACCTT
TCCACTGAAA AGGCTCGGCT TGAAAAGGCG ATCGGAAAAA TGGAAGGGGA GATCTCGCGC
ATCGACGGCA AGCTCTCCAA CGAGAAGTTC GTCGCCAATG CCAATCCGGA GGTGGTCGAG
GCCGAGCGCG AGCGCCTCGA GGAACTGAAG GGGCAGATCG CCAGCTTGAA GACCGCCCTC
TCCAGGGTGA GCGAAGCCGG ATAA
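
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis such as a GC content calculation. The following is a minimal sketch; the function names `clean_sequence` and `gc_fraction` are hypothetical helpers, and the demo input is only the first line of the sequence above, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a short demo input.
first_line = "ATGCTCGACA AGACCTATGA TTCTGCTGCC GTCGAACCCA AAATCGCCGC GAAATGGGAT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the full 2844 bp sequence, the same calculation would be expected to give a value close to the 61% GC content listed in the gene record.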
 
Protein sequence
MLDKTYDSAA VEPKIAAKWD EEDAFRAGAN AKPGAETFTI VIPPPNVTGS LHMGHALNNT 
LQDIMVRFER MRGKDVLWQP GMDHAGIATQ MVVERKLMEQ QLPGRREMGR EAFIDKIWEW
KAESGGLIFN QLKRLGASCD WSRERFTMDE GLSQAVLEVF VTLYKEGLIY KDKRLVNWDP
KLLTAISDME VEQHEMKGHL WHLRYPLEPG VTYQYPIAFD EEGKPTEFET RDYIVVATTR
PETMLGDTGV AVNPEDERYK PIVGKHVILP IVGRKIPIVA DDYADPTAGT GAVKITPAHD
FNDFEVGKRA KLRAINVMNV DATITIKENE DFLEGLDNPA ALHGAWDRLE GQDRFYARKV
IVEIFEEAGL LDKIEPHKNM VPHGDRGGVP IEPRLTEQWF VDNKTLGQPA LESVREGKTR
FIPRNWENTY FNWLENIEPW CISRQLWWGH QIPAWYGPDG QVFVEKTEEE ALQAAIQHYL
SHEGPMKAYV EDLLENFKPG EILTRDEDVL DTWFSSALWP FSTLGWPDQT PELARYYPTS
VLVTGFDIIP FWVVRMMQMG LHFMKDENGD PVEPFHTIYI HALVRDKNGQ KMSKSKGNVI
DPLELIDEYG ADALRFTLAI MAAQGRDVKL DPARIAGYRN FGTKLWNATR FAEMNGAKSD
PHFVPEAAEL TINRWILTEL ARTERDVTEA LEAFRFNDAA GALYRFVWNE VCDWYLELLK
PVFNGEDEGA KAEAQACSAY ILEEIYKLLH PFMPFMTEEL WAHTAGEGKE RDTLVCHAEW
PSPSYADNGA ADEINWLIDL VSGIRSVRAE MNVPPSATAP LVVVKANNLT RERLFRHDAA
IKRLARVEAI SQADDAPKGA AQIVVDEATI CLPLGNLIDL STEKARLEKA IGKMEGEISR
IDGKLSNEKF VANANPEVVE AERERLEELK GQIASLKTAL SRVSEAG