Gene Rleg_0038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0038 
Symbol 
ID8011285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp34214 
End bp35278 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content62% 
IMG OID644822628 
Producttryptophanyl-tRNA synthetase 
Protein accessionYP_002973888 
Protein GI241202792 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0180] Tryptophanyl-tRNA synthetase 
TIGRFAM ID[TIGR00233] tryptophanyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0701529 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.176582 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAT TCAAGAAACT CGTATTCTCC GGCGTGCAGC CGACCGGCAA TCTGCATCTC 
GGCAATTATC TCGGCGCGAT CCGCCGGTTC GTGGCGCTGC AGGAAGGCAA TGACTGCATC
TACTGCGTCG TCGACATGCA TGCGCTCACC GCCCAGCTCG TGCATGAGGA CATGCCGAGC
CAGACGCGCT CGATCGCCGC CGCCTTCATC GCTGCCGGCA TCGATCCGGA AAAGCATATC
GTCTTCAATC AGTCGGCCGT GCCGCAGCAT GCCGAACTCG CCTGGATCTT CAACTGCGTC
GCCCGCATCG GCTGGATGAA CCGGATGACG CAGTTCAAGG ACAAGGCCGG CAAGGACCGC
GAGCAGGCCT CGCTCGGGCT CTACGCCTAT CCGAGCCTGA TGGCCGCCGA CATTCTCGTC
TATCGCGCCA CCCATGTGCC TGTTGGTGAG GACCAGAAGC AGCATCTGGA GCTTGCCCGC
GACATCGCGA TGAAGTTCAA CCTCGACTAT GCCGAGCATA TCAGCAGGAC CGGTTACGGC
GTCGACATCA CCGTCGGCAA CGAGCCGGTG CATGCCTATT TCCCGATGGT CGAGCCGTTG
ATCGGCGGGC CGGCGCCGCG CGTCATGTCG CTGCGCGACG GCACCAAGAA AATGTCGAAG
TCGGACCCTT CCGATCTCTC GCGCATCAAC CTGATGGACG ACGAGGACGC TATCTCGAAG
AAGATCCGCA AGGCCAAGAC CGATCCTGAC GGCTTGCCGA GCGAGATCGA CGGGCTGCAG
GGCCGTCCGG AAGCCGACAA TCTGGTGGCG ATCTATGCCG CACTCGCCGA CAAGTCGAAG
GCGGACGTGC TTGCCGAATT CGGCGGCCAG CAATTCTCCG TCTTCAAGCC GGCGCTGGTC
GACCTGGCGA TCAACGTGCT CGCACCGATC ACCGGCGAAA TGCGCCGGCT GATGGATGAT
ACCAGCCATA TCGACGCGAT CCTGCGCAAG GGCGGCGAGC GCGCAAGGGC GCGCGCAGAG
GTGACGATGC GCCAAGTGCG CGACGTCATC GGCTTCCTGT ATTGA
 
Protein sequence
MSEFKKLVFS GVQPTGNLHL GNYLGAIRRF VALQEGNDCI YCVVDMHALT AQLVHEDMPS 
QTRSIAAAFI AAGIDPEKHI VFNQSAVPQH AELAWIFNCV ARIGWMNRMT QFKDKAGKDR
EQASLGLYAY PSLMAADILV YRATHVPVGE DQKQHLELAR DIAMKFNLDY AEHISRTGYG
VDITVGNEPV HAYFPMVEPL IGGPAPRVMS LRDGTKKMSK SDPSDLSRIN LMDDEDAISK
KIRKAKTDPD GLPSEIDGLQ GRPEADNLVA IYAALADKSK ADVLAEFGGQ QFSVFKPALV
DLAINVLAPI TGEMRRLMDD TSHIDAILRK GGERARARAE VTMRQVRDVI GFLY