Gene Rleg2_1845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1845 
SymbolthrS 
ID6980583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1894104 
End bp1896110 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content60% 
IMG OID643396567 
Productthreonyl-tRNA synthetase 
Protein accessionYP_002281356 
Protein GI209549439 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.215697 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.072473 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAG CCATTTTCCT GACATTTCCC GATGGTTCCG TGCGCAGCTT CCCGGCTGGC 
GCGACCGGCA GGGATGTCGC CGAATCCATT TCCAAGTCGC TCGCCAAGAG CGCCGTCGCC
ATTGCGATCG ATGGCACAGT GCAAGACCTT TCCGATACCG TCGCCGACGG CAAGATCGAG
ATCATCACCC GCAAGGACGG CCGTGCGCTG GAGCTTATCC GGCATGACGC CGCACACGTG
ATGGCTGAAG CCGTGCAGGA ACTTTGGCCC GGCACGCAGG TGACGATCGG CCCGGTCATC
GAAAACGGTT TCTATTACGA CTTCGCCAAG AACGAGCCCT TTACGCCCGA TGATCTGCCG
AAGATCGAAA AGAAGATGAA GGAGATCATT GCTCGCAATG CGCCCTTCAC CAAGCAGATC
TGGTCGCGCG AAAAGGCCAA GGAAGTCTTT GCGGCCAAGG GCGAAAACTA CAAGGTAGAA
CTTGTCGATG CGATCCCGGC AGGGCAGGAT CTGAAGATCT ACAATCAGGG CGATTGGTTC
GACCTTTGCC GCGGTCCGCA CATGGCCTCC ACCGGCCAGG TCGGCACGGC CTTCAAGCTG
ATGAAGGTTG CCGGCGCCTA TTGGCGCGGC GACAGCAACA ACGCCATGCT GTCACGCATC
TACGGCACGG CATGGGCTGA CCAGGCCGAT CTCGACAACT ATCTGCATAT GCTGGCGGAA
GCTGAAAAGC GCGACCACCG CAAGCTCGGC CGCGAAATGG ACCTGTTCCA TTTCCAGGAG
GAAGGCCCGG GTGTGGTCTT CTGGCATGGC AAGGGCTGGC GCATCTTCCA GGCGCTCGTC
TCCTATATGC GTCGCCGGCT CGCCGTCGAC TATGAAGAAG TCAACGCGCC GCAGGTGCTC
GACACCGCGC TCTGGGAAAC CTCGGGGCAT TGGGGCTGGT ATCAGGAAAA CATGTTCGCC
GTGAAATCGG CGCATGCGAT GACGCATCCG GAAGACAAGG AAGCGGATAA CCGCGTCTTT
GCGCTGAAGC CGATGAACTG CCCCGGCCAC GTGCAGATCT TCAAGCATGG GCTGAAGTCC
TATCGCGAAC TGCCGATCCG CCTCGCCGAA TTCGGCCTGG TACATCGCTA CGAGCCTTCG
GGCGCGCTGC ACGGGCTGAT GCGTGTGCGC GGCTTCACGC AGGACGACGC GCACATCTTC
TGCACCGACG AGCAGATGGC AGCCGAATGC CTGAAGATCA ACGATCTCAT CCTGTCGGTC
TATGAAGACT TCGGCTTCAA GGAAATCGTC GTCAAGCTTT CCACCCGGCC GGAAAAGCGT
GTCGGTTCCG ACGCACTCTG GGATCGCGCC GAAGCCGTGA TGACCGACGT GTTGAAAACG
ATCGAGGCGC AGTCCGAGGG CCGCATCAAG ACCGGCATTC TGCCGGGCGA GGGCGCCTTC
TACGGGCCGA AGTTCGAATA CACGCTGAAG GATGCGATCG GCCGTGAATG GCAGTGCGGC
ACGACGCAGG TCGATTTTAA TCTGCCTGAG CGATTCGGCG CCTTCTATAT CGACAGCAAC
TCCGAAAAGA CGCAGCCGGT GATGATCCAT CGCGCCATCT GCGGCTCAAT GGAACGCTTC
CTCGGTATCC TGATCGAAAA CTTCGCCGGC CATCTGCCGC TCTGGGTGTC GCCGCTGCAG
GTGGTGGTCG CGACAATCAC TTCGGAAGCC GACGCTTACG GGCTTGAAGT AGCCGAGGCG
CTGCGCGAGG CCGGTCTCAA CGTCGAGACC GATTTCCGCA ACGAGAAGAT CAACTACAAG
GTCCGCGAAC ACTCGGTCAC CAAGGTGCCC GTCATCATCG TCTGCGGCAG GAAGGAAGCC
GAGGAGCGCA CGGTCAACAT CCGCCGCCTC GGCAGCCAGG ACCAGGTTTC GATGGGGCTC
GACGCCGCCG TCGACAGCCT TGCCTTGGAA GCGACACCGC CAGACGTCCG TCGCAAGGCC
GAAGCAAAGA AAGCCAGGGC GGCCTGA
 
Protein sequence
MSEAIFLTFP DGSVRSFPAG ATGRDVAESI SKSLAKSAVA IAIDGTVQDL SDTVADGKIE 
IITRKDGRAL ELIRHDAAHV MAEAVQELWP GTQVTIGPVI ENGFYYDFAK NEPFTPDDLP
KIEKKMKEII ARNAPFTKQI WSREKAKEVF AAKGENYKVE LVDAIPAGQD LKIYNQGDWF
DLCRGPHMAS TGQVGTAFKL MKVAGAYWRG DSNNAMLSRI YGTAWADQAD LDNYLHMLAE
AEKRDHRKLG REMDLFHFQE EGPGVVFWHG KGWRIFQALV SYMRRRLAVD YEEVNAPQVL
DTALWETSGH WGWYQENMFA VKSAHAMTHP EDKEADNRVF ALKPMNCPGH VQIFKHGLKS
YRELPIRLAE FGLVHRYEPS GALHGLMRVR GFTQDDAHIF CTDEQMAAEC LKINDLILSV
YEDFGFKEIV VKLSTRPEKR VGSDALWDRA EAVMTDVLKT IEAQSEGRIK TGILPGEGAF
YGPKFEYTLK DAIGREWQCG TTQVDFNLPE RFGAFYIDSN SEKTQPVMIH RAICGSMERF
LGILIENFAG HLPLWVSPLQ VVVATITSEA DAYGLEVAEA LREAGLNVET DFRNEKINYK
VREHSVTKVP VIIVCGRKEA EERTVNIRRL GSQDQVSMGL DAAVDSLALE ATPPDVRRKA
EAKKARAA