Gene Rleg_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2044 
SymbolthrS 
ID8013075 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2037176 
End bp2039161 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content61% 
IMG OID644824630 
Productthreonyl-tRNA synthetase 
Protein accessionYP_002975861 
Protein GI241204765 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.565288 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCAAG CTATTTCCCT GACATTTCCC GATGGTTCCG TGCGCGGCTA TGATGCCGGC 
GCAACCGGCC GGGATGTCGC TGAATCCATT TCCAAGTCGC TGGCCAAGAA GGCGGTCGCC
GTTGCGATCG ACGGCACCGT GCGCGACCTT TCCGATCCCG TGACGACGGG CAGGATCGAG
ATCATCACCC GCAATGACGA CCGCGCGCTC GAACTCATCC GCCATGACGC GGCGCATGTC
ATGGCCGAAG CGGTGCAGGA GATCTGGCCC GGCACCCAGG TGACGATCGG CCCGGTCATC
GAAAACGGCT TCTATTACGA CTTTGCCAAG AACGAGCCCT TCACACTCGA CGATCTGCCG
AAGATCGAAA AGAAGATGAA GGAGATCATC GCCCGCAACG CCGCCTTCAC CAAGCAGGTC
TGGTCGCGCG AGAAGGCGAA ACAGGTCTTC GCCGACAAGG GCGAGCAGTA CAAGGTCGAA
CTCGTCGATG CGATCCCGGA AGGGCAAGAT CTCAAGATCT ATTACCAGGG CGACTGGTTC
GACCTCTGCC GCGGCCCGCA CATGGCCTCG ACCGGCCAGA TCGGCAGCGC CTTCAAGCTC
TTGAAGGTGG CTGGCGCCTA TTGGCGCGGC GACAGCAACA ATCCGATGCT GAGCCGCATC
TACGGCACGG CCTTCGCCGA GCAGTCGGAA CTCGACAACT ACCTGCATAT GCTTGCCGAA
GCCGAAAAGC GCGATCATCG CCGGCTCGGC CGCGAGATGG ATCTCTTCCA TTTCCAGGAA
GAGGGTCCCG GCGTCGTCTT CTGGCACGGC AAGGGCTGGC GCGTCTTTCA GACGCTGGTT
GCCTATATGC GCCGCCGGCT GGCCGGCGAC TATCAGGAAG TCAATGCGCC ACAGGTGCTC
GACAAGTCGC TGTGGGAGAC CTCCGGCCAC TGGGGCTGGT ATCGCGACAA CATGTTCAAG
GTAACGGTTG CCGGCGACGA CACCGACGAT GATCGCGTCT TCGCGCTGAA GCCGATGAAC
TGCCCCGGCC ATATCCAGAT CTTCAAGCAC GGGCTGAAAT CCTACCGCGA ACTGCCGATC
CGGCTGGCCG AATTCGGCAA TGTCCACCGT TACGAGCCGT CGGGCGCGCT GCACGGGCTG
ATGCGCGTGC GCGGCTTCAC GCAGGACGAT GCGCATATCT TCTGCACGGA CGAGCAGATG
GCCGCCGAAT GCCTGAAGAT CAACGACCTG ATTCTTTCGG TCTATAAGGA TTTCGGCTTC
GACGAAGTCA CCATCAAGCT CTCGACAAGG CCGGACAAGC GCGTCGGCTC GGACGATCTC
TGGGACCGCG CCGAAAGCGT GATGATGGGC GTGCTGGAGA CGATCCAGCA GCAGTCGAAC
AACATCAAGA CCGGCATCCT GCCGGGCGAG GGCGCCTTCT ACGGTCCGAA GTTCGAATAT
ACGCTGAAGG ATGCGATCGG CCGAGAATGG CAGTGCGGCA CGACGCAGGT CGACTTCAAC
CTGCCGGAAC GCTTCGGCGC CTTCTACATC GACAGCAACT CCGAAAAGAC GCAGCCGGTG
ATGATCCACC GCGCCATCTG CGGCTCGATG GAGCGCTTCC TCGGCATCCT GATCGAGAAC
TTTGCCGGCC ATATGCCACT CTGGGTATCG CCGCTGCAGG TGGTGGTCGC AACGATCACC
TCGGAAGCCG ACGCCTACGG GCTTGAAGTG GCCGAGGCGC TGCGCGAGGC CGGTCTCAAC
GTTGAAACCG ATTTCCGCAA CGAGAAGATC AACTACAAGG TCCGCGAGCA TTCGGTCACG
AAGGTTCCTG TCATCATCGT CTGCGGCAGG AAGGAAGCCG AGGAGCGCAC GGTCAACATC
CGCCGCCTCG GCAGCCAGGA CCAGGTTTCG ATGGGGCTCG ACGCCGCCGT CGAGAGCCTT
GCCCTCGAAG CGACACCGCC TGACATCCGT CGGAAGGCCG AAGCAAAAAA AGCCAAGGCG
GCCTGA
 
Protein sequence
MSQAISLTFP DGSVRGYDAG ATGRDVAESI SKSLAKKAVA VAIDGTVRDL SDPVTTGRIE 
IITRNDDRAL ELIRHDAAHV MAEAVQEIWP GTQVTIGPVI ENGFYYDFAK NEPFTLDDLP
KIEKKMKEII ARNAAFTKQV WSREKAKQVF ADKGEQYKVE LVDAIPEGQD LKIYYQGDWF
DLCRGPHMAS TGQIGSAFKL LKVAGAYWRG DSNNPMLSRI YGTAFAEQSE LDNYLHMLAE
AEKRDHRRLG REMDLFHFQE EGPGVVFWHG KGWRVFQTLV AYMRRRLAGD YQEVNAPQVL
DKSLWETSGH WGWYRDNMFK VTVAGDDTDD DRVFALKPMN CPGHIQIFKH GLKSYRELPI
RLAEFGNVHR YEPSGALHGL MRVRGFTQDD AHIFCTDEQM AAECLKINDL ILSVYKDFGF
DEVTIKLSTR PDKRVGSDDL WDRAESVMMG VLETIQQQSN NIKTGILPGE GAFYGPKFEY
TLKDAIGREW QCGTTQVDFN LPERFGAFYI DSNSEKTQPV MIHRAICGSM ERFLGILIEN
FAGHMPLWVS PLQVVVATIT SEADAYGLEV AEALREAGLN VETDFRNEKI NYKVREHSVT
KVPVIIVCGR KEAEERTVNI RRLGSQDQVS MGLDAAVESL ALEATPPDIR RKAEAKKAKA
A