Gene Smed_1054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1054 
SymbolthrS 
ID5321900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1121556 
End bp1123538 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content61% 
IMG OID640789997 
Productthreonyl-tRNA synthetase 
Protein accessionYP_001326742 
Protein GI150396275 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.290687 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.56724 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCATT CCGTTTCCCT TACATTCCCC GATGGCTCCG TGCGAGAATT CGCCCCCGGC 
ACGACCGGCC GCGATGTTGC GGAATCGATC TCGAAGTCGC TCGCCAAAAA GTCCGTCGCG
ATCGCCATCG ACGGCGAGCT GCGTGACCTC TCGGATCCCG TGACGGAAGG CAGGATTGAG
ATCGTGACGC GCGAGGACAA GCGCGCGCTA GAGCTCATCC GCCACGACGC GGCCCATGTC
ATGGCGGAAG CCGTCCAGGA GTTGTGGCCG GGAACGCAGG TGACCATTGG CCCGGTCATC
GACAATGGTT TCTATTACGA CTTCGCAAAG AACGAGCCCT TCACGCCCGA CGACCTGCCG
GTCATCGAGA AGAGGATGAG GGAGATCATC GCGCGCAACA AGCCTTTCAC CAAGGAAGTC
TGGTCGCGCG ACAAGGCCAA GGAGGTCTTT GCCGTCAAGG GCGAGAGTTA CAAGGTTGAA
CTCGTCGACG CTATCCCCGA AGGCCAGGAC CTGAAGATTT ATTACCAGGG CGACTGGTTC
GATCTCTGCC GGGGGCCACA TATGGCTTCG ACGGGCCAGA TCGGCACCGC CTTCAAATTG
ATGAAGGTCG CCGGCGCCTA CTGGCGTGGC GACAGCAACA ATCCGATGCT GACGCGCATC
TACGGCACCG CCTGGCACAC GCAGGAGGAG CTCGATCAGT ATCTGCACGT GCTGGCCGAG
GCCGAAAAGC GCGACCATCG CCGGCTCGGC CGCGAGATGG ACCTGTTCCA TTTCCAGGAG
GAAGGACCGG GCGTGGTCTT CTGGCACGGA AAGGGCTGGC GCATCTTCCA GAGCCTCGTC
GCCTATATGC GCCGCCGCCT CGAAGGCGAT TATCAGGAGG TCAACGCTCC GCAGGTGCTC
GACAAGTCTC TCTGGGAGAC CTCCGGCCAC TGGGGATGGT ATCGTGACAA CATGTTCAAG
GTGACTGTCG CCGGGGACGA TACGGATGAC GATCGCGTCT TCGCGCTGAA GCCGATGAAT
TGCCCCGGGC ACATCCAGAT CTTCAAGCAT GGCTTGAAGT CTTACCGGGA ACTACCTGTA
AGGCTGGCAG AATTCGGTGC CGTCCATCGC TACGAGCCAT CAGGTGCGCT GCACGGCCTG
ATGCGCGTGC GCGGCTTCAC CCAGGACGAC GCGCACATCT TCTGCACCGA TGAGCAGATG
GCGGCCGAAT GCCTGAAGAT CAACGATCTT ATCCTCTCCG TCTATGAGGA CTTCGGCTTC
AAGGAGATCG TCGTGAAGCT GTCGACGCGG CCGGAAAAGC GTGTGGGCTC CGATGAACTG
TGGGACCGCG CCGAGGCGGT GATGACGGAG GTCTTGAAGA CGATCGAGGC GCAGTCCGAG
GGCCGCATCA AGACCGGCAT CCTGCCGGGC GAGGGTGCGT TCTACGGTCC GAAATTCGAA
TATACGCTGA AGGACGCAAT CGGCCGCGAA TGGCAGTGCG GAACGACGCA GGTCGATTTC
AATCTGCCGG AGCGCTTCGG CGCCTTCTAC ATCGACAGCG AATCCGAGAA GCGTCAGCCG
GTCATGATCC ATCGCGCCAT TTGCGGGTCG ATGGAGCGTT TCCTCGGCAT CCTGCTCGAG
AATTTTGCGG GCCATATGCC GCTGTGGATC TCGCCTCTGC AGGTGGTGGT CGCAACGATC
ACCTCGGAAG CGGATGATTA TGGCCGCGAA GTGGCCGAAC GCTTGCGGGA CGCCGGCCTG
ACGGTCGAGA CCGATTTCCG AAACGAGAAG ATCAACTACA AGGTCCGCGA ACACTCGGTA
ACAAAGGTTC CGGTGATCGT CGTCTGCGGC AAGCGGGAAG CAGAGGAACG TTCCGTCAAC
ATCCGTCGCC TGGGCTCCCA GGCGCAGACG GCTATGTCGC TCGACGAGGC AGTTGCTTCG
CTCTCCGCCG AAGCCACGGC GCCGGACCTC AAGCGCAAGG CGGAGCGGAC CGCCCGCGCC
TGA
 
Protein sequence
MSHSVSLTFP DGSVREFAPG TTGRDVAESI SKSLAKKSVA IAIDGELRDL SDPVTEGRIE 
IVTREDKRAL ELIRHDAAHV MAEAVQELWP GTQVTIGPVI DNGFYYDFAK NEPFTPDDLP
VIEKRMREII ARNKPFTKEV WSRDKAKEVF AVKGESYKVE LVDAIPEGQD LKIYYQGDWF
DLCRGPHMAS TGQIGTAFKL MKVAGAYWRG DSNNPMLTRI YGTAWHTQEE LDQYLHVLAE
AEKRDHRRLG REMDLFHFQE EGPGVVFWHG KGWRIFQSLV AYMRRRLEGD YQEVNAPQVL
DKSLWETSGH WGWYRDNMFK VTVAGDDTDD DRVFALKPMN CPGHIQIFKH GLKSYRELPV
RLAEFGAVHR YEPSGALHGL MRVRGFTQDD AHIFCTDEQM AAECLKINDL ILSVYEDFGF
KEIVVKLSTR PEKRVGSDEL WDRAEAVMTE VLKTIEAQSE GRIKTGILPG EGAFYGPKFE
YTLKDAIGRE WQCGTTQVDF NLPERFGAFY IDSESEKRQP VMIHRAICGS MERFLGILLE
NFAGHMPLWI SPLQVVVATI TSEADDYGRE VAERLRDAGL TVETDFRNEK INYKVREHSV
TKVPVIVVCG KREAEERSVN IRRLGSQAQT AMSLDEAVAS LSAEATAPDL KRKAERTARA