Gene Msil_3538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3538 
SymbolthrS 
ID7092395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3887215 
End bp3889161 
Gene Length1947 bp 
Protein Length648 aa 
Translation table11 
GC content62% 
IMG OID643466829 
Productthreonyl-tRNA synthetase 
Protein accessionYP_002363789 
Protein GI217979642 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.211804 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGATG TTACCTTTCC CGATGGCGCC ACGCGTCAGT TTCCCGCCGG CGCAAGCGGG 
TTCGACATCG CCAAAGCCAT TTCCCCCTCG CTCGCCAAAC GCACGGTCGC CATGATCGTC
GACGGCTCCC TAAGCGATCT CGCCGATCCC GTCCGCCACG GGGCGCGCGT GGAGTTCATC
GGCCGCGAGG ATAAGCGCGC CCTTGAGCTC ATCCGCCACG ACGCCGCGCA TGTCCTCGCC
GAAGCGGTGC AGACGCTCTA TCCGGGCACG CAAGTGACGA TCGGTCCCGT CATCGAGAAC
GGGTTCTATT ACGATTTTTT CCGCAACGAG CCGTTCACGC CGGACGATTT TGCCGCCGTC
GAAAAAAAAA TGGCCGAAAT CATCGCGCGC GACAGGCCCT TCACGAAGGA GGTCTGGAGC
CGCGATCAGG CGCGCTCCTT CTTCGAGACA AGGGGCGAGG CGTTCAAGGT TGAACTCGTC
GACTCTATTC CGGGCGACGA GGATCTGAAG ATCTACCGGC AGGGCGACTG GCTCGACCTC
TGCCGCGGCC CGCATATGAC CTCAACCGGC AAGATCGGCG ACGCATTCAA GCTGATGAAG
GTCGCCGGCG CCTATTGGCG CGGCGATTCG AACCGCCCCA TGCTGCAACG CATCTATGCG
ACGGCCTTCG CCACGCGCGA GGAGCTCGCG GCCTATCTGA GGCAGATCGA GGAGGCCGAG
AAGCGCGACC ATCGGCGGCT CGGGCGGGAA ATGGACCTGT TCCATTTTCA GGAGGAAGGC
CCCGGCGCGA TCTTCTGGCA CCCCAAGGGC TGGACCCTGT TCCAGACCCT CATCGCCTAT
ATGCGGCGAC GCATGGAGGC TGACGGCTAT CTCGAAGTGA ATACGCCGCA GGTGCTCGAC
CGCGCCCTCT GGGAAACCTC CGGCCACTGG CAGACTTTTC GCGAACACAT GTTCCTTACC
AAGACCGAGG ACGAGCGGAT CTTCGCGCTG AAGCCGATGA ACTGTCCCGG CCATGTGCAG
ATCTTCAAGA ACGGCCTCAA ATCCTATCGC GACCTGCCGC TGAAGATCGG CGAGTTCGGC
GTCGTGCATC GCTATGAGCC GTCCGGCGCG CTGCATGGGG TCATGCGGGT GCGCGCCTTC
ACGCAGGATG ACGCGCATAT CTTCTGCACG GAAGATCAGA TCATGGCTGA GAGCCTCAAG
GTCAATGATC TGATTCTCTC GATCTATCAA GACTTTGGCT TTGACGACGT CATTATCAAG
CTCTCGACGC GCCCCGAAAA ACGCGTCGGC TCGGAAGAGG CCTGGGACAA GGCGGAGGCG
GCCCTCGGCC GGGTCGTCGA CATGCTGGCG GCTGGAGGCG CGAAGACCGG CGTCAATCCC
GGCGAGGGCG CGTTTTACGG TCCAAAGCTC GAATATACGC TACGCGACGC GATCGGCCGC
GAATGGCAAT GCGGGACGAC GCAAGTCGAT TTCAATATGC CCGGCCGCTT CGGGGCCTTC
TATATCGACG CCGACAGCGA CAAAAAGACG CCAGTCATGA TCCATCGCGC GATCTTCGGA
TCGCTCGAAC GATTCACCGG CATTCTGATC GAGCATTTTT CAGGCAATCT GCCTTTGTGG
CTGGCGCCGG TGCAGGTGAT CGTCGCGACG ATCACCCAGG ACGCCGACGA CTATGCGCTC
GAAGTGGCGG CGGCGGCCCG CAAATACGGC CTGCGCGTCG AAGCGGATTT GCGCAACGAA
AAGATCTCCT ACAAGATTAG GGAGCATTCC CTCGCCAAGA CGCCGGTGCT GATCGTCGTC
GGCAAACGGG AGGGGCTGGA GCGCACTGTT TCGATTCGGC GGCTCGGCGC TTCCGATACG
ACAAGTCTGA GCCTCGAGAC GGCGCTGGAG CGCCTCGCGG ACGAAGCAAC GCCGCCAGAT
CTCAGGCGCC TTCGCAGCGC AGCATGA
 
Protein sequence
MIDVTFPDGA TRQFPAGASG FDIAKAISPS LAKRTVAMIV DGSLSDLADP VRHGARVEFI 
GREDKRALEL IRHDAAHVLA EAVQTLYPGT QVTIGPVIEN GFYYDFFRNE PFTPDDFAAV
EKKMAEIIAR DRPFTKEVWS RDQARSFFET RGEAFKVELV DSIPGDEDLK IYRQGDWLDL
CRGPHMTSTG KIGDAFKLMK VAGAYWRGDS NRPMLQRIYA TAFATREELA AYLRQIEEAE
KRDHRRLGRE MDLFHFQEEG PGAIFWHPKG WTLFQTLIAY MRRRMEADGY LEVNTPQVLD
RALWETSGHW QTFREHMFLT KTEDERIFAL KPMNCPGHVQ IFKNGLKSYR DLPLKIGEFG
VVHRYEPSGA LHGVMRVRAF TQDDAHIFCT EDQIMAESLK VNDLILSIYQ DFGFDDVIIK
LSTRPEKRVG SEEAWDKAEA ALGRVVDMLA AGGAKTGVNP GEGAFYGPKL EYTLRDAIGR
EWQCGTTQVD FNMPGRFGAF YIDADSDKKT PVMIHRAIFG SLERFTGILI EHFSGNLPLW
LAPVQVIVAT ITQDADDYAL EVAAAARKYG LRVEADLRNE KISYKIREHS LAKTPVLIVV
GKREGLERTV SIRRLGASDT TSLSLETALE RLADEATPPD LRRLRSAA