Gene Amir_2089 details


Gene Information

Locus tag: Amir_2089
Symbol: thrS
ID: 8326278
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 2310424
End bp: 2312472
Gene Length: 2049 bp
Protein Length: 682 aa
Translation table: 11
GC content: 70%
IMG OID: 644942639
Product: threonyl-tRNA synthetase
Protein accession: YP_003099880
Protein GI: 256376220
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0441] Threonyl-tRNA synthetase
TIGRFAM ID: [TIGR00418] threonyl-tRNA synthetase
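
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends with a single stop codon. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Protein length in aa, assuming the CDS includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length_bp = gene_length(2310424, 2312472)
print(length_bp, "bp")                   # 2049 bp
print(protein_length(length_bp), "aa")   # 682 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.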


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCCCAGC CAAGTCCCGC ATCCGCACTC GCCCCGCCTC GCGTGGTGGT GACGGCGGGG 
ACCACCGCTG GGACCGCCGT GCGCGAGGCA GGGCTGCCCG GCAAGGGCCC TGACGCGATC
GTGGTCGTCC GCGACGCCGA GGGCCACCTG CGCGACCTGT CCTGGACCCC GCAGGTCGAC
GTGGAGGTCG AGGCGGTCGC CGCCGACACC GAGGACGGCC GCTCGGTCAT CCGGCACTCC
ACCGCGCACG TGCTCGCCCA GGCCGTGCAG CAGCAGTTCC CCGAGGCCAA GCTGGGTATC
GGCCCGCCGG TCAAGGACGG CTTCTACTAC GACTTCCAGG TCGACAGGCC GTTCACCCCG
GAAGACCTCG CCGCGCTGGA GAAGCGCATG AAGGCGATCG TCAAGGGCGC GCAGCGCTTC
ACCCGCCGCG TGGTCGAGTC GACCGACGCC GCGAAGGCCG AGCTGGCCTC CGAGCCGTTC
AAGCTGGAGC TGGTCGACGT CAAGGGCGGC GTGGACACCG CCGAGGTCAT GGAGGTGGGC
GGCGGCGAGC TGACCATCTA CGACAACCTC GACCCGCGCT CCGGCGAACG CGTGTGGGGC
GACCTGTGCC GCGGCCCGCA CCTGCCCACC ACCAAGCACA TCCCGGCGTT CAAGCTCACC
AGGGTCGCCG CCGCCTACTG GCGCGGCAAC GAGAAGAACC CGCAGCTCCA GCGCATCTAC
GGCACCGCCT GGGAGTCGCA GGAGGCGCTG GACAAGCACG TCGAGCTGAT CGCCGAGGCC
GAGCGCCGCG ACCACCGCAA GCTCGGCGTC GAGCTGGACC TGTTCAGCTT CCCCGACGAG
ATCGGCTCCG GCCTCGCGGT CTTCCACCCG CGCGGCGGCA TCATCCGCAA GGCCATGGAG
GACTACTCGC GGGCCCGGCA CGAGGCCGAG GGCTACGAGT TCGTCTACTC GCCGCACATC
ACCAAGGGCA ACCTGTTCGA GACCTCCGGG CACCTCGACT GGTACCGCGA CGGCATGTAC
CCGGCGATGC ACCTGGACGC CGAGCTCAAC GAGGACGGCA CGATCCGCCG CCCCGGCCAG
GACTACTACC TCAAGCCGAT GAACTGCCCG TTCCACGACC TGATCTTCCG GTCGCGCGGG
CGCTCCTACC GCGAGCTGCC GCTGCGCATG TTCGAGTTCG GCTCGGTCTA CCGCTACGAG
AAGTCCGGCG TGATCCACGG CCTGACCCGC GTGCGCGGCA TGACGCAGGA CGACGCGCAC
ATCTTCTGCA CCCTGGACCA GGTGCAGGAG GAGCTGAAGT CGCTCCTGGC GTTCGTGCTC
GGCCTGCTGC GCGACTACGG CCTCGACGAC TTCTACCTGG AGCTGTCGAC CCGCAACGAC
GAGAAGTACG TCGGCAGCGA CGAGCTGTGG GAGACGGCCA CCGAGACGCT GCGCGTCGCC
GCCGAGGACT CCGGCCTCGA ACTCGTGCCC GACCCCGGCG GCGCGGCGTT CTACGGCCCG
AAGATCTCCG TGCAGGCCAA GGACGCGCTC GGCCGCACCT GGCAGATGTC CACCATCCAG
CTGGACTTCA ACCTGCCCGA GCGCTTCGAG CTGGAGTACA CCGGCCCGGA CGGCTCCCGC
CAGCGCCCGG TGATGATCCA CCGCGCCCTG TTCGGCTCGA TCGAGCGGTT CTTCGGCGTG
CTGACCGAGC ACTACGCGGG CGCGTTCCCG GCGTGGCTGG CCCCGGTGCA GGTCGTGGGC
ATCCCGATCG CCGACGAGCA CGCCGACCAC CTGTTCGCGG TGGCCAAGGA GCTCAAGAAG
CACGGCGTGC GGGTCGAGAT CGACGCCTCC GACGACCGGA TGCAGAAGAA GATCCGCAAC
CACACCACGC AGAAGGTGCC GTTCATGCTG CTCGCGGGCG GCAAGGACGT CGAGTCCGGC
GCGGTGTCGT TCCGGTTCCG CGACGGCACC CAGATCAACG GCGTCCCGGT CGAGCAGGCC
GTCGCCACGG TCGTCGGCTG GATCTCCCGC CGCGAGAACG CCTCCCCCAC GGCGGAACTC
GTCAAGTGA
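
The gene sequence is printed in blocks of 10 bases, so whitespace has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction that can be compared with the GC content field of this record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as input here.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied with its 10-base blocks.
first_line = "GTGTCCCAGC CAAGTCCCGC ATCCGCACTC GCCCCGCCTC GCGTGGTGGT GACGGCGGGG"
print(len(clean_sequence(first_line)))   # 60
print(gc_fraction(first_line))           # 0.75
```

Passing all lines of the gene sequence as one string gives the GC fraction for the whole gene.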
 
Protein sequence
MSQPSPASAL APPRVVVTAG TTAGTAVREA GLPGKGPDAI VVVRDAEGHL RDLSWTPQVD 
VEVEAVAADT EDGRSVIRHS TAHVLAQAVQ QQFPEAKLGI GPPVKDGFYY DFQVDRPFTP
EDLAALEKRM KAIVKGAQRF TRRVVESTDA AKAELASEPF KLELVDVKGG VDTAEVMEVG
GGELTIYDNL DPRSGERVWG DLCRGPHLPT TKHIPAFKLT RVAAAYWRGN EKNPQLQRIY
GTAWESQEAL DKHVELIAEA ERRDHRKLGV ELDLFSFPDE IGSGLAVFHP RGGIIRKAME
DYSRARHEAE GYEFVYSPHI TKGNLFETSG HLDWYRDGMY PAMHLDAELN EDGTIRRPGQ
DYYLKPMNCP FHDLIFRSRG RSYRELPLRM FEFGSVYRYE KSGVIHGLTR VRGMTQDDAH
IFCTLDQVQE ELKSLLAFVL GLLRDYGLDD FYLELSTRND EKYVGSDELW ETATETLRVA
AEDSGLELVP DPGGAAFYGP KISVQAKDAL GRTWQMSTIQ LDFNLPERFE LEYTGPDGSR
QRPVMIHRAL FGSIERFFGV LTEHYAGAFP AWLAPVQVVG IPIADEHADH LFAVAKELKK
HGVRVEIDAS DDRMQKKIRN HTTQKVPFML LAGGKDVESG AVSFRFRDGT QINGVPVEQA
VATVVGWISR RENASPTAEL VK
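
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, which uses the standard amino acid assignments but accepts GTG, among others, as an initiation codon that is read as methionine. The following is a minimal sketch of that translation, not the pipeline used to produce this record. `translate_cds` is a hypothetical helper, and `START_CODONS` is an assumed subset of the table 11 initiators.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a subset of the initiation codons of table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(raw: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at a stop codon."""
    seq = "".join(raw.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate_cds("GTGTCCCAGC CAAGTCCCGC ATCCGCACTC"))  # MSQPSPASAL
print(translate_cds("GTCAAGTGA"))                         # VK
```

The two outputs match the first ten and last two residues of the protein sequence.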