Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0204 |
Symbol | thrS |
ID | 3916192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 209688 |
End bp | 211682 |
Gene Length | 1995 bp |
Protein Length | 664 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640442929 |
Product | threonyl-tRNA synthetase |
Protein accession | YP_495486 |
Protein GI | 87198229 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0441] Threonyl-tRNA synthetase |
TIGRFAM ID | [TIGR00418] threonyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGAGT TGCTGAAGAT CACCCTGCCC GACGGTTCCG TGCGCGAGGT CGCGCCGGGC AGCACCCCGG CAGACATTGC CGCCGCGATC GGCCCGGGCC TTGCGAAGGC TGCACTGGCG GCGAAAGTCG ATGGCGAACT GGTGGACCTG ACGCGGCCAT TCACGGCGGA CGCGCAACTG GCGCTGGTCA CGGCGAAGGA CGAGGCCGAA GCGCTCGACC TTGCACGGCA CGATTATGCG CACGTCCTCG CCGAAGCGGT GCAGGCGCTG TTTCCGGGCA CGCAGATCAC CTTCGGGCCG AGCACGGACG ACGGCTTCTA CTACGACTTC GCGCCGAAGG ACCGGCCCTT CACCGACGAG GATCTGCCCG CCATCGAGGC GGAAATGCGC AAGATCATCG CCGCGAACAA GCCACTGCGC CGCGAGGTCT GGAGCCGCGA GCAACTGATC AGCCGCTGGA AGCAGCAGGG CGAGAGCTTC AAGGCCGAAT GGGCGGCGGA ACTGCCTGAG AACGAGGAAT TGACCGTCTA CTGGTCGGGC GACGACTGGC TCGACATGTG CCGCGGCCCG CACCTGCCCT CGACCGGCAA GCTCGATCCC AATGCGTTCA AGCTGACTCG CGTTTCGGGG GCCTACTGGC GCGGCGACCA GAAGAACGCG ATGCTCAGCC GAGTCTACGG TACCGGCTGG CTCAACAAGA AGCAGCTCGA CGCGCACCTG ACGCGGCTGG AGGAAGCCGC CAAGCGCGAC CATCGCAAGC TGGGCAACGA GATGGACCTG TTCCATCTCC AGCAGGAAGC GCACGGTTCG GTGTTCTGGC ACCCGAAGGG CTATCTGATC TGGCGCGAGC TGGAAGCCTA CATGCGCCGC GCTATCGACG GCGCGGGCTA TCGCGAGGTC AAGACCCCGC AGGTCATGGA CGCGCGCCAG TGGGAGCAAT CGGGCCACTG GGGCAAGTAC CGCGAGAACA TGTTCGTCAT TCCCGACGAA GTGCCCAACG TCGACGATGA AGGGCCGATC GTTTCGAACG ATGCGGACTG GATGGCGCTG AAGCCGATGA ACTGCCCGGC GCACGTCCTG ATCTTCCGCC AGGGCATCAA GTCCTACCGC GAACTGCCGC TGCGCCTGTA CGAGAACGGC TGCTGCCACC GCAACGAGCC GCACGGCGCG CTGCACGGGT TGATGCGGGT GCGCCAGTTC ACGCAGGACG ACGCGCACAT CTTCTGCCGC GAAGACCAGA TCGTTTCGGA AGTGCAGGCC TTCTGCGAGC TGGCCGACCG CATCTACAAG CACTTCGGTT TCACCTACTC GATCAAGCTC GCGCTGCGCC CGGAAAAGCG CTTCGGCACC GAGGAGATGT GGGACAAGGC CGAGCGCGAA CTGCGCGACG CGGTGGTGCG CGCAGGCCTT GCCACCGAGG AATACGGCTG GGAGGAACTG CCGGGCGAAG GCGCGTTCTA CGCGCCCAAG CTGGAATGGC ACCTGACCGA CGCTATCGGC CGTACCTGGC AGGTCGGCAC GATCCAGTCG GACCGCGTCC TGCCCGAACG CCTCGACGCA AGCTACATCG GCGAGGATGG CGAGAAGCAC CGCCCGGTCA TGCTGCACCG CGCGATCTTC GGTTCCTACG AGCGCTTCAT CGGCATCCTG ATCGAGCACT TCGCCGGTCG CCTGCCGGTG TGGCTCGCGC CGGTCCAGGC AGTGGTCGCC ACGATCGTTT CGGACGCCGA CGACTATGCC AGGGACGCGC TGGCCAAGCT GAAGGCGGCG GGCATCCGCG CCGATACCGA CCTGCGCAAC GAGAAGATCA ACTACAAGGT GCGCGAACAC TCGCTGCAAA AGGTTCCGTA CCTGCTGGTG GTGGGCAAGC GCGAGGCCGA GGAAGGCACC GTGGCGATCC GCATCCTGGG CGAGCAGCAC CAGAAGGTGA TGCCGCTCGA CGAGGCGATT GCCCTGCTCA AGGGTGAGGC CACGGCGCCG GATCTCAGGG CCTGA
|
Protein sequence | MSELLKITLP DGSVREVAPG STPADIAAAI GPGLAKAALA AKVDGELVDL TRPFTADAQL ALVTAKDEAE ALDLARHDYA HVLAEAVQAL FPGTQITFGP STDDGFYYDF APKDRPFTDE DLPAIEAEMR KIIAANKPLR REVWSREQLI SRWKQQGESF KAEWAAELPE NEELTVYWSG DDWLDMCRGP HLPSTGKLDP NAFKLTRVSG AYWRGDQKNA MLSRVYGTGW LNKKQLDAHL TRLEEAAKRD HRKLGNEMDL FHLQQEAHGS VFWHPKGYLI WRELEAYMRR AIDGAGYREV KTPQVMDARQ WEQSGHWGKY RENMFVIPDE VPNVDDEGPI VSNDADWMAL KPMNCPAHVL IFRQGIKSYR ELPLRLYENG CCHRNEPHGA LHGLMRVRQF TQDDAHIFCR EDQIVSEVQA FCELADRIYK HFGFTYSIKL ALRPEKRFGT EEMWDKAERE LRDAVVRAGL ATEEYGWEEL PGEGAFYAPK LEWHLTDAIG RTWQVGTIQS DRVLPERLDA SYIGEDGEKH RPVMLHRAIF GSYERFIGIL IEHFAGRLPV WLAPVQAVVA TIVSDADDYA RDALAKLKAA GIRADTDLRN EKINYKVREH SLQKVPYLLV VGKREAEEGT VAIRILGEQH QKVMPLDEAI ALLKGEATAP DLRA
|
| |