Gene Saro_0204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0204 
SymbolthrS 
ID3916192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp209688 
End bp211682 
Gene Length1995 bp 
Protein Length664 aa 
Translation table11 
GC content66% 
IMG OID640442929 
Productthreonyl-tRNA synthetase 
Protein accessionYP_495486 
Protein GI87198229 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGAGT TGCTGAAGAT CACCCTGCCC GACGGTTCCG TGCGCGAGGT CGCGCCGGGC 
AGCACCCCGG CAGACATTGC CGCCGCGATC GGCCCGGGCC TTGCGAAGGC TGCACTGGCG
GCGAAAGTCG ATGGCGAACT GGTGGACCTG ACGCGGCCAT TCACGGCGGA CGCGCAACTG
GCGCTGGTCA CGGCGAAGGA CGAGGCCGAA GCGCTCGACC TTGCACGGCA CGATTATGCG
CACGTCCTCG CCGAAGCGGT GCAGGCGCTG TTTCCGGGCA CGCAGATCAC CTTCGGGCCG
AGCACGGACG ACGGCTTCTA CTACGACTTC GCGCCGAAGG ACCGGCCCTT CACCGACGAG
GATCTGCCCG CCATCGAGGC GGAAATGCGC AAGATCATCG CCGCGAACAA GCCACTGCGC
CGCGAGGTCT GGAGCCGCGA GCAACTGATC AGCCGCTGGA AGCAGCAGGG CGAGAGCTTC
AAGGCCGAAT GGGCGGCGGA ACTGCCTGAG AACGAGGAAT TGACCGTCTA CTGGTCGGGC
GACGACTGGC TCGACATGTG CCGCGGCCCG CACCTGCCCT CGACCGGCAA GCTCGATCCC
AATGCGTTCA AGCTGACTCG CGTTTCGGGG GCCTACTGGC GCGGCGACCA GAAGAACGCG
ATGCTCAGCC GAGTCTACGG TACCGGCTGG CTCAACAAGA AGCAGCTCGA CGCGCACCTG
ACGCGGCTGG AGGAAGCCGC CAAGCGCGAC CATCGCAAGC TGGGCAACGA GATGGACCTG
TTCCATCTCC AGCAGGAAGC GCACGGTTCG GTGTTCTGGC ACCCGAAGGG CTATCTGATC
TGGCGCGAGC TGGAAGCCTA CATGCGCCGC GCTATCGACG GCGCGGGCTA TCGCGAGGTC
AAGACCCCGC AGGTCATGGA CGCGCGCCAG TGGGAGCAAT CGGGCCACTG GGGCAAGTAC
CGCGAGAACA TGTTCGTCAT TCCCGACGAA GTGCCCAACG TCGACGATGA AGGGCCGATC
GTTTCGAACG ATGCGGACTG GATGGCGCTG AAGCCGATGA ACTGCCCGGC GCACGTCCTG
ATCTTCCGCC AGGGCATCAA GTCCTACCGC GAACTGCCGC TGCGCCTGTA CGAGAACGGC
TGCTGCCACC GCAACGAGCC GCACGGCGCG CTGCACGGGT TGATGCGGGT GCGCCAGTTC
ACGCAGGACG ACGCGCACAT CTTCTGCCGC GAAGACCAGA TCGTTTCGGA AGTGCAGGCC
TTCTGCGAGC TGGCCGACCG CATCTACAAG CACTTCGGTT TCACCTACTC GATCAAGCTC
GCGCTGCGCC CGGAAAAGCG CTTCGGCACC GAGGAGATGT GGGACAAGGC CGAGCGCGAA
CTGCGCGACG CGGTGGTGCG CGCAGGCCTT GCCACCGAGG AATACGGCTG GGAGGAACTG
CCGGGCGAAG GCGCGTTCTA CGCGCCCAAG CTGGAATGGC ACCTGACCGA CGCTATCGGC
CGTACCTGGC AGGTCGGCAC GATCCAGTCG GACCGCGTCC TGCCCGAACG CCTCGACGCA
AGCTACATCG GCGAGGATGG CGAGAAGCAC CGCCCGGTCA TGCTGCACCG CGCGATCTTC
GGTTCCTACG AGCGCTTCAT CGGCATCCTG ATCGAGCACT TCGCCGGTCG CCTGCCGGTG
TGGCTCGCGC CGGTCCAGGC AGTGGTCGCC ACGATCGTTT CGGACGCCGA CGACTATGCC
AGGGACGCGC TGGCCAAGCT GAAGGCGGCG GGCATCCGCG CCGATACCGA CCTGCGCAAC
GAGAAGATCA ACTACAAGGT GCGCGAACAC TCGCTGCAAA AGGTTCCGTA CCTGCTGGTG
GTGGGCAAGC GCGAGGCCGA GGAAGGCACC GTGGCGATCC GCATCCTGGG CGAGCAGCAC
CAGAAGGTGA TGCCGCTCGA CGAGGCGATT GCCCTGCTCA AGGGTGAGGC CACGGCGCCG
GATCTCAGGG CCTGA
 
Protein sequence
MSELLKITLP DGSVREVAPG STPADIAAAI GPGLAKAALA AKVDGELVDL TRPFTADAQL 
ALVTAKDEAE ALDLARHDYA HVLAEAVQAL FPGTQITFGP STDDGFYYDF APKDRPFTDE
DLPAIEAEMR KIIAANKPLR REVWSREQLI SRWKQQGESF KAEWAAELPE NEELTVYWSG
DDWLDMCRGP HLPSTGKLDP NAFKLTRVSG AYWRGDQKNA MLSRVYGTGW LNKKQLDAHL
TRLEEAAKRD HRKLGNEMDL FHLQQEAHGS VFWHPKGYLI WRELEAYMRR AIDGAGYREV
KTPQVMDARQ WEQSGHWGKY RENMFVIPDE VPNVDDEGPI VSNDADWMAL KPMNCPAHVL
IFRQGIKSYR ELPLRLYENG CCHRNEPHGA LHGLMRVRQF TQDDAHIFCR EDQIVSEVQA
FCELADRIYK HFGFTYSIKL ALRPEKRFGT EEMWDKAERE LRDAVVRAGL ATEEYGWEEL
PGEGAFYAPK LEWHLTDAIG RTWQVGTIQS DRVLPERLDA SYIGEDGEKH RPVMLHRAIF
GSYERFIGIL IEHFAGRLPV WLAPVQAVVA TIVSDADDYA RDALAKLKAA GIRADTDLRN
EKINYKVREH SLQKVPYLLV VGKREAEEGT VAIRILGEQH QKVMPLDEAI ALLKGEATAP
DLRA