Gene Sare_1770 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1770 
SymbolthrS 
ID5704512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2040267 
End bp2042282 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content69% 
IMG OID641271273 
Productthreonyl-tRNA synthetase 
Protein accessionYP_001536648 
Protein GI159037395 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0365193 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000990893 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTCCGCAC CCCGTAACCC CGTCGTGGCC GACCCCGTCG TCGTGGCCGC CGGGACGACG 
GCGGCCGACG CGGTCGCGGC GGCCGGGCTT CCGGTGGCCG GTCGGGCCGC GGTCGTGGTG
GTGCGGGATC CGCAGGGCCA GCTTCGCGAC CTGGACTGGA CCCCGGTGGA GGAGACCGTG
GTCGAGCCGG TCGGCATCGA CACCCCGGAC GGGCTGGCCG TGCTGCGGCA CTCCACCGCC
CACGTGCTCG CCCAGGCCGT GCAGGACCTC TTTCCCGAGG CGAAGCTGGG CATCGGCCCA
CCGATCGAGA ATGGTTTCTA CTACGACTTC GCGGTCGACC GGCCGTTCCA GCCCGAGGAC
CTGACAAAGC TGGAAAAGCG GATGCAGGAG ATCATCAAAT CCGGCCAGCG GTTCCGTCGC
CGCCGCTTCC GCAACCTGGA CGAGGCGCGC GGTGAACTCG CGGCCGAGCC GTTCAAGCTG
GAGCTGATCG AACTCAAGGG TGACGGGCTG GACTCCTCCC AGGTGATGGA GGTGGGTGGC
GGCGAACTGA CCAGCTACGA CAACCTTGCC GCCGATGCGG ACAAGGTCTG CTGGTCGGAC
CTGTGCCGCG GTCCCCACCT GCCGAACACC CGACTCATCG GGGCCTTCAA ACTGATGCGG
TCGGCCGCCG CGTACTGGCG TGGTTCGGAG AAGAACCCAC AGCTCCAGCG GGTGTACGGC
ACCGCCTGGC CGACCCGGGA CGAGTTGAAG GCGTACCTGC GGCTCCTGGA GGAGGCCGCC
CGGCGCGACC ACCGCAAGCT CGGTACCGAC CTCGACCTGT TCAGCTTCCC GGAGGAGATC
GGCTCAGGCC TGCCGGTGTT CCACCCCAAG GGCGGCGTGC TCAAGCGGAC GATGGAGGAC
TACGTCCGTG CCCGGCACAT CGAGGAGGGC TTCGACTACG TCGGCACCCC GCACATCTCG
AAGGAAGGGC TCTTCCACAC CTCGGGCCAC CTGCCGTACT ACGCCGACGG GATGTTCCCG
CCCATGCACC TGGAGGGGGC GGACTACTAC CTCAAGGCGA TGAACTGCCC GATGCACAAC
CTGATCTACC GGTCCCGCGG GCGGTCCTAC CGGGAACTGC CGATGCGGCT GTTCGAGTTC
GGCTCGGTCT ACCGTGACGA GAAGTCCGGC GTCATCCATG GGCTGACCCG GGTGCGCGGT
TTCACCCAGG ACGACTCGCA CTCCTACTGC ACCAAGGAGC AGGCGCCCGC CGAGATCAAG
CATCTGCTGG CCTTCGTGCT CGGGCTCCTG CAGGACTTCG GGATCACCGA CTTCGTCCTC
GAGTTGTCCA CCCGCGACGA CGCCAACCCG GACAAGTTCG TCGGCTCCGC CGAGGACTGG
GCAACGGCGA CGGCGGTGCT GGAGCAGTGC GCCAGGGACA CCGGGCTCGA CCTGGTGCCG
GATCCGGGAG GCGCGGCCTT CTACGGTCCG AAGATCTCGG TGCAGGCCAA GGACGCCATC
GGCCGGACCT GGCAGATGTC GACCATCCAG TATGACTTCA ACCAGCCGGC GGGCTTCGGG
CTGGAGTACC AGGCCGCCGA TGGCAGCCGT CAGCGGCCGG TCATGATCCA CTGTGCGAAG
TTCGGCTCGA TCGAGCGGTT CATCGGCGTG CTCACCGAGC ACTACGCCGG GGCGTTCCCG
GCGTGGCTGG CGCCGGTGCA GGTGGTCGGC ATCCCGATCC GCGAGGACCA CACCGAGTAC
CTGGACGGGT TCGTCGCCGG GCTGCGCGCT GCGGGGATCC GGGCCCAGGT GGACGCCGGT
GACGACCGGA TGCAGAAGAA GATCCGCACC GCCCAGCAGC AGAAGATCCC GTTCATGGCG
ATCGCGGGGG ACGACGACGT GGCGGCGGGC ACCGTGTCGT TCCGCTACCG GGACGGCTCG
CAACGTAACG GGGTGCCGGT CGCCGAGGCG GTCAGCCACG TCCTCGACGT GGTCAACTCC
CGCGCCAACC AGGGCCCCTC CGCCGCCCAG GAGTAA
 
Protein sequence
MSAPRNPVVA DPVVVAAGTT AADAVAAAGL PVAGRAAVVV VRDPQGQLRD LDWTPVEETV 
VEPVGIDTPD GLAVLRHSTA HVLAQAVQDL FPEAKLGIGP PIENGFYYDF AVDRPFQPED
LTKLEKRMQE IIKSGQRFRR RRFRNLDEAR GELAAEPFKL ELIELKGDGL DSSQVMEVGG
GELTSYDNLA ADADKVCWSD LCRGPHLPNT RLIGAFKLMR SAAAYWRGSE KNPQLQRVYG
TAWPTRDELK AYLRLLEEAA RRDHRKLGTD LDLFSFPEEI GSGLPVFHPK GGVLKRTMED
YVRARHIEEG FDYVGTPHIS KEGLFHTSGH LPYYADGMFP PMHLEGADYY LKAMNCPMHN
LIYRSRGRSY RELPMRLFEF GSVYRDEKSG VIHGLTRVRG FTQDDSHSYC TKEQAPAEIK
HLLAFVLGLL QDFGITDFVL ELSTRDDANP DKFVGSAEDW ATATAVLEQC ARDTGLDLVP
DPGGAAFYGP KISVQAKDAI GRTWQMSTIQ YDFNQPAGFG LEYQAADGSR QRPVMIHCAK
FGSIERFIGV LTEHYAGAFP AWLAPVQVVG IPIREDHTEY LDGFVAGLRA AGIRAQVDAG
DDRMQKKIRT AQQQKIPFMA IAGDDDVAAG TVSFRYRDGS QRNGVPVAEA VSHVLDVVNS
RANQGPSAAQ E