Gene Hlac_1764 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1764 
SymbolthrS 
ID7399636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1780411 
End bp1782360 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content65% 
IMG OID643708829 
Productthreonyl-tRNA synthetase 
Protein accessionYP_002566413 
Protein GI222480176 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0076873 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGCGAGA CCGATGCGCA GGCCGACGTG ACGGTCGTCC TTCCGGACGG AGCAGAGCTC 
GACGTGCCGG CGGGGGCGAC AGTCGAAGAC GTTGCCTTCG AGATCGGCCC CGGGCTCGGT
CGCGACACGG TCGCGGGGAA GATCGACGGC GAACTCGTCG AAAAGTACGC GACGGTCCAC
GACGGCGCAC GGATCGAGAT CGTCACGGAT CAGTCCGACG AGTACCTCAC GGTGTTGCGC
CACTCCGCCG CCCACGTGTT CGCGCAGGCC CTCCAGCGGC TCCATCCCGA AGCGACGCTC
ACGATCGGCC CCCCGACCGA CGAGGGGTTC TACTACGACG TGACCGACGT CGACCTCGAC
GAGGACGACC TCGACGCCAT CGAGGAGGAG ATGGACGAGA TCATCGCGGC CGACTACGAC
ATTGAGCGCG AGGTCCGGTC CCGTGAGGAG GCCAAAGAGA TCTACGCCGA CAACGAGTAC
AAACTGCAGA TACTCGACGA GGAAGCCGAC GACGAGGAGG TCACCTTCTA CGTGCAGGAC
GACTGGCAGG ACCTCTGTCA GGGCCCCCAC GTCGAGTCGA CGGGCGAGAT CGGGGCGACG
ACCCTCTTGG AGGTCTCGGC GGCCTACTGG CGCGGTGACG AGGACAACGA CACGCTGACG
CGCGTGTACG GGACGGCGTT CGCCTCTGAA TCGGATCTGA AAGAGCATCT CGAACTGCGC
GAGGAGGCGT TAGAGCGCGA CCACCGGAAG ATCGGACAGG AGATGAACCT CTTCTCGATC
CCGACGGTGA CCGGTCCCGG CCTCCCGCTG TACCACCCGC CGGGCAAGAC CGTGCTCCGT
GAACTCTCGA ACTTCGCGAA CGAGCTCAAC CGCGAGAACG GCTACGAGGA GGTCGAGACC
CCGCACGTGT TCCGGACGGA GCTGTGGAAG AAGTCCGGCC ACTACGAGAA CTACAAAGAC
GACATGTTCC TCCTCGACGT CAACGACGAG GAGTACGGGC TCAAGCCGAT GAACTGTCCG
GGCCACGCGA CCATCTTCGA CCAGCAGTCG TGGTCGTACC GCGACCTGCC GCAGCGCTAC
TTCGAGAACG GGAAGGTGTA CCGGAAAGAG CAGCGCGGCG AGCTCTCCGG GCTCTCGCGC
GTCTGGTCGT TCACCATCGA CGACGGGCAC CTGTTCGTCC GGCCCGACCA GATCCGTCAG
GAGATCGAGT CGGTCATCGA GATGATCTTC GAGGTCGTCG AGACGCTGGA TCTGGACGTC
GAGGTCGCGC TGGCGACCCG CCCCGACAAA TCCGTCGGCG GCGACGAGAT CTGGGAGTCC
GCGGAAGAGC AGCTCCGCGA CGTGCTCGAA TCGGGCGGCT ACGAGTACGA CGTCGAGCCC
GGCGACGGCG CCTTCTACGG CCCGAAGATC GACTTCGGCT TTGAGGACGC GCTGGGTCGC
GTCTGGGACG GCCCCACCGT CCAGCTCGAC TTCAACATGC CCGACCGGTT CGACCTGACC
TACACGGGCG AGGACAACGA AGACCACCAG CCGGTGATGA TCCACCGCGC GCTGTACGGG
AGCTACGAGC GCTTCTTCAT GGTGCTCATC GAGCACTTCG ACGGCGACTT CCCGCTGTGG
CTCGCGCCCG AACAGGTCCG CATCCTCCCC GTCTCCGACG AGACGCTCGG CTACGCCCAC
CGCGTGAAAA ACGAACTGGA GGACGCCGGC TTCCGCGTCG AGGTCGAGGA CCGCGACTGG
ACGGTCGGTC GCAAGATCCG CGCCGGTCAC GACGACCGGC TCCCGTACAT GGTCATCGTC
GGCGAGGACG AGCAGGAGGC GGGCACCGTC TCGGTCCGTG ACCGCTTCGA GAACCAACGC
GGCGATGTCG ACCTCGACGC GTTCGTCGAC CACCTCGTCG CCGAGCGCGA CGAGAAGCGT
ACTGAGCCGG AGTTCGTCGA CGCGGAGTGA
 
Protein sequence
MSETDAQADV TVVLPDGAEL DVPAGATVED VAFEIGPGLG RDTVAGKIDG ELVEKYATVH 
DGARIEIVTD QSDEYLTVLR HSAAHVFAQA LQRLHPEATL TIGPPTDEGF YYDVTDVDLD
EDDLDAIEEE MDEIIAADYD IEREVRSREE AKEIYADNEY KLQILDEEAD DEEVTFYVQD
DWQDLCQGPH VESTGEIGAT TLLEVSAAYW RGDEDNDTLT RVYGTAFASE SDLKEHLELR
EEALERDHRK IGQEMNLFSI PTVTGPGLPL YHPPGKTVLR ELSNFANELN RENGYEEVET
PHVFRTELWK KSGHYENYKD DMFLLDVNDE EYGLKPMNCP GHATIFDQQS WSYRDLPQRY
FENGKVYRKE QRGELSGLSR VWSFTIDDGH LFVRPDQIRQ EIESVIEMIF EVVETLDLDV
EVALATRPDK SVGGDEIWES AEEQLRDVLE SGGYEYDVEP GDGAFYGPKI DFGFEDALGR
VWDGPTVQLD FNMPDRFDLT YTGEDNEDHQ PVMIHRALYG SYERFFMVLI EHFDGDFPLW
LAPEQVRILP VSDETLGYAH RVKNELEDAG FRVEVEDRDW TVGRKIRAGH DDRLPYMVIV
GEDEQEAGTV SVRDRFENQR GDVDLDAFVD HLVAERDEKR TEPEFVDAE