Gene Dtox_0749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0749 
Symbol 
ID8427687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp759192 
End bp760361 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content49% 
IMG OID645033107 
Producthistidyl-tRNA synthetase 2 
Protein accessionYP_003190282 
Protein GI258514060 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00443] ATP phosphoribosyltransferase, regulatory subunit 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCATTAA AGGATCGTTT CGGTCGTGTG CCGCCGGGGG TGCGTGATTT ACTGCCGGCG 
GAAGCGGGAA CTAAGCGGGA AATTGAAACA AAATTTGCAC AACTGGTGCA TGCCTGGGGT
TACCAGGAAG TTAGCACACC TATTTATGAA TATTACGAGA ACATCCTGGT AAAGGAAAGC
AGCCAGGAGG ATAAGCTATT TAAATTTTTA GACAGAAACG GCCATTTGCT GGTCCTGAGG
CCCGATATGA CGCTGCCCAT AGCTCGACTG GCAGCTACCG GGTTAAAGAC TGAGACTCTG
CCGCTGCGCC TTTTCTATAC TGGCCATGCT TTTCTTTACG AGTCACCCCA GGCCGGCAGG
CAGAGAGAAT TTTACCAGGC GGGTGTGGAG ATACTCGGTG ACAGCAGTGC CGATGCTGAT
GCAGAGACAG TCATATTGGC GGTCAAGTCG CTGCTTGCCG TCGGAATTCA AGAATTTCAA
ATCAGCCTGG GCCATGTTGG AATATTTCAC GGTTTAGCTG ATGATTTGGG CTTACCGGTT
GTAGAAAAAG AAGAATTGAA GACAGCCATA GGCAATAAAG ATTTCGTGCT CCTGAGGGAA
TTATTGAGCA GGTTTATGAT TGCTCCCGCG GATCAATCAA GACTCTTAAA AGTTTTAAAT
CTGAGAGGCA GCGCAAAAGT TTTGGTCGAG GCCCGGCAGT TGATTGGCGG CGGCAGGGCG
CAAAGCGCCC TTGATAACCT GGAGGAAATA TATGCGGTAT TGCAAGCATA TGGCGTAGAA
AGCCAGGTAA CCTTAGATTT TGGTCTTTTA AGAGAACTGG ACTATTATAC GGGTGCTGTC
TTTGAGGGCT ACACCGGGTC ACTCGGCTTC CCTCTGTGTG GCGGCGGTAG GTATGACAAC
CTTACCGGGC AGTTTGGTTA TGATTTGCCG GCAACCGGCT TTGCCTTGGG TGTGGATAAA
TTGATGCTGG TATTAGATCG GCAGGGCGCT TTAAACAGGG AAGTCTGTTG CGATTACCTG
ATCAGGTATA CCAGAGAGCA AAGGTCTGAG GCTGTCCGGC AGGCAGGTGA ATTGCGGCAG
GCAGGCTATA CTGTGACAAC GCAAATAATT AATTCAGCTG AGCAATCCGG AACCGGTGTA
GCTGCGAAAA AAACTATGGT TCTGGGATAA
 
Protein sequence
MALKDRFGRV PPGVRDLLPA EAGTKREIET KFAQLVHAWG YQEVSTPIYE YYENILVKES 
SQEDKLFKFL DRNGHLLVLR PDMTLPIARL AATGLKTETL PLRLFYTGHA FLYESPQAGR
QREFYQAGVE ILGDSSADAD AETVILAVKS LLAVGIQEFQ ISLGHVGIFH GLADDLGLPV
VEKEELKTAI GNKDFVLLRE LLSRFMIAPA DQSRLLKVLN LRGSAKVLVE ARQLIGGGRA
QSALDNLEEI YAVLQAYGVE SQVTLDFGLL RELDYYTGAV FEGYTGSLGF PLCGGGRYDN
LTGQFGYDLP ATGFALGVDK LMLVLDRQGA LNREVCCDYL IRYTREQRSE AVRQAGELRQ
AGYTVTTQII NSAEQSGTGV AAKKTMVLG