Gene Dtox_1983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1983 
Symbol 
ID8428965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp2139991 
End bp2141703 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content44% 
IMG OID645034310 
ProductNitrilase/cyanide hydratase and apolipoprotein N- acyltransferase 
Protein accessionYP_003191441 
Protein GI258515219 
COG category[R] General function prediction only 
COG ID[COG0388] Predicted amidohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.314908 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0215342 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGGAA AATTCCAAGA AGAAAATGCA GACCAGGAAG TTAAATTTGC CTTAATCCAT 
CCTGCTTTGG AATGGAAGAA TAAAGAAAAC AATATCCAAA AACTAATGAT TTTGAATGAG
AAGGCCGCCA GTGAGGGTGC CAGGATAATT TTAAATACAG AAATGGCTGC AACAGGCTAC
TCCTTTGCCG GCAGTTCTGA GATTGCTCCA TTGACAGAAG TTATACCAGG TCCTACTACT
GAGCGCTTTG GCAGCATTGC CCGGAAATAT CATTGTTATA TTTGTATCGG GCTGCCGGAA
GTGGATCCAG GAGTAGGTAG TTTATATAAT TCAGCAGCCC TGATTGGACC TGATGGTGAA
GTGATCGGTA AGTACCGAAA GGTTTTTCCG GCTTTTAAAG AAAACCTTTG GGCCAGAAAA
GGGAATCTTC CCATACTGGT GGCAGAAACT GAGTATGGTA AGTTGGGAGT GATCATTTGT
GCCGATGCCT ATTCCTATAA GCCGCCTAGG ATTGCAGCTT TAAAAGGCGC CAGATTGCTG
CTCATATTGG CCAACTGGCC CCCTCACCAC CATAACCCGC AGGATATCTG GCGTGCCCGG
GCGGTTGAAA ATGGGATATA TATACTGGTC TGTAATCGGA CAGGAAAAGA TAAGACTATG
AACTATATTT TTGCCGAATC TTTCATTATT GACAATAAGG GAAAGATAAT AACAAGAATG
CAATCGGCAG AGGATACCAT TATTTATGGG ACAGTGCCAT TGGTAAAGGG AACCTTTATC
TCGTCAGCAG ATTTCATTCT TGGCCAGAGA CAGCCCGAAT TGTATGGAAA AATATCTTTA
GATACTTTTT CCCAGCCGGT CCCGGAGGCG CTTCTCAGTC TGCCCGAGCC TAAATTATTT
GGTGTGGCGA CTGTGCAATT TCGCCCCGTC GCTGAAAAGG TGGAAGAGAA CAGACAAAAG
ATGCTGGAAT TGATTGACCG GGCAACAGCT GTTGCCGCCC AAAAGGGTAT AGAGCTTAAC
CTTATTCTTT TCCCTGAATT AGCTGCCACA GGAGCTATTT CAGATGCTCG CAAGATACAA
GAACTTGCCG AAGAGATTCC GGGTGCCGGT ACAGCGGTGT TCACCGAGAA GGCCGGAGAG
AACAATGTTT ACATAGTGCT GGGGATAGTG GAAAAGCAAG GAGTGGACTA TTTTAATACC
GCTGTTTTAA TCGGAGCCGA GGGGATGCTG GGGAAATACA GAAAGGTGCA TCTTACCTCA
CAGGATAAAA CATGGGCCTG TGCGGGAAAA GAAGGTTTCC CTACTTTTGA TCTCCCTTTT
GGCAGAGTGG CTATTTTAAT CGGTTATGAT CTGATTTTTC CTGAAAGTGT TGAATGCCTG
GCCAAGTGGG GTACTGATCT GCTATGTGTT CCCTCTCTTT GGGGTGATGA GAAGAGCAAG
TTCATCTGGG AAGCCAGAAT AACAGAACAA ATGCATCTTG CTATAGCTAA CCAGTGGGGA
GATTCCGGCG ATTATCAGTC CTTGGGAGAG AGCCTTATCT ATAGTTATAG TTCTTATCCG
GAAAAGAGGA TAAGACGGAA ATCTCCTGCC GCAGGGGACA TGATTAATAT TTTAACGTTA
AACTCAAAAA GTACCAGAGA AAAAAGGTTT TTGGAAAATA TAGATTACGA TATAATACTT
GGAGTGACAA AAAGGGAAAA GATTAAGACG TAG
 
Protein sequence
MGGKFQEENA DQEVKFALIH PALEWKNKEN NIQKLMILNE KAASEGARII LNTEMAATGY 
SFAGSSEIAP LTEVIPGPTT ERFGSIARKY HCYICIGLPE VDPGVGSLYN SAALIGPDGE
VIGKYRKVFP AFKENLWARK GNLPILVAET EYGKLGVIIC ADAYSYKPPR IAALKGARLL
LILANWPPHH HNPQDIWRAR AVENGIYILV CNRTGKDKTM NYIFAESFII DNKGKIITRM
QSAEDTIIYG TVPLVKGTFI SSADFILGQR QPELYGKISL DTFSQPVPEA LLSLPEPKLF
GVATVQFRPV AEKVEENRQK MLELIDRATA VAAQKGIELN LILFPELAAT GAISDARKIQ
ELAEEIPGAG TAVFTEKAGE NNVYIVLGIV EKQGVDYFNT AVLIGAEGML GKYRKVHLTS
QDKTWACAGK EGFPTFDLPF GRVAILIGYD LIFPESVECL AKWGTDLLCV PSLWGDEKSK
FIWEARITEQ MHLAIANQWG DSGDYQSLGE SLIYSYSSYP EKRIRRKSPA AGDMINILTL
NSKSTREKRF LENIDYDIIL GVTKREKIKT