Gene Dtox_0076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0076 
Symbol 
ID8426998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp80217 
End bp81581 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content45% 
IMG OID645032471 
Productprotein of unknown function DUF224 cysteine-rich region domain protein 
Protein accessionYP_003189662 
Protein GI258513440 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0100734 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGAGT ATAAAGGAAT GGTTCAGCCT GTAAGGGCTA GTGAAGCTGA TTTAAAGAAA 
ATATTTGTGA CACCGGTACC TGATGACAAG AAACCTGAAT TGGCTTTGAA GTACCTGGAT
GATATTCGCA AAAAATTCCG TTCATTTGTT ATGGCCATGG AGTCTTGTGT TAAGTGCGGT
CAATGTGCGG AAAACTGTCA CACATATTTA GGCACCAGGG ATCCTAATAA TATACCAACT
AATCGTGCCG AGCTGATTAG GAAAATCTAC AAGCGTTATT TTACACTTGA AGGCCGCTGG
TTTGGCAAGC TGGTGGGAGC AGAAGATATT AATCTTGATG TTATTGAATC CTGGTACTCC
TATTATTACC AGTGCAGTCA GTGCAGGCGT TGTGCTTATG TTTGCCCGTT CGGCATTGAT
ACCTGTGAAG TTACCTTTAT TGGCCGGCAG ATACTGCACT GGCTGGGTAT AGTGCCCAAG
CTGCATGCCG GTGTGGGTAA TGCTATGGGC AGAGTAGGCA ACCATACTAA TTTGCCTAAG
CCTGGTATTA TTGACACCCT GGAATTCCTG GCTGAGGACG CAGCTGAAGA ATTCGGGGTA
GAGTTCGAAT TTCCGGTTGA CCAGCCGGCA GATATTATGT ATATACCCTC TTCGGCAGAC
TTTTTATTAA ACCCGTATAC CCTGGTTGGC GCAGGTATGT TTTTTCACTA TATCGGCGCC
AAGTGGACGA TTCCCAGTAC GGTAACTGAA GCCGGTAACT TTGGTCTATT ATTTGACCAG
TACTATACTC AGAGGAGTAA CGTTACACGA TTGCTTAATG CCGCTTTTGA ACTGGGAATT
AAGAAAATTG TCTGGGGTGA ATGTGGTCAC GGCTGGCGTG CGGCAAAAAT GTATATACCC
AGTTTAGCTG ATCGTCCCTT AAGTATTCCA ATCACACACG TTCATGAGGA AATTGCGCAG
CTAATCCGTA CTAATCAGCT TAAACTCGAT CCCAGTAAGA ACCCGTATCC GGTTACCTTG
CATGATCCCT GCAACTATGT AAGGGCCTGT GGTCTTTGCG ACGATATGAG GGTGCTTATG
AATGCATCAG TAGCTGATTT TCGTGAAATG ACCCCGAACC GTCAGAAAAA TTTCTGTTGT
GGTGGAGGTT CAGCTATTCT GTTTGATGAT CCGGAAATGT ACCAGCGCCG TATACAGCTT
TCAGCTAAAA AAGCCGAACA GGTTAGAGAA ACCGGAGCAA AAATATTATG TGCACCTTGT
TCCATTTGTA AAGCCCAGCT TTACCCAATG GTTGAGGAGC ATGAATTGGG TGTAGAAGTA
AAAGGTCTGG TTGATTTAGT CGGTAAAGCC TTGGTTTGGA AGTAG
 
Protein sequence
MPEYKGMVQP VRASEADLKK IFVTPVPDDK KPELALKYLD DIRKKFRSFV MAMESCVKCG 
QCAENCHTYL GTRDPNNIPT NRAELIRKIY KRYFTLEGRW FGKLVGAEDI NLDVIESWYS
YYYQCSQCRR CAYVCPFGID TCEVTFIGRQ ILHWLGIVPK LHAGVGNAMG RVGNHTNLPK
PGIIDTLEFL AEDAAEEFGV EFEFPVDQPA DIMYIPSSAD FLLNPYTLVG AGMFFHYIGA
KWTIPSTVTE AGNFGLLFDQ YYTQRSNVTR LLNAAFELGI KKIVWGECGH GWRAAKMYIP
SLADRPLSIP ITHVHEEIAQ LIRTNQLKLD PSKNPYPVTL HDPCNYVRAC GLCDDMRVLM
NASVADFREM TPNRQKNFCC GGGSAILFDD PEMYQRRIQL SAKKAEQVRE TGAKILCAPC
SICKAQLYPM VEEHELGVEV KGLVDLVGKA LVWK