Gene Dtox_3225 details


Gene Information

Locus tag: Dtox_3225
Symbol:
ID: 8430219
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 3424531
End bp: 3425829
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 48%
IMG OID: 645035469
Product: oxygen-independent coproporphyrinogen III oxidase
Protein accession: YP_003192588
Protein GI: 258516366
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase
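
The reported lengths are consistent with the coordinates: Start bp and End bp are inclusive, so the gene spans End - Start + 1 bases, and the protein length is one less than the codon count because the final codon is a stop. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bases of a feature with inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a complete CDS: codons minus the terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(3424531, 3425829)
print(length, protein_length(length))  # 1299 432
```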


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000826969
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0243619
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATATC ATACGGAAAA CATTTTAATG ACAAGTATAA CAAAAACAAC AAGCGCGGCG 
AATAAAACAG CACCGCTTGC GCAATATAAT GAAGAAATTG GAATGGGACT GTATATACAT
GTCCCATTTT GCATAAGGAA ATGCCTTTAC TGTGATTTTA TTTCCTATCC CTATGAAAAT
GCCGCCGCAG AAATCTACTC GGCAGCCCTG CAAAGAGAAA TAAACTTGTA TGCGGAGCTT
TTCCGGAGCA GTTCCCAGAC GGAAAACCGT AAGAAATTTT TTGATGCTCT AGGAAATTAT
GCTTATGATG AATTTGCTGC TCCTGTGTTT ACCTCTGTTT TTCTGGGAGG CGGAACACCT
ACCTGTTTGC CTGCTTCACT GTTAAGCGTA ATATTAAAAA CGCTCAGACA TTCCCTGCCC
CTGGCTCCCG GTGCCGAATT CACGGCAGAA GCCAACCCCG GAACCGTAAA TGGAGAAAAC
CTGGCGCTTT TTAGGGAATT CGGGGTTAAC CGCTTGAGCC TGGGTGTACA GGCATGTCAA
CCGCAGTTGT TAAAGACGTT GGGCCGTATC CACACTTTTC AGGAGGCTCT GCAGGCTGTA
AAGCTGGCCC GCCGCCAGGG TTTTGACAAT ATCAACCTGG ACTTGATTTT CGGCATTCCC
GGTCAGACAA TGCAGGGCTG GCAGACCTGC TTGGAACAAA TAATTGACCT GAATCCGCAG
CACCTTTCGC TATATGGACT TCAGTTGGAA GAGGGAACTC CCCTGGAGAA ATCGGTAACT
CTTGGCCATA TCGAACCCTG CACAGAGGAA GCCGAGCTGG CCATGTACCG GTATGCCGGA
ACCTTTCTTA AAGAAGCAGG CTTTGAGCAA TACGAAATAT CCAACTTTGC CCGTTCGAAC
AAATACTGCC GCCATAACAT ATTATACTGG CAGCACGGTG AGTACTTGGG CATAGGGCCG
GGAGCCCATT CCTACTTAAA TAAGATCCGC TGCAGCAACA GCGGTGATCT GAAAACCTAC
GCCGAAAAAC TGGCAGCCGG CCAACTTCCA TTGGAGAGCA GTGAAGTGAT CAGTCTGGAA
ACAGAAATAT CCGAAACAAT TTTCCTGGCA CTCAGAATGT TAAATGGTCT GGACCTGGAA
GCCTTTGCCC GCTGCTTTGC TGTCAGAGTG GAAGACTTGT ACAGCCGCCA GATCCGAAAA
CTAACCGGCC TTGGCTTAAT AGAAACAGTA AACGGTTTCC TGCGGCTGAC AGAGGAGGGA
CTCCCCTTGG CCAATATAGT TTTCAGAGAG TTTGTCTAA
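
The gene sequence is printed in space-separated blocks of ten bases. The sketch below shows one way to join the blocks and compute sequence length and GC content, which can then be compared with the Gene Length and GC content fields. The helper names `join_blocks` and `gc_percent` are hypothetical, and the example runs only on the first two blocks of the sequence.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used here only as a small sample.
sample = join_blocks("ATGAAATATC ATACGGAAAA")
print(len(sample), gc_percent(sample))  # 20 25.0
```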
 
Protein sequence
MKYHTENILM TSITKTTSAA NKTAPLAQYN EEIGMGLYIH VPFCIRKCLY CDFISYPYEN 
AAAEIYSAAL QREINLYAEL FRSSSQTENR KKFFDALGNY AYDEFAAPVF TSVFLGGGTP
TCLPASLLSV ILKTLRHSLP LAPGAEFTAE ANPGTVNGEN LALFREFGVN RLSLGVQACQ
PQLLKTLGRI HTFQEALQAV KLARRQGFDN INLDLIFGIP GQTMQGWQTC LEQIIDLNPQ
HLSLYGLQLE EGTPLEKSVT LGHIEPCTEE AELAMYRYAG TFLKEAGFEQ YEISNFARSN
KYCRHNILYW QHGEYLGIGP GAHSYLNKIR CSNSGDLKTY AEKLAAGQLP LESSEVISLE
TEISETIFLA LRMLNGLDLE AFARCFAVRV EDLYSRQIRK LTGLGLIETV NGFLRLTEEG
LPLANIVFRE FV
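
The protein sequence corresponds to translating the gene sequence with translation table 11. The sketch below builds the codon table from the NCBI amino-acid string for table 11, which has the same amino-acid assignments as the standard code. It is a simplified illustration: it ignores alternative start codons (not needed here, since the gene starts with ATG), and `translate` is a hypothetical helper name.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first block of the protein.
print(translate("ATGAAATATC ATACGGAAAA CATTTTAATG"))  # MKYHTENILM
```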