Gene Dtox_0871 details


Gene Information

Locus tag: Dtox_0871
Symbol:
ID: 8427810
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 879519
End bp: 881714
Gene Length: 2196 bp
Protein Length: 731 aa
Translation table: 11
GC content: 52%
IMG OID: 645033219
Product: catalase/peroxidase HPI
Protein accession: YP_003190393
Protein GI: 258514171
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0376] Catalase (peroxidase I)
TIGRFAM ID: [TIGR00198] catalase/peroxidase HPI
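
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a stop codon (the record lists the gene as unspliced and not a pseudogene). The function names are hypothetical and only illustrate the arithmetic.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 879519
END_BP = 881714
GENE_LENGTH_BP = 2196
PROTEIN_LENGTH_AA = 731


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one for the stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of three.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With the values above, 881714 - 879519 + 1 gives 2196 bp, and 2196 / 3 - 1 gives 731 aa, which matches the listed gene and protein lengths.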


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0581387
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATGAAA AAGAAAAATG TCCCGTAACG GGCAGAACTA ACAATGCCAT TGCAGGCGCC 
GCTAATTCTA ACAGGGACTG GTGGCCAAAC CAGTTGAACC TCAAGATTCT TCATCAGAAC
TCAAACTTGA GTAATCCGAT GGCTCTGGAA TTCAATTATG CTGAAGAATT CAAGAAGCTC
GATCTGGATG CCCTGAAGAA GGACCTATAC GCGTTGATGA CCGACTCGCA GGATTGGTGG
CCTGCCGATT ATGGTCACTA CGGGCCGTTC TTCATCCGGA TGGCGTGGCA CAGCGCAGGC
ACGTATCGCA CAAACGACGG TCGTGGCGGC GCAGGTAAGG GGACACAACG CTTTGCACCC
CTCAACAGCT GGCCCGACAA CGTAAACCTC GACAAGGCAC GCCGTTTACT TTGGCCAATC
AAGCAGAAAT ACGGCAGTAA GATCTCCTGG GCTGACCTCA TGATCCTTGC CGGTAATTGT
GCTTTAGAGT CTATGGGACT CAAAACGTTC GGTTTTAGTG GCGGACGCGA GGATGTGTGG
GAGCCGCAAG AAGACATTTA CTGGGGAACC GAGAGTGAGT GGCTGGGCGA CAATCGCTAC
TCCGGTGATC GGGAGCTCGA GAATCCACTC GCCGCCGTGC AGATGGGCCT GATCTACGTG
AACCCGGAAG GGCCGAATGG CAACCCAGAT CCGGTCGCCT CCGGCCGTGA CGTTCGGGAG
ACCTTTGCAC GTATGGCCAT GAATGATGAA GAGACTGTCG CACTCGTCGC CGGCGGGCAT
ACTTTCGGCA AATGTCATGG CATGGGAGAT CCGACGCTTG TCGGTCCGGA ACCTGAAGCT
GCCGGCATTG AGGAACAGGG GCTCGGCTGG AAGAGCAGCT TCGGCAGCGG AAAAGGTGGC
GATACGATCA GTAGCGGTAT TGAGGGTGCC TGGAAGCCAA ACCCGACCAA ATGGGACATG
GGCTATCTAA ACATGCTGTT CAAATACGAG TGGGAGTTGG TCAAGAGCCC GGCGGGAGCG
CATCAGTGGC TGGCTAAAGA TGTAGACGAA GAAGATATGG TGATTGACGC ACACGACCCA
TCGAAGAAGC ACCGGCCGAT GATGACCACG GCGGATCTCT CTCTTCGCTA CGATCCAATC
TACGAGCCAA TCTCGCGACG CTACCAGCAA AACCCCGAGG AATTCACGGA TGCCTTCGCT
CGCGCGTGGT TCAAGCTAAC TCACCGCGAC ATGGGTCCTC GCTCGCGCTA TCTCGGTGCG
GAGGTTCCTG AGGAGGAACT GATTTGGCAA GATCCGGTGC CCGCGATCGA TCATGAATTA
GTTGACGAAC ATGATATCGA AGATCTCAAG AGCAAAATTC TGGCTTCTGG GCTGTCTGTC
TCCCAATTGG TTTCAACAGC CTGGGCTTCG GCGTCTACAT TCCGTGGCTC CGATAAGCGC
GGCGGGGCAA ACGGAGCGCG CATTCGTCTT GCACCACAGA AGGATTGGGA AGTCAACCAG
CCGGCTCAAC TAAACACTGT CCTTAATGTT CTGGAAAAAA TCATAGCCGA GTTTAATAGT
GCACAGACAG GTCCAAAGAA AATTTCGCTC GCGGACTTGA TTGTCCTGGG TGGATGTGCA
GGTATAGAGC AAGCTGCAAA GAATGCCGGT TGCAATGTGT CCGTTCCTTT TATACCGGGA
CGCACGGATG CATCGCAGGA GCAAACCGAC GTACAGTCAT TTTCGGTGCT TGAACCAATT
GCAGATGGGT TCCGAAATTA TCAAAAAACC AAGTATTCTG CATCTGCAGA GGAACTGTTG
GTTGATCGTG CGCAACTGTT GACATTGACC GCTCCTGAAA TGACTGTTCT GCTTGGTGGC
ATGCGCGTCC TGAATACCAA TCACGGGCAA TCTCAACACG GTGTCTTCAC CAAGAGACCA
GAGGCGCTCA CGAATGACTT TTTCGTAAAT CTTCTCGACA TGAGCACTAC ATGGAAGGCG
ATATCTGAAG ACGAAAACGT ATTCGAGGGT CGTGATCGCT CAACGGGCGA AATCAAGTGG
ACTGGTACCA GTGTTGATCT TATCTTCGGT TCAAACTCCC AACTACGGGC TATAGCTGAA
GTCTATGCAT GTGACGACTC TCAGAGTAAG TTCATAAATG ACTTCGTATC TGCTTGGAAT
AAGGTGATGA ACGCAGATCG TTTCGACCTT TCTTGA
 
Protein sequence
MYEKEKCPVT GRTNNAIAGA ANSNRDWWPN QLNLKILHQN SNLSNPMALE FNYAEEFKKL 
DLDALKKDLY ALMTDSQDWW PADYGHYGPF FIRMAWHSAG TYRTNDGRGG AGKGTQRFAP
LNSWPDNVNL DKARRLLWPI KQKYGSKISW ADLMILAGNC ALESMGLKTF GFSGGREDVW
EPQEDIYWGT ESEWLGDNRY SGDRELENPL AAVQMGLIYV NPEGPNGNPD PVASGRDVRE
TFARMAMNDE ETVALVAGGH TFGKCHGMGD PTLVGPEPEA AGIEEQGLGW KSSFGSGKGG
DTISSGIEGA WKPNPTKWDM GYLNMLFKYE WELVKSPAGA HQWLAKDVDE EDMVIDAHDP
SKKHRPMMTT ADLSLRYDPI YEPISRRYQQ NPEEFTDAFA RAWFKLTHRD MGPRSRYLGA
EVPEEELIWQ DPVPAIDHEL VDEHDIEDLK SKILASGLSV SQLVSTAWAS ASTFRGSDKR
GGANGARIRL APQKDWEVNQ PAQLNTVLNV LEKIIAEFNS AQTGPKKISL ADLIVLGGCA
GIEQAAKNAG CNVSVPFIPG RTDASQEQTD VQSFSVLEPI ADGFRNYQKT KYSASAEELL
VDRAQLLTLT APEMTVLLGG MRVLNTNHGQ SQHGVFTKRP EALTNDFFVN LLDMSTTWKA
ISEDENVFEG RDRSTGEIKW TGTSVDLIFG SNSQLRAIAE VYACDDSQSK FINDFVSAWN
KVMNADRFDL S
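
Both sequences above are laid out in space-separated blocks of 10 residues, six blocks per line, so the whitespace has to be removed before they are passed to other tools. The following is a minimal sketch of that cleanup together with a simple GC-percentage calculation. The helper names are hypothetical, and only the first and last lines of the gene sequence are used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTATGAAA AAGAAAAATG TCCCGTAACG GGCAGAACTA ACAATGCCAT TGCAGGCGCC"
last_line = "AAGGTGATGA ACGCAGATCG TTTCGACCTT TCTTGA"

head = join_blocks(first_line)
tail = join_blocks(last_line)

print(len(head))             # number of bases on a full line
print(head.startswith("ATG"))  # the sequence begins with a start codon
print(tail.endswith("TGA"))    # and ends with a stop codon
```

Applying `join_blocks` to the full gene sequence and then `gc_percent` to the result gives a value that can be compared with the GC content field in the Gene Information section.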