Gene Dtox_1734 details

Gene Information

Locus tag: Dtox_1734
Symbol:
ID: 8428700
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 1829617
End bp: 1830774
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 44%
IMG OID: 645034067
Product: protein of unknown function DUF224 cysteine-rich region domain protein
Protein accession: YP_003191214
Protein GI: 258514992
COG category: [C] Energy production and conversion
COG ID: [COG0247] Fe-S oxidoreductase
TIGRFAM ID:
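
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and that the gene length includes the terminal stop codon; both are assumptions about how this record counts, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1829617, 1830774)
print(length_bp)                  # 1158
print(protein_length(length_bp))  # 385
```

Both values match the Gene Length and Protein Length fields of this record.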


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGAGG CTGAAGCCAG AATACCAATG TTCGAGATAA AAGATGTCAT CGTTGAATAC 
GGTGGCGAAG AAATTACACT ATGCATGCAG TGTGGCGTTT GTGTTGCCAC CTGTCCATGG
AAACGCGTTG GCAGCGAATT TACAATTCGC GAAATGTTGT ATATGGGACG CATGGGTTTC
GAAGGCTATG AAAGCGATGA TGTATTGTTT GCCTGCACCA CATGTAAACA TTGTGCCGTC
CGCTGTCCCC GTGGTATTGA TATTTTTAAC GTGGTCAGAG TTATGCGCAG CATGATCTGC
GAATCAGGAG CTATGCCTAA AAACTTGAAA GCGGTTGTAG GCAGTATCAG CAGTCAGGGC
AACCCCTGGG CTCAAGATAA AAGCAAGAGA GAAATTTGGG CTAAAGATGC TGCAGTGCCC
GCTTTTACAG AGGACACTGA GTACCTTCTC TATGTATGCT GCACCTCGGC TTTTGATTCA
CGCAGCCAGA AAATTGCAAA ATCCATCGCT GAACTGCTAC AGAAAGCCGG AGTAAGCTTT
GGTGTATTGT CGGCAGAGGA AAAATGCTGC GGTGAATCTA TCCGTAAAAT TGGCGCAGAA
GAAGCTTTTA CAGCTCTCGC TGAGCACAAC ATTAACCTCT TTAACAGCAA AGGTGTTAAG
AAAATCATCA CCACTTCACC TCATTGTCAT TATACCTTTA AAAATGAGTA CCCTGCTTTC
GGTGGTGAAT ACGAAGTCTA TCACTATACA GAGATAATCA ATCAACTAGT TAAAGATGGT
AAATTAAGCT TCACCAATCC GGTAGACCAA AAAGTAATTT ACCATGAGCC CTGCTATTTA
GGGCGTCACG CACGCTTGTT CGATGCCCCT CGTGAATTGA TGTCAGCAGT ACCCGAGTTA
AAAGTAGTGG AATTTGATAA TAATAAAGAG GACAGCCTTT GCTGTGGCGG CGGCGGCTCC
CGTATCTGGA TGGAGACTGA AGCCCACATG CGTTTCAGTG ATGAAAAAGT AGAAGAAGCT
GCAGCCAAGG AAGTAAATTA TGTAGTAACA GCCTGCCCTT ACTGCGTAGT AATGTTTGAA
GACAGCGTTA AAACTAAAAA CAAAGATGAA GTTTTAGCAG TAAAAGATTT AAGTGAAATA
CTAAAAGAAA GCCTGTAA
 
Protein sequence
MSEAEARIPM FEIKDVIVEY GGEEITLCMQ CGVCVATCPW KRVGSEFTIR EMLYMGRMGF 
EGYESDDVLF ACTTCKHCAV RCPRGIDIFN VVRVMRSMIC ESGAMPKNLK AVVGSISSQG
NPWAQDKSKR EIWAKDAAVP AFTEDTEYLL YVCCTSAFDS RSQKIAKSIA ELLQKAGVSF
GVLSAEEKCC GESIRKIGAE EAFTALAEHN INLFNSKGVK KIITTSPHCH YTFKNEYPAF
GGEYEVYHYT EIINQLVKDG KLSFTNPVDQ KVIYHEPCYL GRHARLFDAP RELMSAVPEL
KVVEFDNNKE DSLCCGGGGS RIWMETEAHM RFSDEKVEEA AAKEVNYVVT ACPYCVVMFE
DSVKTKNKDE VLAVKDLSEI LKESL
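
The gene and protein sequences above can be cross-checked with a minimal sketch. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (the two differ in permitted start codons, which this sketch ignores). The helper names `clean`, `translate`, and `gc_percent` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments are those of the
# standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(nt: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    nt = clean(nt)
    residues = []
    for i in range(0, len(nt) - len(nt) % 3, 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(nt: str) -> float:
    """Percentage of G and C bases in the sequence."""
    nt = clean(nt)
    return 100.0 * sum(base in "GC" for base in nt) / len(nt)


# First line of the gene sequence from this record.
first_line = "ATGTCAGAGG CTGAAGCCAG AATACCAATG TTCGAGATAA AAGATGTCAT CGTTGAATAC"
print(translate(first_line))  # MSEAEARIPMFEIKDVIVEY
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.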