Gene Dtox_1282 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1282 
Symbol 
ID8428230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp1303716 
End bp1304966 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content41% 
IMG OID645033622 
Productprotein of unknown function DUF224 cysteine-rich region domain protein 
Protein accessionYP_003190787 
Protein GI258514565 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.438227 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACAG AAGTTATTAA GGCAGCAAAA TTTGATCCTG CATTCAGAGC CGAGGTTGCC 
AATTTAGTTA AGGGTTTTGA TTTCGGAAAT TGCCTTGCCT GCGGTATGTG TACTGCCGGT
TGTGCTTATT CTGATGTTCA TGCGAACAAT GATCCACGCA AGTTTTTGCG CAAACTTATC
TTAGGCATGC GCGAGGAAGC AATGAATGAT CCTTATTACT GGCTGTGTAC AATGTGTGAG
CGCTGCACAA TTGAATGTCC GATGGGTATT AATGTAGCTG CTATAGTCAG AGGCATAAGA
GGGACTACTA ACGAAGCATT CAGTGATGAT CCTGCAAAAA AAGGACCCGG CTTTATGGTC
AAGGTTGTTG AAGAAACTAT TGAATCAGGC AACCAGATGA ATGTATCCCA GGAAGATTTT
ATGGATACAA TTGAGTGGGT TGAAGAAGAA TTGCAGGCAG AACTTGACGA TCCTGATTAT
AAAATTCCCC TTGATGTTGA AGGTGCCGAT TTTATGTTCG GGTTCAACGC TCGGGAAATT
AAATATTACC CTCAGGAACT GCAAAGCATT TTAAAAATCT TTTATGCCGC AAAAGCTAAC
TATACTATTA GCTCTAAAAA ATGGGATGCT ACAAATTTAG CACTTTTCAG CGGTAATGAT
AAGGACTTTT GGACAATTAC CAAACCGCTC TTTGAAGAAA TGGAACGCTT AAAGGCCAAA
GAGTTGATCG TTACTGAGTG CGGACATGCT TTCCGTTCCT GCCGTATGGG CTACAGGAAT
TTCTGGGATA AGGAGAGCGA TTATAAGTTC CCCATCAGGC ATATCCTGCA ATTAATGGCT
GAGTGGATTA AAGAAGGCCG GATTAAGCTG GATCCGGATA AAATTACTGA AACTGTTACC
TACCATGATC CATGCAATAC TGCCCGTAAA GAGGGAGTTT ATGAGGAACC TCGTTATGTA
ATTAACAGTT TTATCAAGAA CTTTAACGAA ATGTACCCCA ATAAGCAGTG GAATTTATGC
TGCGGCGGTG GTGGTGGAGC CTTGGCTACT CCGGAATATA AAGTTGAGCG TTTAGCCAAA
GGAAGGCTGA AAGCTGATCA AGTTATTAAA ACCGGTGCTA AAATTGTTAT TGTTCCCTGC
CATAACTGCA TGGACCAGTT TAATGATATT AATAAAGAAT TTAAGCTTGG TGTTAAGAAC
GAACACCTTT GTGCTCTAAT AGCGGAAGCT TTAATATTAG ATGAAAAATA G
 
Protein sequence
MSTEVIKAAK FDPAFRAEVA NLVKGFDFGN CLACGMCTAG CAYSDVHANN DPRKFLRKLI 
LGMREEAMND PYYWLCTMCE RCTIECPMGI NVAAIVRGIR GTTNEAFSDD PAKKGPGFMV
KVVEETIESG NQMNVSQEDF MDTIEWVEEE LQAELDDPDY KIPLDVEGAD FMFGFNAREI
KYYPQELQSI LKIFYAAKAN YTISSKKWDA TNLALFSGND KDFWTITKPL FEEMERLKAK
ELIVTECGHA FRSCRMGYRN FWDKESDYKF PIRHILQLMA EWIKEGRIKL DPDKITETVT
YHDPCNTARK EGVYEEPRYV INSFIKNFNE MYPNKQWNLC CGGGGGALAT PEYKVERLAK
GRLKADQVIK TGAKIVIVPC HNCMDQFNDI NKEFKLGVKN EHLCALIAEA LILDEK