Gene Dtox_2224 details

Gene Information

Locus tag: Dtox_2224
Symbol:
ID: 8429207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 2393875
End bp: 2395074
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 34%
IMG OID: 645034534
Product: protein of unknown function DUF955
Protein accession: YP_003191664
Protein GI: 258515442
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2856] Predicted Zn peptidase
TIGRFAM ID:
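
The Start bp, End bp, Gene Length, and Protein Length fields are mutually consistent. The sketch below checks the arithmetic. It assumes 1-based inclusive coordinates and that the protein length excludes the terminal stop codon; the record states neither convention, and the function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2393875, 2395074)
print(length)                  # 1200
print(protein_length(length))  # 399
```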


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000122328
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000000370902
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGAAAAAGC AGAAGAAATT TAACGGAGAG CGTCTCAAAA GTGCAAGGAT GTATAATGGG 
TATACATTAA CGGAGCTTTC TAAAATTACA AATATAAGCA AACAATCACT CTCTCTATAT
GAAAATGGAA ACAACAAGCC TGAATGGGAT AATATCTCAA AAATATCGGT AGCTTTAGGA
TTTCCTCGGG ACTTCTTTTT ACAAGAGAGT GATTTTAAAG TTAGTACCGA TGCGACATAT
TTTAGAGCAT TAACAAGTGT AACTAAAAAA GATAGAACCG CACAGAAGAT CAAGCTTGAG
TATCTATCTC AAATATATTT AACGCTGTTT GAATATATTG ATTTTCCTTC ATTAAAAATT
CCATACATTG ACTTTTCCTT AACCGAAGTT TTTGAAACAG ATGAAGAGTT ACAGAATATA
GAAGATGTTG CTGATAAAAT AAGGAAATAT TGGGGATTGG GATCTGAGCC TATACTAGAT
TTAAGATATA TTCTTGAATC TAACGGAATG ATTGTAACGA GTTTTGATGC TGATGCGGAA
AAAATAGATG CATTTAGCCA ACGTACTAAT GTTAATAAGG GTGAAGTTTA CTTAATCGCA
ATATCCGCAT ATGGGCAAAC AATTGCTAGA GCTAGGTTTG ATATGGCGCA TGAATTAGGT
CATATCCTAT TACATCCATG GAGTGAGGAT TTAGAATTAA TCTCTAAGGA AGAATTTCGA
GCAAGGGAGC GCCAAGCAAA CATTTTTGCT GGAGCATTTT TGTTGCCAAA AGAAACATTT
AGACAAGATG TTTCTCCATA CCCGACTACA CTTGATTACT ATTTACATCT TAAGAAAAAA
TGGAATGTTT CTATTGCGGC CATGATTTAC AGAGCATATC AGCTAAAAGT AGTTACAAAT
AATCAGTTTC AATATTTGAT GCGTCAGTTG TCGAAGAATG GGTGGAGAAA AAATGAACCT
TTGGATACTG AATATAAACT ACAAAATAAT ATTTTGCAAT CAGCTGTAGA TATGTTGATA
AATAATAATG TTTTTTCAGG TAAGCAACTT TTAGCTGAAT TAGCTCAGAA GGGGTTATCA
ATGTATCCTG AACAAATTGA AGATCTATTA TGTCTGAAGC ATGGAACACT ATCTAAGGGT
GAAGAGGATA AATCCCAGAT TATACATTTG AAAGATTATA TCCCACCTTC CGCAAGATAA
 
Protein sequence
MKKQKKFNGE RLKSARMYNG YTLTELSKIT NISKQSLSLY ENGNNKPEWD NISKISVALG 
FPRDFFLQES DFKVSTDATY FRALTSVTKK DRTAQKIKLE YLSQIYLTLF EYIDFPSLKI
PYIDFSLTEV FETDEELQNI EDVADKIRKY WGLGSEPILD LRYILESNGM IVTSFDADAE
KIDAFSQRTN VNKGEVYLIA ISAYGQTIAR ARFDMAHELG HILLHPWSED LELISKEEFR
ARERQANIFA GAFLLPKETF RQDVSPYPTT LDYYLHLKKK WNVSIAAMIY RAYQLKVVTN
NQFQYLMRQL SKNGWRKNEP LDTEYKLQNN ILQSAVDMLI NNNVFSGKQL LAELAQKGLS
MYPEQIEDLL CLKHGTLSKG EEDKSQIIHL KDYIPPSAR
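
The protein sequence can be reproduced from the gene sequence using translation table 11. The sketch below is a minimal illustration, not the database's own pipeline. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. It relies on table 11 sharing the standard code's codon-to-amino-acid assignments (the two tables differ only in permitted start codons), and it does not special-case alternative start codons, which is sufficient here because this gene starts with ATG. It also shows one way a GC fraction such as the value reported above could be computed.

```python
# Codon table built from the NCBI ordering of bases (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the record's 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
print(translate("ATGAAAAAGC AGAAGAAATT TAACGGAGAG"))  # MKKQKKFNGE
```

Pasting the full gene sequence into `translate` and `gc_fraction` gives values that can be compared against the Protein sequence and GC content fields.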