Gene Information |
Locus tag | Dtox_2224 |
Symbol | |
ID | 8429207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 2393875 |
End bp | 2395074 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 645034534 |
Product | protein of unknown function DUF955 |
Protein accession | YP_003191664 |
Protein GI | 258515442 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2856] Predicted Zn peptidase |
TIGRFAM ID | |
|
|
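The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical and introduced only for illustration.

```python
# Values copied from the record above.
START_BP = 2393875
END_BP = 2395074
GENE_LENGTH_BP = 1200
PROTEIN_LENGTH_AA = 399


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_length == GENE_LENGTH_BP)
print(computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions the coordinates give 1200 bp, and 1200 / 3 = 400 codons, one of which is the stop codon, which matches the listed 399 aa.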
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000122328 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.000000000370902 |
Fosmid hitchhiking | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGAAAAAGC AGAAGAAATT TAACGGAGAG CGTCTCAAAA GTGCAAGGAT GTATAATGGG TATACATTAA CGGAGCTTTC TAAAATTACA AATATAAGCA AACAATCACT CTCTCTATAT GAAAATGGAA ACAACAAGCC TGAATGGGAT AATATCTCAA AAATATCGGT AGCTTTAGGA TTTCCTCGGG ACTTCTTTTT ACAAGAGAGT GATTTTAAAG TTAGTACCGA TGCGACATAT TTTAGAGCAT TAACAAGTGT AACTAAAAAA GATAGAACCG CACAGAAGAT CAAGCTTGAG TATCTATCTC AAATATATTT AACGCTGTTT GAATATATTG ATTTTCCTTC ATTAAAAATT CCATACATTG ACTTTTCCTT AACCGAAGTT TTTGAAACAG ATGAAGAGTT ACAGAATATA GAAGATGTTG CTGATAAAAT AAGGAAATAT TGGGGATTGG GATCTGAGCC TATACTAGAT TTAAGATATA TTCTTGAATC TAACGGAATG ATTGTAACGA GTTTTGATGC TGATGCGGAA AAAATAGATG CATTTAGCCA ACGTACTAAT GTTAATAAGG GTGAAGTTTA CTTAATCGCA ATATCCGCAT ATGGGCAAAC AATTGCTAGA GCTAGGTTTG ATATGGCGCA TGAATTAGGT CATATCCTAT TACATCCATG GAGTGAGGAT TTAGAATTAA TCTCTAAGGA AGAATTTCGA GCAAGGGAGC GCCAAGCAAA CATTTTTGCT GGAGCATTTT TGTTGCCAAA AGAAACATTT AGACAAGATG TTTCTCCATA CCCGACTACA CTTGATTACT ATTTACATCT TAAGAAAAAA TGGAATGTTT CTATTGCGGC CATGATTTAC AGAGCATATC AGCTAAAAGT AGTTACAAAT AATCAGTTTC AATATTTGAT GCGTCAGTTG TCGAAGAATG GGTGGAGAAA AAATGAACCT TTGGATACTG AATATAAACT ACAAAATAAT ATTTTGCAAT CAGCTGTAGA TATGTTGATA AATAATAATG TTTTTTCAGG TAAGCAACTT TTAGCTGAAT TAGCTCAGAA GGGGTTATCA ATGTATCCTG AACAAATTGA AGATCTATTA TGTCTGAAGC ATGGAACACT ATCTAAGGGT GAAGAGGATA AATCCCAGAT TATACATTTG AAAGATTATA TCCCACCTTC CGCAAGATAA
|
Protein sequence | MKKQKKFNGE RLKSARMYNG YTLTELSKIT NISKQSLSLY ENGNNKPEWD NISKISVALG FPRDFFLQES DFKVSTDATY FRALTSVTKK DRTAQKIKLE YLSQIYLTLF EYIDFPSLKI PYIDFSLTEV FETDEELQNI EDVADKIRKY WGLGSEPILD LRYILESNGM IVTSFDADAE KIDAFSQRTN VNKGEVYLIA ISAYGQTIAR ARFDMAHELG HILLHPWSED LELISKEEFR ARERQANIFA GAFLLPKETF RQDVSPYPTT LDYYLHLKKK WNVSIAAMIY RAYQLKVVTN NQFQYLMRQL SKNGWRKNEP LDTEYKLQNN ILQSAVDMLI NNNVFSGKQL LAELAQKGLS MYPEQIEDLL CLKHGTLSKG EEDKSQIIHL KDYIPPSAR
|
| |
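The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 30 bases of the gene. This is a minimal sketch, not part of the original record. It uses the amino acid assignments shared by NCBI translation tables 1 and 11 and does not depend on any external library. The names `CODON_TABLE`, `translate`, and `gc_fraction` are hypothetical helpers, and only a short prefix of the sequence is used to keep the example small.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, listed in
# TCAG order for the first, second, and third codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence in the record above.
gene_prefix = "ATGAAAAAGC AGAAGAAATT TAACGGAGAG"
protein_prefix = translate(gene_prefix)
print(protein_prefix)  # MKKQKKFNGE
```

The translated prefix matches the first block of the protein sequence in the record. Applying `gc_fraction` to the full gene sequence would be one way to compare against the listed GC content, keeping in mind that the record reports a rounded percentage.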