Gene Dtox_2163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_2163 
Symbol 
ID8429146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp2333548 
End bp2334741 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content38% 
IMG OID645034479 
Productpeptidase M24 
Protein accessionYP_003191609 
Protein GI258515387 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000118517 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0215342 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC AAATACCGTT AAGCGAACTT GAGAGACGTA TTAACTCTTT AAGAACCAAA 
ATGGAGAAAT TATGTCCTGA CTGGGAAATG ATTGCTATTT TTAGTAACAT CAACTTATAT
TATTTCACCG GAACCGTACA AGATGGTATG TTGCTTATAC CCCGAAACGA TGATGCAGTA
TTCTGGGTAA GACGCAGTTA TGAACGAGCT TTGGATGAAT CGTTATTTAC AAGGATTAAA
CCAATGACTA GTTACCGTGA GGCAGCAGCA TCTATGAAAA AATTTCCTGA AACAGTTTAC
ATGGAAACTG AGATAGTACC GCTGGCGTTA TACCAGCGCT TTCAGAAGTA CTTCCCTTTT
ACTCATGTGA AATCAGTTGA TAAATATATA GCCGGGGTAA GAGCAATTAA GAGCAGTTTT
GAACTTTCAT TGATGGTACA GGCTGGTTCA ATTCACCAGA GGACACTGGA ACAATATGTA
CCACAACTGT TGAAAGAAGG TATCAGTGAA GTAGATTTAG CAACCGAATT ATATTCTATT
ATGGTAACTG AAGGACATCA CGGAGTTGCA CGCTTTGGTG CATTTGGCGC AGAGATAGTA
TTAGGGCTTG TATGTTTTGG TGATAGTTCT ATCTATCCAA CATATTTTGA TGTGCCGGGA
GGTAATTACG GTTCGTGCCC TGCGGTACCA TTACTAGGCA ATCGTTATCG CAAGCTTAAG
AAAGGTGATC TAGTTTTTAT TGATATAGGT TGTGGAGTAA GCGGTTATCA TACTGATAAA
ACCATGACTT ATATGTTTGG CAAATCGCTG TCACAAGAAG CGATTTCTGC ACATAAGCAG
TGTGTCGATA TACAAAATAG AATTGCTGAG ATGCTAAAAC CGGGTGCTGT TCCGGCACAA
ATATATAAAA ATACAATAAA CAATCTTAGT CCTGAGTTTC TCGAAAATTT TATGGGTTAC
GGGAATCGCA GGTCCAAATT TTTGGGACAT GGGATAGGTT TATTAATTGA TGAATTGCCT
GTGATAGCAG AAGGGTTTAC TGAGCCGATA GAAGAAGGAA TGGTTTTTGC AATTGAACCT
AAAAAGGGTA TTAAAAATAT TGGTATGGTT GGAACTGAAA ACACTTTTAT AGTTACTCCT
AACGGTGGAC TTTGTATTAC CGGAGATAAT CCGGGATTAA TTCCTGTGTA TTGA
 
Protein sequence
MKKQIPLSEL ERRINSLRTK MEKLCPDWEM IAIFSNINLY YFTGTVQDGM LLIPRNDDAV 
FWVRRSYERA LDESLFTRIK PMTSYREAAA SMKKFPETVY METEIVPLAL YQRFQKYFPF
THVKSVDKYI AGVRAIKSSF ELSLMVQAGS IHQRTLEQYV PQLLKEGISE VDLATELYSI
MVTEGHHGVA RFGAFGAEIV LGLVCFGDSS IYPTYFDVPG GNYGSCPAVP LLGNRYRKLK
KGDLVFIDIG CGVSGYHTDK TMTYMFGKSL SQEAISAHKQ CVDIQNRIAE MLKPGAVPAQ
IYKNTINNLS PEFLENFMGY GNRRSKFLGH GIGLLIDELP VIAEGFTEPI EEGMVFAIEP
KKGIKNIGMV GTENTFIVTP NGGLCITGDN PGLIPVY