Gene Dtox_2002 details


Gene Information

Locus tag: Dtox_2002
Symbol:
ID: 8428984
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 2166724
End bp: 2168508
Gene Length: 1785 bp
Protein Length: 594 aa
Translation table: 11
GC content: 43%
IMG OID: 645034329
Product: oligoendopeptidase F
Protein accession: YP_003191460
Protein GI: 258515238
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1164] Oligoendopeptidase F
TIGRFAM ID: [TIGR00181] oligoendopeptidase F
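
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values but are not stated in the record. The function names are hypothetical and exist only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming one trailing stop codon is excluded."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length // 3 - 1


# Values taken from the Gene Information table.
length = gene_length_bp(2166724, 2168508)
print(length)                     # 1785
print(protein_length_aa(length))  # 594
```

Under these assumptions, 2168508 - 2166724 + 1 = 1785 bp, which is 595 codons; dropping the stop codon leaves the 594 residues reported as the protein length.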


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000655164
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.321245
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAAACG CAACAGCAGA CAAATACTCC TGGGACTTAA CCGCTATCTT CCCCTCTGAC 
ACAGCCTGGG AAGATGCCTT AAAGCAAATA CAGGAGCTAA CCTCCAAAGT CACCTCTATG
CAGGGCAATC TAACCGTTTC GGCAGAAAAC CTGTATGCCG CTCTGGCCGA TATTGACCGG
TTATCCATCA AACTGGAGCA GGCTTATAGT TACGCCAGGT TGGCTTTTGA CACTTCCATG
GGTGACAACA CAGCCAAAAC ACGCTATGAG AGAATTGATG CTTTGTCTTC GGGAATAGAC
GCACAATTGT CTTTTGTTGA ACCGGAACTG CTCCAATTAA ATGAAGAAAG CTTCCTATCT
TATAAAAAAC AGCTGCCGGA ACTGGAAATA TACTCTTTTA AATTTGAAAA ACTATTTGCT
CTGAAAAAAC ATGTCCTCTC ACCGGATATT GAACAAATTT TGACTAAAAT GAACTCTCTG
GGCCGCTCCT TTAAAAAAAT ATTTGACGAT ATAGCTGTCA ATGATTTAGA TTTTCCCGAA
GTGGCAGGTT CCGACGGGCA AAAGTTTACC GCCGGCGAGG CTAATTATCT CAAGTGCATG
ACTTCTCAAG ACAGGGTTTT GCGTGAGAAT TATTTTAAGG GACTGCTCAA TACTTACGCC
TCACACAAAA ATTCCATTAC CTCCACTTAT TATGGAGCAG TTAAAAACAG CATATTCACT
GCCGGTATCA GAAACTTCAG CTCATCACTG GATATGGCCC TTTCCAGCAA CTTCATACCG
CCGGGAGTTT ACGATAACTT AATCAGCACC GTGCGCAGCA ATGTCGACAA GCTGCAGCGC
TATATAGCTT TGCGTCAAAA AATCCTTGGT CTGCCGGAAA TTCATTTTTA CGACCTGTTT
GTCCCGGTGG TCAAAGATAT GAATAAAACC TACACTTTTG AAGAGGCCAG AGATATTGTG
CTGGAGGCAC TGGCTGTTCT GGGAGAGGAT TATGTAGGAA TACTGAAGCG GGCTTTTTCC
GAGCGATGGA TCGATGTCTA CCCGGCTAAA GGCAAGAGAT CAGGTGCTTA CGCTATGGGA
ATTTACGGCA CTCACCCTTT CTCCCTGTTA AACTTCTCCG GTACTGTAGA GGATATTTTC
ACCCTGGCCC ATGAGTTAGG CCATGTAATG CACAGTTATT TCAGCAATGA AAACCAGCCT
TACATAAATT CACATTATGT AATTTTTACG GCCGAAGTGG CCTCAACCGT AAACGAGACT
CTGCTCTTAA ACTTTTTATT AAAGAAATCA ACCTCTGAGC AGGAAAAAGC CAACTTGTTG
AGCATGCATC TGGACAGCAT CCGCTCGACC CTTTATCGCC AGACATTTTT TGCGGATTTT
GAAAAGCAGG TTCACGAAAC CGTGGAAAAA AATCAGCCTC TAACCCCGGA GACACTGCAA
ACCGTCTATA AAGACTTATA TAAACTGTAT TATGGAGAGA ATTTTGTCAT TGATACGGAA
TTAACCTGTG AATGGCTGCG AATACCTCAT TTTTACTCCC CCTTTTATGT CTACCAGTAT
GCAACAGGTA TTTCAGCGGC TATAAGCATA GCAGCCGGCA TCTTGAATAA AAACCGGTCA
TTTTTAACAG GCTATAAGAA TTTTCTTAAA TCCGGAGGTT CTAAACACCC AATAAACTTG
CTGCAAGAGG CAGGTGTGGA TATGTCAACA CCACAGCCCA TCCAAGATGC TTTAAATGAT
TTTGAAAACT CGGTAAAACA ACTTTCGTCT ATTTTAAAGT TATAG
 
Protein sequence
MINATADKYS WDLTAIFPSD TAWEDALKQI QELTSKVTSM QGNLTVSAEN LYAALADIDR 
LSIKLEQAYS YARLAFDTSM GDNTAKTRYE RIDALSSGID AQLSFVEPEL LQLNEESFLS
YKKQLPELEI YSFKFEKLFA LKKHVLSPDI EQILTKMNSL GRSFKKIFDD IAVNDLDFPE
VAGSDGQKFT AGEANYLKCM TSQDRVLREN YFKGLLNTYA SHKNSITSTY YGAVKNSIFT
AGIRNFSSSL DMALSSNFIP PGVYDNLIST VRSNVDKLQR YIALRQKILG LPEIHFYDLF
VPVVKDMNKT YTFEEARDIV LEALAVLGED YVGILKRAFS ERWIDVYPAK GKRSGAYAMG
IYGTHPFSLL NFSGTVEDIF TLAHELGHVM HSYFSNENQP YINSHYVIFT AEVASTVNET
LLLNFLLKKS TSEQEKANLL SMHLDSIRST LYRQTFFADF EKQVHETVEK NQPLTPETLQ
TVYKDLYKLY YGENFVIDTE LTCEWLRIPH FYSPFYVYQY ATGISAAISI AAGILNKNRS
FLTGYKNFLK SGGSKHPINL LQEAGVDMST PQPIQDALND FENSVKQLSS ILKL
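
The relationship between the two listings can be illustrated by translating the start of the gene sequence. This is a minimal sketch, not the database's own pipeline: the `translate` helper is hypothetical, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which codons may act as start codons). The helper also removes the spaces that separate the 10-base blocks in the listing.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; these amino acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    clean = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(clean) - 2, 3):
        aa = CODON_TABLE[clean[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGATAAACG CAACAGCAGA CAAATACTCC"))  # MINATADKYS
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same helper should reproduce the complete protein listing, with the terminal TAG codon read as the stop.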