Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_2002 |
Symbol | |
ID | 8428984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 2166724 |
End bp | 2168508 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 645034329 |
Product | oligoendopeptidase F |
Protein accession | YP_003191460 |
Protein GI | 258515238 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR00181] oligoendopeptidase F |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000655164 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.321245 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAAACG CAACAGCAGA CAAATACTCC TGGGACTTAA CCGCTATCTT CCCCTCTGAC ACAGCCTGGG AAGATGCCTT AAAGCAAATA CAGGAGCTAA CCTCCAAAGT CACCTCTATG CAGGGCAATC TAACCGTTTC GGCAGAAAAC CTGTATGCCG CTCTGGCCGA TATTGACCGG TTATCCATCA AACTGGAGCA GGCTTATAGT TACGCCAGGT TGGCTTTTGA CACTTCCATG GGTGACAACA CAGCCAAAAC ACGCTATGAG AGAATTGATG CTTTGTCTTC GGGAATAGAC GCACAATTGT CTTTTGTTGA ACCGGAACTG CTCCAATTAA ATGAAGAAAG CTTCCTATCT TATAAAAAAC AGCTGCCGGA ACTGGAAATA TACTCTTTTA AATTTGAAAA ACTATTTGCT CTGAAAAAAC ATGTCCTCTC ACCGGATATT GAACAAATTT TGACTAAAAT GAACTCTCTG GGCCGCTCCT TTAAAAAAAT ATTTGACGAT ATAGCTGTCA ATGATTTAGA TTTTCCCGAA GTGGCAGGTT CCGACGGGCA AAAGTTTACC GCCGGCGAGG CTAATTATCT CAAGTGCATG ACTTCTCAAG ACAGGGTTTT GCGTGAGAAT TATTTTAAGG GACTGCTCAA TACTTACGCC TCACACAAAA ATTCCATTAC CTCCACTTAT TATGGAGCAG TTAAAAACAG CATATTCACT GCCGGTATCA GAAACTTCAG CTCATCACTG GATATGGCCC TTTCCAGCAA CTTCATACCG CCGGGAGTTT ACGATAACTT AATCAGCACC GTGCGCAGCA ATGTCGACAA GCTGCAGCGC TATATAGCTT TGCGTCAAAA AATCCTTGGT CTGCCGGAAA TTCATTTTTA CGACCTGTTT GTCCCGGTGG TCAAAGATAT GAATAAAACC TACACTTTTG AAGAGGCCAG AGATATTGTG CTGGAGGCAC TGGCTGTTCT GGGAGAGGAT TATGTAGGAA TACTGAAGCG GGCTTTTTCC GAGCGATGGA TCGATGTCTA CCCGGCTAAA GGCAAGAGAT CAGGTGCTTA CGCTATGGGA ATTTACGGCA CTCACCCTTT CTCCCTGTTA AACTTCTCCG GTACTGTAGA GGATATTTTC ACCCTGGCCC ATGAGTTAGG CCATGTAATG CACAGTTATT TCAGCAATGA AAACCAGCCT TACATAAATT CACATTATGT AATTTTTACG GCCGAAGTGG CCTCAACCGT AAACGAGACT CTGCTCTTAA ACTTTTTATT AAAGAAATCA ACCTCTGAGC AGGAAAAAGC CAACTTGTTG AGCATGCATC TGGACAGCAT CCGCTCGACC CTTTATCGCC AGACATTTTT TGCGGATTTT GAAAAGCAGG TTCACGAAAC CGTGGAAAAA AATCAGCCTC TAACCCCGGA GACACTGCAA ACCGTCTATA AAGACTTATA TAAACTGTAT TATGGAGAGA ATTTTGTCAT TGATACGGAA TTAACCTGTG AATGGCTGCG AATACCTCAT TTTTACTCCC CCTTTTATGT CTACCAGTAT GCAACAGGTA TTTCAGCGGC TATAAGCATA GCAGCCGGCA TCTTGAATAA AAACCGGTCA TTTTTAACAG GCTATAAGAA TTTTCTTAAA TCCGGAGGTT CTAAACACCC AATAAACTTG CTGCAAGAGG CAGGTGTGGA TATGTCAACA CCACAGCCCA TCCAAGATGC TTTAAATGAT TTTGAAAACT CGGTAAAACA ACTTTCGTCT ATTTTAAAGT TATAG
|
Protein sequence | MINATADKYS WDLTAIFPSD TAWEDALKQI QELTSKVTSM QGNLTVSAEN LYAALADIDR LSIKLEQAYS YARLAFDTSM GDNTAKTRYE RIDALSSGID AQLSFVEPEL LQLNEESFLS YKKQLPELEI YSFKFEKLFA LKKHVLSPDI EQILTKMNSL GRSFKKIFDD IAVNDLDFPE VAGSDGQKFT AGEANYLKCM TSQDRVLREN YFKGLLNTYA SHKNSITSTY YGAVKNSIFT AGIRNFSSSL DMALSSNFIP PGVYDNLIST VRSNVDKLQR YIALRQKILG LPEIHFYDLF VPVVKDMNKT YTFEEARDIV LEALAVLGED YVGILKRAFS ERWIDVYPAK GKRSGAYAMG IYGTHPFSLL NFSGTVEDIF TLAHELGHVM HSYFSNENQP YINSHYVIFT AEVASTVNET LLLNFLLKKS TSEQEKANLL SMHLDSIRST LYRQTFFADF EKQVHETVEK NQPLTPETLQ TVYKDLYKLY YGENFVIDTE LTCEWLRIPH FYSPFYVYQY ATGISAAISI AAGILNKNRS FLTGYKNFLK SGGSKHPINL LQEAGVDMST PQPIQDALND FENSVKQLSS ILKL
|
| |