Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TDE1865 |
Symbol | |
ID | 2740054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Treponema denticola ATCC 35405 |
Kingdom | Bacteria |
Replicon accession | NC_002967 |
Strand | - |
Start bp | 1889231 |
End bp | 1892284 |
Gene Length | 3054 bp |
Protein Length | 1017 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637160752 |
Product | M16 family peptidase |
Protein accession | NP_972469 |
Protein GI | 42527371 |
COG category | [R] General function prediction only |
COG ID | [COG1026] Predicted Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000469623 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACTC TTATTCACGG CTTTGAAATT ATAAGCAAAA ATCCCTTACC CGAATTTAAT GCTGTAGGCA TTTATGCAAG GCATAAAAAG ACGGGACTTG AACTTTATCA TATTTTAAAC GATGACGATG AAAATCTTTT TTCATATAAT TTTATGACGA GCTCTCCCAA TTCGACGGGG GTAGCTCATA TTATCGAGCA CACGGTTTTA TGCGGCTCTA AAAACTATCC GCTTAAAGAC CCCTTTATGG TTTTGGCAAA GCAGAGCGTT AATACCTTTT TAAATGCCAT GACCTATCCC GATAAGACAG TCTATCCCGC AAGCTCCTTG GTTGAGGCCG ATTATTTTAA CCTAATGTCG GTTTACGGGG ATGCTGTCTT CTTTCCCAAT CTTGACGAAT GGGCTTTTAA GCAGGAAGGA CACCGCTTTG AGCTGGACGA AAACGGAAAG ATGAGCGTTC AGGGAGTTGT CTTAAACGAG ATGCGGGCTA ATTATTCCGA CTTTGACGGG GTAATGTATG ATTGGGCAGC AGCTTCTATT TGTCAGGGAA GTATCTATGC TAAGGATTCG GGCGGCTCTC CATTGGAGAT TCCCGATTTA ACTTATGAAG AATATAAGGC CTTTCACAAA AAATACTATC ATCCCGTAAA CTGCCGAATT TTTTTAATGG GAAATATTCC GACCGAAAAG CAGATGAAAT TTTTAGAAGA AAAATTCCTT TCTAAATTTG AAGCTGCCGA AAAGCCTCCC TTTGTGCCGC CGATTGAGCA CTATGCCGAG CCCCGCTTCT TTTCGGTACC AGCCCCCGCC GGCGGCCCTG CTCCGGCAGG TGATACTGCT TCCGCCGAAG AGATGACTAA AGATTCCGTA ATGCTTAACT GGCTTCTCCC CGAAACTTCG GATACCGAAA AACTTATGCA GGCCTATCTT ATAGGAGAGG TTTTAATCGG GCACAGCGGA GCATACTTAA ATAAGGTTTT ATTGGAATCG GGAATAGGGG AAGACCTTTA TCCCTATAAC GGAATAGGAA AAAGCCTTAG AAACATCACC CTCACAATAG GAATAAAGGG TATTAAAAAG GAAACGCATG AGGATTTTAA AAAGCTGGTC TTTAGTGCTC TTGAAGAGCT TGTAAAAAAA GGAATCGACC CTAAAGAAAT TGAAACGGCC GTTCATTCTA TAGATTTTAG CAACAGAGAA ATAAGAAGAA ACTACGGGCC TTTCGGCATT AACCTGATGG AGCGTGCTAT GGCAGGCTGG ACATACGGGG TAAGTCCCGA AAAAACTCTT CAATACACTC CTGTTTTTGA AAAGGTAAAA AAAGACCTTG CCTCCGATAA AAGGTATATC GAAAAACTGA TTGAAAAGTA TTTACTAAAG AATAAACATC ATGCCCTTGT AAGAGTTTAC CCCGATGCGG ATTTTTGTAA ACGCTTGGAC GAAAGTCTGG AAAAAAGGGC TGAAAATTTT AATGCAAGTT TAACGGATGA GGATCGCAAG GCCATGCTCA AAGAGCAGGA GAAGATGAAC GAGTTTAAAC AAAAAAGCGA TTCCCCTGAA ATGCTCGCCC TTATTCCCCA TCTTTCAAAG AAAGACCTGC CTCCTCTTCC GCCTTCGATA GATGAAGAAA TTGCCTTTAT CGGAAAGGTT CCCATTGTTA TGCATGAGCA GCCTACAAAC GGGATAGGTT ATTTTCAATT GGCTTTTCCT GTAGACGGTT TAAGCGAAGA AGATTATAAA TATCTGCCCC TTCTTTCAAG TTGCCTTACA GGAATGGGAA CCGAAAACCT TTTATGGAGC GAGGTTTCTT CTAAGCTTGC AAATTTACTT GGAGGCTTTT CGGCAAGTGC CGGCGTTTTT ACGGCAAACA AAAATCTTTC TTTATGTAAA AATGCGGATA AGATAAGGCT CTCTGATATA GCCGGAAGGG ATTGGCTTTT TATTTCGGGG AAGATACTCG GTGAATTGAT TCCTGAAGCT GTCTGCTTTG TTTTGCAGTT TTTAAACGAA ATTTCTTTTG ACGATAAAAA ACGCTTAAAC GATTTGGTAA CCCAGAGGAA AAACGATTTT GAGAGTCTCC TTGCCCTTGA CGGAAACAGC CTCGCTCTTC TTAGGGCCAG TGCACCCCTG TCCGAAAAAA ATGCACGGAG GGAGATGCTT TCGGGATTAA GTCAGCTTAA ATTTTTGAGG GAGCTTTATT TAAAAGTTAA AGAAGATAAT TCTAAAAAAG CCGATTCCGA AAATGCTGAT TTAGAATTGA ATAAATTATC AAATAAATTA AGTGCCGTAT ATAAATCGAT TATAAAATCA GGTTTGATTA TCGAAGTTAC CGGTACAAAA GAAAATCTGG CCGCTTTAAA AACAGCCTTT GAAAAAAACT TAAAAGGCTT TAAGGCTCCC GATAAGACTG ATAAGATTGT TTTTGAAAAT CCATTTAAAT TTAGGCCTTC TGAAAAAAAG AGGTTGGAAC TTATTCCTGC TTCTCTTCAA GTAGGCTTTG CAGTTTCGGT TTTTAAGGCG GCAGCTTTCG GTTCGAAGAA GCAAGCCTCA CAGTTAATCC TGTGTAAATG GCTTTCAAGC GGCCCGATGT GGGAAAAGAT AAGAAGCATA GGAGGAGCCT ACGGTGCCTT TACCGTTCCT ATGTCTTTGG AAGAAATTTT AGCCTTTGTT TCCTACAGGG ATCCGAACCC GATAAATTCT CTTTCCGAAT TTTTAAACTC AATAGACGAG ACATTTACCC AAGACTTTTC GGAAGAGATG ATAGAAAAGC TTATTACGGG AAGATACAGC AAGGAGATTA TCCCGATGAC TCCTGCAGGA CGAGGGGCGG CTGCCTTTAG GGATCTTCTT TCGGGTATTT CATATTCCGA AAAAAAAGAA ATCGTTGAAA AGATGCTGGA AACAACGGCT GAGGATTTGC GTAATTGTGC AAAAAAATTA TCGGTTCAAA GGGATTCCCT TTCTTCCGTA GTCTTGGCTT CCGATTCGGC TCTTGCCCAA AAGGAAACGA TAAAAGAACT TTATCCCGAT CCCCTTCTTT CGGAGAGGGT TTGA
|
Protein sequence | MSTLIHGFEI ISKNPLPEFN AVGIYARHKK TGLELYHILN DDDENLFSYN FMTSSPNSTG VAHIIEHTVL CGSKNYPLKD PFMVLAKQSV NTFLNAMTYP DKTVYPASSL VEADYFNLMS VYGDAVFFPN LDEWAFKQEG HRFELDENGK MSVQGVVLNE MRANYSDFDG VMYDWAAASI CQGSIYAKDS GGSPLEIPDL TYEEYKAFHK KYYHPVNCRI FLMGNIPTEK QMKFLEEKFL SKFEAAEKPP FVPPIEHYAE PRFFSVPAPA GGPAPAGDTA SAEEMTKDSV MLNWLLPETS DTEKLMQAYL IGEVLIGHSG AYLNKVLLES GIGEDLYPYN GIGKSLRNIT LTIGIKGIKK ETHEDFKKLV FSALEELVKK GIDPKEIETA VHSIDFSNRE IRRNYGPFGI NLMERAMAGW TYGVSPEKTL QYTPVFEKVK KDLASDKRYI EKLIEKYLLK NKHHALVRVY PDADFCKRLD ESLEKRAENF NASLTDEDRK AMLKEQEKMN EFKQKSDSPE MLALIPHLSK KDLPPLPPSI DEEIAFIGKV PIVMHEQPTN GIGYFQLAFP VDGLSEEDYK YLPLLSSCLT GMGTENLLWS EVSSKLANLL GGFSASAGVF TANKNLSLCK NADKIRLSDI AGRDWLFISG KILGELIPEA VCFVLQFLNE ISFDDKKRLN DLVTQRKNDF ESLLALDGNS LALLRASAPL SEKNARREML SGLSQLKFLR ELYLKVKEDN SKKADSENAD LELNKLSNKL SAVYKSIIKS GLIIEVTGTK ENLAALKTAF EKNLKGFKAP DKTDKIVFEN PFKFRPSEKK RLELIPASLQ VGFAVSVFKA AAFGSKKQAS QLILCKWLSS GPMWEKIRSI GGAYGAFTVP MSLEEILAFV SYRDPNPINS LSEFLNSIDE TFTQDFSEEM IEKLITGRYS KEIIPMTPAG RGAAAFRDLL SGISYSEKKE IVEKMLETTA EDLRNCAKKL SVQRDSLSSV VLASDSALAQ KETIKELYPD PLLSERV
|
| |