Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TDE2020 |
Symbol | |
ID | 2740367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Treponema denticola ATCC 35405 |
Kingdom | Bacteria |
Replicon accession | NC_002967 |
Strand | - |
Start bp | 2040824 |
End bp | 2044246 |
Gene Length | 3423 bp |
Protein Length | 1140 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637160910 |
Product | YD repeat-containing protein |
Protein accession | NP_972623 |
Protein GI | 42527525 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATTA GTTATTGTCG TATACAACCA AACTATCTCT TTTCTCGGAC AGCGATTGCA AGGGAGCAAA AGCAATTTAT AAAAATTGCT TTTGCTGTAC ATCTTTCCGC AGTAATTTCT GCGGAAAGAT ACCGAAGCGA TAGCGTAGCC CTGAAAAGCG CGGTGTCGGT ATTTGTGTAC TTTTATAGTG TATTGATGGT ACACAAATAC TGGCAGTCCG CCAAAAGAAA GTTATTATTA AACAGTCCCA CTGACAGGAC TTTTGAAAAT AGGTGGAGCA GATACAAGGA GCCTAGTAAA AATAAAGCGG AGGCGTATCG GGTATACGTC GAGCATTTAT TTTTGCGTAG CGACGCAGGA GATGCCCGCA TATTTTCAAA AGAGGTAATC CACGACGGTC TAGGACGCAT AAACTACACA GCCAAAGAAG GAGAAGTATA CATAGACGGA ACCGATTACC AAACCCAAAC AGGCTGGAAT GTTTCGGGAG CGGTTTATTA TGATGAAGAC GGAAGAAAAA AAGGAGAAGG ACAGCCTGTG TTCTATGGAG GAGATATACA AAATGAATTA AGCGGAAGCT CAAGCCAAAT CCTTTTATAT GAAAAATTAA ACGAATTAAA ACATCCTACA AGTTATGAAT ATGACGGACT AGGCCGCGTA ATAAAAACAA CCCTTCCCGA CGGAAACATA CAGCGTAATG AATACTTGAT AGATTCTTCC TTACAAATAA CAAAAACAAC CGACCCTAAA GAAAACATAA ACATCAGTAA AAAAGATATA AGAGGAAACA TAAAAGAAGT AGAAAGACGT GATAAAAACA ATACCTTACT AACCAAAGCC CGATACGAAT ATTCGGTATT AGGAGAAATG TTAAAAGCCT ACGACGCTAA AGACAACTTA TTGGCAGTAA ACTATGACAT GCTCGGAAGG CGCATAAGTT TAGAAAGCCT TGATATGGGA AGAAAAGAAT GGAACTATGA CGATAAAGGA AGACTTGAGT ATGAAAACGA CTCCGTATTA AGATCAAAAT TAGCGTCAAT AAAATATGAA TACGACGGAC TGGACAGAAT AATAAAAATA GACTATCCTT TTAGTGAAGA TGTAGAATAT GAATACGGAG CAGCGGGAGA AAAAGGAGCG GGAGAGGTAA TCCGCAAAAA AGATGAAACG GGAGAGACAA TGTACAGCTA CGGCCTACTA AACGAGGTAA AGGTAGAAAC ACGGACAATA AAGAGGGGCA GAGAATTTCA AAAACCCGTA ACTGCCGTGT TTAACTATGA AGCCGATTAC TTAGGCCGAA TGCAAAGCAT AAGCTACCCT GACGGAGAAG TGCTGACTTA CAACTACGAC AAGGGCGGAC AATTAAAAGG AGTTATAGGT AAAAAAGGAA TAGAAACCTA TAGGTATGTA GACAATATCT TATACGATGA ACACGGTCAG AGAGTCTACA TCAAATACGG AAACGGAGTA GAAACAAGAT ATACTTACGA CCCTGCACGG CGTTGGCTAA AAGACATAAA GACAGAAAAC AAAGATAAGA ATTTGGTATT CCAAAAAATA AACTATAACT TTGATGCAGT AGGGAATGTA GAAGGCTATA TAAATACTTC AAGCAAATAT GAAACAAGTC AAAGCTACAG TTATGACAAT TTGTACCAAC TTATAAAAGC CGAAGGCACG CACAAACAAT ACGGAGGAAT AAACCCTAAC CCCGATAACC CGCATCCTTC AAATCCCTTA TACACAAATA AATACAGACA AACCTTTGCC TTCGACATAA TAGGGAACAT GACGAATAAG AGCAGCACCA CAAACCTTCC CGGAGGCTCA ATAGGAACTA CGGATGACAC AAAACTAAAC TATGAACTGG ATTATGAGTA TGACAGTAAA TATGCACACC GCTTAATAAG AGCCGGAACC CGATACTATC GCTACGACGC CAACGGCAAC ATCACTGCAG AAAAAGACGG AAAGTTCAGC GACAAAGAAG AGCTTACCTT TACCTATTCC TACTTTGCAG AACATGACGT ATACGGAGTA GATTACGGCT TTGACCTAGA ACCCCCTGAA GATGACCCTG CAAACCTAGA AAGCGGAGGC ACAACAAGTA CTACACCTAC AGGAGGCTAC AGAAGAGATT ATACATGGAA TGAACGCAAC CTATTGATAA AATCAGACGA TAAGTTAAAC ACCGTAATAT ACCGCTACGG AGATGACGGA CAGAGAGCCT TAAAGTTTAC TCAACAGAGT AATAGCGAAA CTCTTTATTT CAATAACTTT TACTCCGTAC ACCAAGTTGC CCATGAACCT AATCACGAAC ATGGATTACG AGTAAGCAAA CACATCTTTG TCGGAAACTC AAGATTAGTA ACTGCAATGA CGCACGCAGA CAACCACGGC GACACAACCG AGCAGACAGA AAAAAGATAT TATTATCATG CAGATCATCT GCAAAGTGCA CAGTTTATAA CAAATGCTAA AGGAGAGCAG TATGAACACA TAGAATACAC GCCTTACGGA GAACTCTGGA TTGAAGAGAC TGCACCGGGG ATAGATAAGT TACCGTTTAG GTTTACCGGC AAGGAACTGG ATGAAGAGAC TGGACTGTAC TATTATGGTG CGAGGTATTT AGACCCGAAA TATTCGAGAT GGTTGAGTGG AGATCCTGCG TTAGGTGAGT ATATTCCGCA AGCTCCGGTA AATGACGAAG CTAAAAAACA CAACGAGAAC CTGCCCGGAA TGGGAGGTAT ATACAACACG GTAAATTTAC ATGTTTATCA TTATGCGGGG AATAACCCGG TGAAGTATGT GGATCCGAAT GGAGAAAAAA TACTGGATGT TAGTACGACT TTGGTTCAAG AAAACGGAGA AGAAGATCCC TTAGGTAAGG GTAATACTAC AATTGCTTCA CATGGCTGTG TTTTAACAGC ATATACTAGA ATTGCAAATG CGATAAGTGC TATTGGTGAA CGTACATTAC AAAACGCTAA TTCATTGGCT CTTAAGAAGG ATATTTATGA TGGAAATAAC AAAAATTTAA TATTTAACTC TCCTACTGCT TCCAAATTAG TAAATTCCAT ATCTATTCTA AGTGAATATA AAATGTCATT TTATAAATCA TTAAATAATA TTAGCTTAAA TGAATATGGT AGCGAATTGG AAAAACTAAG CAAATCAGAA GACTTATATG CTATTACCAT TAGAGTTTCA AATAAGTATG GAGGCCATAC TGTCAACATG AATTCTGATG GAATTATTGT GGATACTGAA GATAATTATA CCATCAAAAT TAATAACACC TCAGAAAAAG GATATTTACC GTCAACGGTT GTTAATGATA TCATAACGGA TTCTTCAGCT ATAATCCGTA TTGATATATT TAAGATAGAA AAGGTAAAAA CTGTTCCATT TTTACCCGAA TAA
|
Protein sequence | MKISYCRIQP NYLFSRTAIA REQKQFIKIA FAVHLSAVIS AERYRSDSVA LKSAVSVFVY FYSVLMVHKY WQSAKRKLLL NSPTDRTFEN RWSRYKEPSK NKAEAYRVYV EHLFLRSDAG DARIFSKEVI HDGLGRINYT AKEGEVYIDG TDYQTQTGWN VSGAVYYDED GRKKGEGQPV FYGGDIQNEL SGSSSQILLY EKLNELKHPT SYEYDGLGRV IKTTLPDGNI QRNEYLIDSS LQITKTTDPK ENINISKKDI RGNIKEVERR DKNNTLLTKA RYEYSVLGEM LKAYDAKDNL LAVNYDMLGR RISLESLDMG RKEWNYDDKG RLEYENDSVL RSKLASIKYE YDGLDRIIKI DYPFSEDVEY EYGAAGEKGA GEVIRKKDET GETMYSYGLL NEVKVETRTI KRGREFQKPV TAVFNYEADY LGRMQSISYP DGEVLTYNYD KGGQLKGVIG KKGIETYRYV DNILYDEHGQ RVYIKYGNGV ETRYTYDPAR RWLKDIKTEN KDKNLVFQKI NYNFDAVGNV EGYINTSSKY ETSQSYSYDN LYQLIKAEGT HKQYGGINPN PDNPHPSNPL YTNKYRQTFA FDIIGNMTNK SSTTNLPGGS IGTTDDTKLN YELDYEYDSK YAHRLIRAGT RYYRYDANGN ITAEKDGKFS DKEELTFTYS YFAEHDVYGV DYGFDLEPPE DDPANLESGG TTSTTPTGGY RRDYTWNERN LLIKSDDKLN TVIYRYGDDG QRALKFTQQS NSETLYFNNF YSVHQVAHEP NHEHGLRVSK HIFVGNSRLV TAMTHADNHG DTTEQTEKRY YYHADHLQSA QFITNAKGEQ YEHIEYTPYG ELWIEETAPG IDKLPFRFTG KELDEETGLY YYGARYLDPK YSRWLSGDPA LGEYIPQAPV NDEAKKHNEN LPGMGGIYNT VNLHVYHYAG NNPVKYVDPN GEKILDVSTT LVQENGEEDP LGKGNTTIAS HGCVLTAYTR IANAISAIGE RTLQNANSLA LKKDIYDGNN KNLIFNSPTA SKLVNSISIL SEYKMSFYKS LNNISLNEYG SELEKLSKSE DLYAITIRVS NKYGGHTVNM NSDGIIVDTE DNYTIKINNT SEKGYLPSTV VNDIITDSSA IIRIDIFKIE KVKTVPFLPE
|
| |