Gene Information |
Locus tag | TDE0593 |
Symbol | |
ID | 2741556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Treponema denticola ATCC 35405 |
Kingdom | Bacteria |
Replicon accession | NC_002967 |
Strand | - |
Start bp | 632131 |
End bp | 634026 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637159469 |
Product | internalin-related protein |
Protein accession | NP_971207 |
Protein GI | 42526109 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
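The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for TDE0593.
length_bp = gene_length(632131, 634026)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1896 bp) and Protein Length (631 aa) fields.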
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.138946 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT TTTTGACGGT ATTATTTTTA ACAGGATTGT TGACAACGCG TATCGCAGCA GCGGAAAATC CTGCAAAAAC CGCAGGCTCA GGAAGAGCAA TACTCGGTAT AAGCGACGAC CAAAAAGAGA TAGTCGTAAC AGCGGTAACG GCAGACGGTT CTGCAGTACA TGTTGAAGGT TGCACGGTTA CGGAATTACC GAGCGGAGAA GAAACGATCC TCACGGCAAC AGGAGCAAAG GTTATCCTTA AAGGCGCTAT TACCAAGCTG GACTGTGGAG GCAACCGGCT TACAGAGCTT AACGTACAGG GGTTAACAGC TTTACAAAAA TTATTCTGCG ATGACAATCT GCTTACTTCG CTTGATGTCA GCGGTGTAAC CGCTTTGCAA AGCTTGAGCT GCGGAGAGAA TCTACTGACC TCGCTCGACG TATCGGGTTT AACCGGTTTA CGGGAACTGT ACTGCAACCG CAATCATCTG TCTTCGCTTG ATGTACAGAG TTTAACCGCT TTGCAGGATT TGTTCTGCAA CGCCAATAAA CTTACTTCGC TTAACGTACA AGACCTAAAA GTGTTACAGA GACTGCACTG TAACAGCAAC CGGCTTACTT TACTCAATGT TCGGGATTTA AGTGCTTTAC AGGAACTGGA CTGCGTAGGT AATGAACTTA CCTCGCTTGA TGTACATGGC GTAACCGCCT TATGGGAACT GGAATGCAGC AAAAACATGC TTACTTTGCT CGATGTACAA AGTTTAACTT CCTTATCGAA GCTGGACTGT TCCGCAAATC AACTTACCTC ACTAGACGTA CGGAACTTAG CAGCTTTGGA GGAACTGGAT TGTAGTAACA ATAAACTTAC CGCACTTTAT GTGCAGGGAT TGAATGCGTT ACAGGAACTG AACTGTTCCG AAAATGAGCT CACCTCACTC GAAATACAGG GTTTGACTGC TTTGGAAGTA CTGGATTCCG GCCGCAATGA CCTTACCTCA CTCGATGTAC AAGGATTACC GGCTTTAAAG ATACTGAGCT GTACTGTCAA TGAGCTTACC TCACTCAAAG TACGGGACTT ACCGGCCTTG GAAAAACTGG ACTGTTCCGT AAATCAACTT ACCTCAATCG ATATACTAGA GTTAACCGCT TTAAAGGAAC TGAACTGTTC CTTAAATCAA TTTACCTCAA TCAATATACT GAAGTTAACC GCTTTAAAGG AACTGGACTG TTCCACAAAT CAACTTACCT CACTAGACGT ACGGAACTTA GCAGCTTTGG AGAAACTGGA TTGCAGGGAC AATAAACTTA CCTCGCTTAA TGTGCAAGGA TTGAATACAT TACAGAAATT GTACTGTTCC GAAAATGAGC TTACCTCACT CGAAATACAA GGCTTAAAGA CGTTGCAGAA ACTGAACTGT TACAAAAACA AGCTTACCTC ACTCAATGTA CAGGGGCTGA CCGCTTTACA GTGGCTGAAT TGCGGGTATA ATGAACTTAC AACACTTAAC CTAAAAGGCT TACACGCATT GCGTGACCTG GAGTGCTTTA ACAATAATCT TCCTGAACTC GACGTACAGG ACATAAACAC CTTACAAAGA CTGAACTGTT ATCATAATAA ACTTTCTACT CTTGAACTAT CAACATTACA CGGTTTACAG GAACTGTGCT GCTATGATAA CCTTTTTAAC GAAAAAACCC TTATCCGGAT ACTAACCGCC CTGCCCGATA GGAAGCAAAA AAAAGAAGGC AGAGCTTTGA TATATGGTAA AAAAAATGAT CTAAGGGAAG GAACTATAAC GGATTTTTCT 
TCTTCCGCCG AATTAAAAGC TGCTTTTGAA GCTGCAAAAG CAAAAAACTG GAGATTCTAT AAACGGGATA CAGTTGGAAA TGAGGAAGAG GTTTAA
|
Protein sequence | MKKFLTVLFL TGLLTTRIAA AENPAKTAGS GRAILGISDD QKEIVVTAVT ADGSAVHVEG CTVTELPSGE ETILTATGAK VILKGAITKL DCGGNRLTEL NVQGLTALQK LFCDDNLLTS LDVSGVTALQ SLSCGENLLT SLDVSGLTGL RELYCNRNHL SSLDVQSLTA LQDLFCNANK LTSLNVQDLK VLQRLHCNSN RLTLLNVRDL SALQELDCVG NELTSLDVHG VTALWELECS KNMLTLLDVQ SLTSLSKLDC SANQLTSLDV RNLAALEELD CSNNKLTALY VQGLNALQEL NCSENELTSL EIQGLTALEV LDSGRNDLTS LDVQGLPALK ILSCTVNELT SLKVRDLPAL EKLDCSVNQL TSIDILELTA LKELNCSLNQ FTSINILKLT ALKELDCSTN QLTSLDVRNL AALEKLDCRD NKLTSLNVQG LNTLQKLYCS ENELTSLEIQ GLKTLQKLNC YKNKLTSLNV QGLTALQWLN CGYNELTTLN LKGLHALRDL ECFNNNLPEL DVQDINTLQR LNCYHNKLST LELSTLHGLQ ELCCYDNLFN EKTLIRILTA LPDRKQKKEG RALIYGKKND LREGTITDFS SSAELKAAFE AAKAKNWRFY KRDTVGNEEE V
|
| |
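The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. This is a minimal sketch, assuming the gene sequence is listed in coding orientation (it begins with ATG although the gene lies on the minus strand) and using the standard amino acid assignments, which translation table 11 shares with the standard code for elongation codons. The helper name `translate` is illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # the record groups bases in blocks of 10
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence, copied from the record.
print(translate("ATGAAAAAAT TTTTGACGGT ATTATTTTTA"))  # MKKFLTVLFL
print(translate("GAGGAAGAG GTTTAA"))                   # EEEV
```

Both fragments match the corresponding ends of the protein sequence listed in the record (MKKFLTVLFL at the N-terminus, EEEV at the C-terminus).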