Gene Information |
Locus tag | TDE2003 |
Symbol | |
ID | 2740456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Treponema denticola ATCC 35405 |
Kingdom | Bacteria |
Replicon accession | NC_002967 |
Strand | + |
Start bp | 2022417 |
End bp | 2023691 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637160893 |
Product | internalin-related protein |
Protein accession | NP_972606 |
Protein GI | 42527508 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
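The length fields above follow from the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the source database.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 2022417
END_BP = 2023691


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1275 424
```

Both values agree with the Gene Length and Protein Length rows of the record.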
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000199876 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGATGGAAG AATCGAAAAA ACTGTTTATA ACAATACTGC TTGCAGCGGG ATTGTTGACC GCTGTCGCTG CGACGGAAAA CGCAGGCAGG GCGGTACTTG GTATAAGCCC GCATGAAAAA GAGATAACCG TGACGGCGGT AACGGCAGAC GGCTCGCCCG TACAGGTAGA AGGCTGCACG GTTACGGAAT TCCCGAGCGG CGAAGAAACG GTACTTACGG CAACGGGAAC AAAGGTTGTC CTTAAAGGTG CTCTTATCGA GCTGTTCTGT TATGCTAACA AACTTACCGC GCTTGATGTC CGCGGTTTAA CGGCTTTGCA GGAGCTGTAT TGCCAAGACA ATATGCTTAC TTCGCTTGAT ATACGGGAAT TAACCGGCTT ACATACGCTG TATTGCGGCA ATAATCGGCT TACCGCACTG GATATACGTG GGTTGACCGC TTTGCGGAAG CTGTACTGCA ATAGTAACGA AATCGCATCC CTCGATGTAC GGGGGCTTAC CGCTTTACAG ACACTGTACT GTGACAATAA CCGGCTTTCC TCCCTTGATG TACGGCACCT TACCGCTTTA CAGTGGCTGG ACTGCCACTT CAATAAACTG ACATCACTCG ACGTGCAGGG CTTACCTGCT TTACAGGTAC TGGAGTGCTC GGATAATCAG CTTACCTTGC TTGATGTACA GGGGTTGCCC GCTTTGCAGA AGCTGTACTG CCAAGACAAT AAGATTACTT CGCTTGACGT GCGCGGTTTA ACCTCTTTAC AGGTACTGAT GTGCTACGAC AATCGGCTTA CCGCACTGAA TGTACGCCGT TTGACTGCTT TACAGGAGCT GAATTGCAGT TCCAATGCAA TCGCTTCCAT CGATGTACGG GGACTTACTG CGCTACAGGT GCTGTACTGT GACAATAACC GGCTAACATC CCTCGATGTA CAGGGGCTGA CAGCTTTACA GGAGCTGGCG TGCTCGGATA ATCAGCTTAC CGCACTCAAT GTGCAGGGCT TACCCGCCTT ACAGGCGCTG GGTTTTCAGG ATAACCGCCT TGAAGAAGAC GCCCTTATAC GGATACTCAA CAGTTTGCCC GACCGCAAAA CGGACGAAGA AGGCCTCGCC GTACTGTACA GCGAAGGGGA GAAGCTCCCT AAAGAAAGCG GGTCAGCCGT TCCCGCCTCA GCCGAGTTTC GAGCGGCTTT CAAAGCTGCA AAACAAAAAA ACTGGAAGCT GTACATGCAA ACCGAAGATG GAGACGGTAC GGAAATTCGG TTGCAGGAGG AATAA
Protein sequence | MMEESKKLFI TILLAAGLLT AVAATENAGR AVLGISPHEK EITVTAVTAD GSPVQVEGCT VTEFPSGEET VLTATGTKVV LKGALIELFC YANKLTALDV RGLTALQELY CQDNMLTSLD IRELTGLHTL YCGNNRLTAL DIRGLTALRK LYCNSNEIAS LDVRGLTALQ TLYCDNNRLS SLDVRHLTAL QWLDCHFNKL TSLDVQGLPA LQVLECSDNQ LTLLDVQGLP ALQKLYCQDN KITSLDVRGL TSLQVLMCYD NRLTALNVRR LTALQELNCS SNAIASIDVR GLTALQVLYC DNNRLTSLDV QGLTALQELA CSDNQLTALN VQGLPALQAL GFQDNRLEED ALIRILNSLP DRKTDEEGLA VLYSEGEKLP KESGSAVPAS AEFRAAFKAA KQKNWKLYMQ TEDGDGTEIR LQEE
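The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the helpers below remove that grouping and compute a GC percentage, so that a value such as the GC content row can be recomputed from the Gene sequence field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demonstration uses only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100 * gc / len(seq)


# First two display blocks of the gene sequence above, for illustration only.
sample = clean_sequence("ATGATGGAAG AATCGAAAAA")
print(len(sample), gc_percent(sample))  # 20 30.0
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_percent` gives the figure to compare against the GC content row.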