Gene TDE2003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTDE2003 
Symbol 
ID2740456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTreponema denticola ATCC 35405 
KingdomBacteria 
Replicon accessionNC_002967 
Strand
Start bp2022417 
End bp2023691 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content51% 
IMG OID637160893 
Productinternalin-related protein 
Protein accessionNP_972606 
Protein GI42527508 
COG category[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000199876 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGAAG AATCGAAAAA ACTGTTTATA ACAATACTGC TTGCAGCGGG ATTGTTGACC 
GCTGTCGCTG CGACGGAAAA CGCAGGCAGG GCGGTACTTG GTATAAGCCC GCATGAAAAA
GAGATAACCG TGACGGCGGT AACGGCAGAC GGCTCGCCCG TACAGGTAGA AGGCTGCACG
GTTACGGAAT TCCCGAGCGG CGAAGAAACG GTACTTACGG CAACGGGAAC AAAGGTTGTC
CTTAAAGGTG CTCTTATCGA GCTGTTCTGT TATGCTAACA AACTTACCGC GCTTGATGTC
CGCGGTTTAA CGGCTTTGCA GGAGCTGTAT TGCCAAGACA ATATGCTTAC TTCGCTTGAT
ATACGGGAAT TAACCGGCTT ACATACGCTG TATTGCGGCA ATAATCGGCT TACCGCACTG
GATATACGTG GGTTGACCGC TTTGCGGAAG CTGTACTGCA ATAGTAACGA AATCGCATCC
CTCGATGTAC GGGGGCTTAC CGCTTTACAG ACACTGTACT GTGACAATAA CCGGCTTTCC
TCCCTTGATG TACGGCACCT TACCGCTTTA CAGTGGCTGG ACTGCCACTT CAATAAACTG
ACATCACTCG ACGTGCAGGG CTTACCTGCT TTACAGGTAC TGGAGTGCTC GGATAATCAG
CTTACCTTGC TTGATGTACA GGGGTTGCCC GCTTTGCAGA AGCTGTACTG CCAAGACAAT
AAGATTACTT CGCTTGACGT GCGCGGTTTA ACCTCTTTAC AGGTACTGAT GTGCTACGAC
AATCGGCTTA CCGCACTGAA TGTACGCCGT TTGACTGCTT TACAGGAGCT GAATTGCAGT
TCCAATGCAA TCGCTTCCAT CGATGTACGG GGACTTACTG CGCTACAGGT GCTGTACTGT
GACAATAACC GGCTAACATC CCTCGATGTA CAGGGGCTGA CAGCTTTACA GGAGCTGGCG
TGCTCGGATA ATCAGCTTAC CGCACTCAAT GTGCAGGGCT TACCCGCCTT ACAGGCGCTG
GGTTTTCAGG ATAACCGCCT TGAAGAAGAC GCCCTTATAC GGATACTCAA CAGTTTGCCC
GACCGCAAAA CGGACGAAGA AGGCCTCGCC GTACTGTACA GCGAAGGGGA GAAGCTCCCT
AAAGAAAGCG GGTCAGCCGT TCCCGCCTCA GCCGAGTTTC GAGCGGCTTT CAAAGCTGCA
AAACAAAAAA ACTGGAAGCT GTACATGCAA ACCGAAGATG GAGACGGTAC GGAAATTCGG
TTGCAGGAGG AATAA
 
Protein sequence
MMEESKKLFI TILLAAGLLT AVAATENAGR AVLGISPHEK EITVTAVTAD GSPVQVEGCT 
VTEFPSGEET VLTATGTKVV LKGALIELFC YANKLTALDV RGLTALQELY CQDNMLTSLD
IRELTGLHTL YCGNNRLTAL DIRGLTALRK LYCNSNEIAS LDVRGLTALQ TLYCDNNRLS
SLDVRHLTAL QWLDCHFNKL TSLDVQGLPA LQVLECSDNQ LTLLDVQGLP ALQKLYCQDN
KITSLDVRGL TSLQVLMCYD NRLTALNVRR LTALQELNCS SNAIASIDVR GLTALQVLYC
DNNRLTSLDV QGLTALQELA CSDNQLTALN VQGLPALQAL GFQDNRLEED ALIRILNSLP
DRKTDEEGLA VLYSEGEKLP KESGSAVPAS AEFRAAFKAA KQKNWKLYMQ TEDGDGTEIR
LQEE