Gene TDE0593 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTDE0593 
Symbol 
ID2741556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTreponema denticola ATCC 35405 
KingdomBacteria 
Replicon accessionNC_002967 
Strand
Start bp632131 
End bp634026 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content43% 
IMG OID637159469 
Productinternalin-related protein 
Protein accessionNP_971207 
Protein GI42526109 
COG category[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.138946 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT TTTTGACGGT ATTATTTTTA ACAGGATTGT TGACAACGCG TATCGCAGCA 
GCGGAAAATC CTGCAAAAAC CGCAGGCTCA GGAAGAGCAA TACTCGGTAT AAGCGACGAC
CAAAAAGAGA TAGTCGTAAC AGCGGTAACG GCAGACGGTT CTGCAGTACA TGTTGAAGGT
TGCACGGTTA CGGAATTACC GAGCGGAGAA GAAACGATCC TCACGGCAAC AGGAGCAAAG
GTTATCCTTA AAGGCGCTAT TACCAAGCTG GACTGTGGAG GCAACCGGCT TACAGAGCTT
AACGTACAGG GGTTAACAGC TTTACAAAAA TTATTCTGCG ATGACAATCT GCTTACTTCG
CTTGATGTCA GCGGTGTAAC CGCTTTGCAA AGCTTGAGCT GCGGAGAGAA TCTACTGACC
TCGCTCGACG TATCGGGTTT AACCGGTTTA CGGGAACTGT ACTGCAACCG CAATCATCTG
TCTTCGCTTG ATGTACAGAG TTTAACCGCT TTGCAGGATT TGTTCTGCAA CGCCAATAAA
CTTACTTCGC TTAACGTACA AGACCTAAAA GTGTTACAGA GACTGCACTG TAACAGCAAC
CGGCTTACTT TACTCAATGT TCGGGATTTA AGTGCTTTAC AGGAACTGGA CTGCGTAGGT
AATGAACTTA CCTCGCTTGA TGTACATGGC GTAACCGCCT TATGGGAACT GGAATGCAGC
AAAAACATGC TTACTTTGCT CGATGTACAA AGTTTAACTT CCTTATCGAA GCTGGACTGT
TCCGCAAATC AACTTACCTC ACTAGACGTA CGGAACTTAG CAGCTTTGGA GGAACTGGAT
TGTAGTAACA ATAAACTTAC CGCACTTTAT GTGCAGGGAT TGAATGCGTT ACAGGAACTG
AACTGTTCCG AAAATGAGCT CACCTCACTC GAAATACAGG GTTTGACTGC TTTGGAAGTA
CTGGATTCCG GCCGCAATGA CCTTACCTCA CTCGATGTAC AAGGATTACC GGCTTTAAAG
ATACTGAGCT GTACTGTCAA TGAGCTTACC TCACTCAAAG TACGGGACTT ACCGGCCTTG
GAAAAACTGG ACTGTTCCGT AAATCAACTT ACCTCAATCG ATATACTAGA GTTAACCGCT
TTAAAGGAAC TGAACTGTTC CTTAAATCAA TTTACCTCAA TCAATATACT GAAGTTAACC
GCTTTAAAGG AACTGGACTG TTCCACAAAT CAACTTACCT CACTAGACGT ACGGAACTTA
GCAGCTTTGG AGAAACTGGA TTGCAGGGAC AATAAACTTA CCTCGCTTAA TGTGCAAGGA
TTGAATACAT TACAGAAATT GTACTGTTCC GAAAATGAGC TTACCTCACT CGAAATACAA
GGCTTAAAGA CGTTGCAGAA ACTGAACTGT TACAAAAACA AGCTTACCTC ACTCAATGTA
CAGGGGCTGA CCGCTTTACA GTGGCTGAAT TGCGGGTATA ATGAACTTAC AACACTTAAC
CTAAAAGGCT TACACGCATT GCGTGACCTG GAGTGCTTTA ACAATAATCT TCCTGAACTC
GACGTACAGG ACATAAACAC CTTACAAAGA CTGAACTGTT ATCATAATAA ACTTTCTACT
CTTGAACTAT CAACATTACA CGGTTTACAG GAACTGTGCT GCTATGATAA CCTTTTTAAC
GAAAAAACCC TTATCCGGAT ACTAACCGCC CTGCCCGATA GGAAGCAAAA AAAAGAAGGC
AGAGCTTTGA TATATGGTAA AAAAAATGAT CTAAGGGAAG GAACTATAAC GGATTTTTCT
TCTTCCGCCG AATTAAAAGC TGCTTTTGAA GCTGCAAAAG CAAAAAACTG GAGATTCTAT
AAACGGGATA CAGTTGGAAA TGAGGAAGAG GTTTAA
 
Protein sequence
MKKFLTVLFL TGLLTTRIAA AENPAKTAGS GRAILGISDD QKEIVVTAVT ADGSAVHVEG 
CTVTELPSGE ETILTATGAK VILKGAITKL DCGGNRLTEL NVQGLTALQK LFCDDNLLTS
LDVSGVTALQ SLSCGENLLT SLDVSGLTGL RELYCNRNHL SSLDVQSLTA LQDLFCNANK
LTSLNVQDLK VLQRLHCNSN RLTLLNVRDL SALQELDCVG NELTSLDVHG VTALWELECS
KNMLTLLDVQ SLTSLSKLDC SANQLTSLDV RNLAALEELD CSNNKLTALY VQGLNALQEL
NCSENELTSL EIQGLTALEV LDSGRNDLTS LDVQGLPALK ILSCTVNELT SLKVRDLPAL
EKLDCSVNQL TSIDILELTA LKELNCSLNQ FTSINILKLT ALKELDCSTN QLTSLDVRNL
AALEKLDCRD NKLTSLNVQG LNTLQKLYCS ENELTSLEIQ GLKTLQKLNC YKNKLTSLNV
QGLTALQWLN CGYNELTTLN LKGLHALRDL ECFNNNLPEL DVQDINTLQR LNCYHNKLST
LELSTLHGLQ ELCCYDNLFN EKTLIRILTA LPDRKQKKEG RALIYGKKND LREGTITDFS
SSAELKAAFE AAKAKNWRFY KRDTVGNEEE V