Gene TDE2020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTDE2020 
Symbol 
ID2740367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTreponema denticola ATCC 35405 
KingdomBacteria 
Replicon accessionNC_002967 
Strand
Start bp2040824 
End bp2044246 
Gene Length3423 bp 
Protein Length1140 aa 
Translation table11 
GC content39% 
IMG OID637160910 
ProductYD repeat-containing protein 
Protein accessionNP_972623 
Protein GI42527525 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATTA GTTATTGTCG TATACAACCA AACTATCTCT TTTCTCGGAC AGCGATTGCA 
AGGGAGCAAA AGCAATTTAT AAAAATTGCT TTTGCTGTAC ATCTTTCCGC AGTAATTTCT
GCGGAAAGAT ACCGAAGCGA TAGCGTAGCC CTGAAAAGCG CGGTGTCGGT ATTTGTGTAC
TTTTATAGTG TATTGATGGT ACACAAATAC TGGCAGTCCG CCAAAAGAAA GTTATTATTA
AACAGTCCCA CTGACAGGAC TTTTGAAAAT AGGTGGAGCA GATACAAGGA GCCTAGTAAA
AATAAAGCGG AGGCGTATCG GGTATACGTC GAGCATTTAT TTTTGCGTAG CGACGCAGGA
GATGCCCGCA TATTTTCAAA AGAGGTAATC CACGACGGTC TAGGACGCAT AAACTACACA
GCCAAAGAAG GAGAAGTATA CATAGACGGA ACCGATTACC AAACCCAAAC AGGCTGGAAT
GTTTCGGGAG CGGTTTATTA TGATGAAGAC GGAAGAAAAA AAGGAGAAGG ACAGCCTGTG
TTCTATGGAG GAGATATACA AAATGAATTA AGCGGAAGCT CAAGCCAAAT CCTTTTATAT
GAAAAATTAA ACGAATTAAA ACATCCTACA AGTTATGAAT ATGACGGACT AGGCCGCGTA
ATAAAAACAA CCCTTCCCGA CGGAAACATA CAGCGTAATG AATACTTGAT AGATTCTTCC
TTACAAATAA CAAAAACAAC CGACCCTAAA GAAAACATAA ACATCAGTAA AAAAGATATA
AGAGGAAACA TAAAAGAAGT AGAAAGACGT GATAAAAACA ATACCTTACT AACCAAAGCC
CGATACGAAT ATTCGGTATT AGGAGAAATG TTAAAAGCCT ACGACGCTAA AGACAACTTA
TTGGCAGTAA ACTATGACAT GCTCGGAAGG CGCATAAGTT TAGAAAGCCT TGATATGGGA
AGAAAAGAAT GGAACTATGA CGATAAAGGA AGACTTGAGT ATGAAAACGA CTCCGTATTA
AGATCAAAAT TAGCGTCAAT AAAATATGAA TACGACGGAC TGGACAGAAT AATAAAAATA
GACTATCCTT TTAGTGAAGA TGTAGAATAT GAATACGGAG CAGCGGGAGA AAAAGGAGCG
GGAGAGGTAA TCCGCAAAAA AGATGAAACG GGAGAGACAA TGTACAGCTA CGGCCTACTA
AACGAGGTAA AGGTAGAAAC ACGGACAATA AAGAGGGGCA GAGAATTTCA AAAACCCGTA
ACTGCCGTGT TTAACTATGA AGCCGATTAC TTAGGCCGAA TGCAAAGCAT AAGCTACCCT
GACGGAGAAG TGCTGACTTA CAACTACGAC AAGGGCGGAC AATTAAAAGG AGTTATAGGT
AAAAAAGGAA TAGAAACCTA TAGGTATGTA GACAATATCT TATACGATGA ACACGGTCAG
AGAGTCTACA TCAAATACGG AAACGGAGTA GAAACAAGAT ATACTTACGA CCCTGCACGG
CGTTGGCTAA AAGACATAAA GACAGAAAAC AAAGATAAGA ATTTGGTATT CCAAAAAATA
AACTATAACT TTGATGCAGT AGGGAATGTA GAAGGCTATA TAAATACTTC AAGCAAATAT
GAAACAAGTC AAAGCTACAG TTATGACAAT TTGTACCAAC TTATAAAAGC CGAAGGCACG
CACAAACAAT ACGGAGGAAT AAACCCTAAC CCCGATAACC CGCATCCTTC AAATCCCTTA
TACACAAATA AATACAGACA AACCTTTGCC TTCGACATAA TAGGGAACAT GACGAATAAG
AGCAGCACCA CAAACCTTCC CGGAGGCTCA ATAGGAACTA CGGATGACAC AAAACTAAAC
TATGAACTGG ATTATGAGTA TGACAGTAAA TATGCACACC GCTTAATAAG AGCCGGAACC
CGATACTATC GCTACGACGC CAACGGCAAC ATCACTGCAG AAAAAGACGG AAAGTTCAGC
GACAAAGAAG AGCTTACCTT TACCTATTCC TACTTTGCAG AACATGACGT ATACGGAGTA
GATTACGGCT TTGACCTAGA ACCCCCTGAA GATGACCCTG CAAACCTAGA AAGCGGAGGC
ACAACAAGTA CTACACCTAC AGGAGGCTAC AGAAGAGATT ATACATGGAA TGAACGCAAC
CTATTGATAA AATCAGACGA TAAGTTAAAC ACCGTAATAT ACCGCTACGG AGATGACGGA
CAGAGAGCCT TAAAGTTTAC TCAACAGAGT AATAGCGAAA CTCTTTATTT CAATAACTTT
TACTCCGTAC ACCAAGTTGC CCATGAACCT AATCACGAAC ATGGATTACG AGTAAGCAAA
CACATCTTTG TCGGAAACTC AAGATTAGTA ACTGCAATGA CGCACGCAGA CAACCACGGC
GACACAACCG AGCAGACAGA AAAAAGATAT TATTATCATG CAGATCATCT GCAAAGTGCA
CAGTTTATAA CAAATGCTAA AGGAGAGCAG TATGAACACA TAGAATACAC GCCTTACGGA
GAACTCTGGA TTGAAGAGAC TGCACCGGGG ATAGATAAGT TACCGTTTAG GTTTACCGGC
AAGGAACTGG ATGAAGAGAC TGGACTGTAC TATTATGGTG CGAGGTATTT AGACCCGAAA
TATTCGAGAT GGTTGAGTGG AGATCCTGCG TTAGGTGAGT ATATTCCGCA AGCTCCGGTA
AATGACGAAG CTAAAAAACA CAACGAGAAC CTGCCCGGAA TGGGAGGTAT ATACAACACG
GTAAATTTAC ATGTTTATCA TTATGCGGGG AATAACCCGG TGAAGTATGT GGATCCGAAT
GGAGAAAAAA TACTGGATGT TAGTACGACT TTGGTTCAAG AAAACGGAGA AGAAGATCCC
TTAGGTAAGG GTAATACTAC AATTGCTTCA CATGGCTGTG TTTTAACAGC ATATACTAGA
ATTGCAAATG CGATAAGTGC TATTGGTGAA CGTACATTAC AAAACGCTAA TTCATTGGCT
CTTAAGAAGG ATATTTATGA TGGAAATAAC AAAAATTTAA TATTTAACTC TCCTACTGCT
TCCAAATTAG TAAATTCCAT ATCTATTCTA AGTGAATATA AAATGTCATT TTATAAATCA
TTAAATAATA TTAGCTTAAA TGAATATGGT AGCGAATTGG AAAAACTAAG CAAATCAGAA
GACTTATATG CTATTACCAT TAGAGTTTCA AATAAGTATG GAGGCCATAC TGTCAACATG
AATTCTGATG GAATTATTGT GGATACTGAA GATAATTATA CCATCAAAAT TAATAACACC
TCAGAAAAAG GATATTTACC GTCAACGGTT GTTAATGATA TCATAACGGA TTCTTCAGCT
ATAATCCGTA TTGATATATT TAAGATAGAA AAGGTAAAAA CTGTTCCATT TTTACCCGAA
TAA
 
Protein sequence
MKISYCRIQP NYLFSRTAIA REQKQFIKIA FAVHLSAVIS AERYRSDSVA LKSAVSVFVY 
FYSVLMVHKY WQSAKRKLLL NSPTDRTFEN RWSRYKEPSK NKAEAYRVYV EHLFLRSDAG
DARIFSKEVI HDGLGRINYT AKEGEVYIDG TDYQTQTGWN VSGAVYYDED GRKKGEGQPV
FYGGDIQNEL SGSSSQILLY EKLNELKHPT SYEYDGLGRV IKTTLPDGNI QRNEYLIDSS
LQITKTTDPK ENINISKKDI RGNIKEVERR DKNNTLLTKA RYEYSVLGEM LKAYDAKDNL
LAVNYDMLGR RISLESLDMG RKEWNYDDKG RLEYENDSVL RSKLASIKYE YDGLDRIIKI
DYPFSEDVEY EYGAAGEKGA GEVIRKKDET GETMYSYGLL NEVKVETRTI KRGREFQKPV
TAVFNYEADY LGRMQSISYP DGEVLTYNYD KGGQLKGVIG KKGIETYRYV DNILYDEHGQ
RVYIKYGNGV ETRYTYDPAR RWLKDIKTEN KDKNLVFQKI NYNFDAVGNV EGYINTSSKY
ETSQSYSYDN LYQLIKAEGT HKQYGGINPN PDNPHPSNPL YTNKYRQTFA FDIIGNMTNK
SSTTNLPGGS IGTTDDTKLN YELDYEYDSK YAHRLIRAGT RYYRYDANGN ITAEKDGKFS
DKEELTFTYS YFAEHDVYGV DYGFDLEPPE DDPANLESGG TTSTTPTGGY RRDYTWNERN
LLIKSDDKLN TVIYRYGDDG QRALKFTQQS NSETLYFNNF YSVHQVAHEP NHEHGLRVSK
HIFVGNSRLV TAMTHADNHG DTTEQTEKRY YYHADHLQSA QFITNAKGEQ YEHIEYTPYG
ELWIEETAPG IDKLPFRFTG KELDEETGLY YYGARYLDPK YSRWLSGDPA LGEYIPQAPV
NDEAKKHNEN LPGMGGIYNT VNLHVYHYAG NNPVKYVDPN GEKILDVSTT LVQENGEEDP
LGKGNTTIAS HGCVLTAYTR IANAISAIGE RTLQNANSLA LKKDIYDGNN KNLIFNSPTA
SKLVNSISIL SEYKMSFYKS LNNISLNEYG SELEKLSKSE DLYAITIRVS NKYGGHTVNM
NSDGIIVDTE DNYTIKINNT SEKGYLPSTV VNDIITDSSA IIRIDIFKIE KVKTVPFLPE