Gene TDE0447 details


Gene Information

Locus tag: TDE0447
Symbol:
ID: 2740429
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Treponema denticola ATCC 35405
Kingdom: Bacteria
Replicon accession: NC_002967
Strand:
Start bp: 502898
End bp: 505876
Gene Length: 2979 bp
Protein Length: 992 aa
Translation table: 11
GC content: 36%
IMG OID: 637159319
Product: TPR domain-containing protein
Protein accession: NP_971061
Protein GI: 42525963
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2956] Predicted N-acetylglucosaminyl transferase
TIGRFAM ID:
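
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 502898
end_bp = 505876
gene_length_bp = 2979
protein_length_aa = 992


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 505876 - 502898 + 1 gives 2979 bp, and 2979 / 3 - 1 gives 992 aa, which agrees with the listed fields.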


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000620141
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGAGTA ATTTGAACAT TATAATTGAG CAGGCTAATG CAGCTATTTT ATCTCGTGAC 
TATGCTTTTG CAGAAAAAAT TTTATTAAAT CAATTAAAAA AACAAAAAAA TACACCTGAT
GAGTATTATC AATTAAAAAA TCTTTTGGGA AAACTCTATA TCCGATCGGG CGATATGAAA
AAGGGCTTGG AAATATACAA AGAACTAAAT TCTCTTAATC CCAATAATTC AGATATTCTA
AACAATATGG GCGTTATCTA CCGCCGTCTT AACATGTTTA ATGAGTCGAT AGTTATCTTG
GAAAAGGCAA AGGCTATAGA CAGCAAAAAT GAAACAACTC TTTATAATCT AGGCAATACA
TATAAACAAA ACGGCGATTA CAAACATGCA ATACAATGTT TTACCGATGT ATTGGATATA
AAACCGGATG ATGCCCTTGC ATATAACCAT TTAGGAAGCG TATATTTTTT ATGCAAAGAT
TACCCTAAAG CTCTTGAAAC TTATAAGATC GGTTTAAAAG TAGATCCCAA TCATCCTTTT
TTGAATTTTA ATCTTGCAGA GTTGTATAAA GAAGAAAAGC ATTATAAAGA AGCGATTAAT
TCATATCAAA CTGCCATGAA AACAAAACCT AATTGGTATG AGGCTCTTGC GGCTATTGCC
GATTGTTATG TAGAAATGGA AGAACTCGGA AAGGCGATTG AAACCTATAA AATGATTATA
GGGTCAACAG GTCAATCGGA AGAAAATTTT ACAAAATTAG CTAAGCTTTA TGAAAAAATT
CATGAAGATA AAGATGCCGA AGATTTTTAT AAAAAAGCCG TTTCTATAAA TGGAAACTTT
TTGCCTGCAG TACTCGGATA TGCCAATATG CTTAAAGCAC AAAAAAGATA TTTTGATGCT
TATAATATTT TGATAAATAA TAAAGAGAAA TATCCTAATA ATAAAGAACT ACTATTGAGC
ACGGCAGAAG TTTGCTTGAT GCTTGAAGAT TATGCAAAGG CTAAAGAAAT CTTAAATCAT
TTAAGTAAAG AAATTAAGGG CGATAAAGAT GTTCTTAAAA TGCAGGGTAA GCTTTATTCG
GTATTAGGGG ATACAAAAAA AGCAGAGCTT ATTTTTGAAC ACTTACTTCA GTTGTCTCCG
AGCGAAATAA ATATGAGGGC CGAATTGGCT GATTTATATT TTCATAACGA TAAATATAAA
GAAGCCGCAA ATGAACTTAT CAAATATCTA AATGAAAAGC CTCAAGAAAT TTCTGCAAGG
TTAAAACTAG GAAAAGCTTA TGAACAAATG AAAAGGTATG ATTTAGCCAA GCATGAATAT
AATAAAATAA TAAAAAATGA TGCAAATAAT ACCGAAGCTC TTGCGGCAAT TTTGGAGCTG
AACAAAAATG AAGGCAATAC TGTTGAAGCT GTTCGTCTTG CAAATAAAAT TGTTGATATT
CAAACCGATA AAATAGAAAA TGACGATATA GGCAGTTTGT CCAAGTCCGT TCAGTTATAT
GAAGATGCCG TTAAGTCTTA CGGCGATGAC GGTATTCTGA ATAAAAATCT TGATAAGCTG
CGCCCTCCTC AAGAAGAACT TGATATAAGC CCTGAACCGG ATTTAGAAGA TTTGGGGGCG
GTAAATTTTG AAGATGATGA TGAAATAACT CTTAGCGAAA GCTTTGAAGA TATGCCTGAT
TTAGAGATGC CTTTTGATGA TTTAATGGAA TTGGCTGATG ATGAAGTTTT TGATCGAGGA
GAAGATGAAG ATTCTTTAGA TAATTTGGTC TATGTTGATG CACCCATTGA CGATAGTCCG
GACATAGGGG CCGAGTATGA TCCTCTTGAG CTGGGCAAGC CGGGGCCCAA TAGGAATCGA
AAGAACGATA TACCTGAAGA AGAGTTACAA CTTCAAGATA CTTCATCAAA AGGAGATGTT
CCTGCAAAAG AAACAACACC TACAAAGGAC TTTGCTCCTT CCCAATCTAA GCCATATCAA
TCTCCTTCAA GTGAATCGGC TCCTCTTCAA TCTCCGCCGT ATCAAGTACC TCCAAATGAA
CCGGCTTCTT ATCAAGCTCA GCCGCAGCAG CAATCAAGTA TACCGGAATC GGATTATCCC
CAAGATCTTA GCTCTGCCGC AAAATCCGGA AAGCCCGACA CAGCCGATAT GCCCTTTTCT
GCAGCAAAGC CTGACACAGC TGACAGCTTT GGTTCGGAAG ATGAAGGAAT CGAAATGGAA
CCGGAATTGG AAGCAGTACC GGAATCAAGG GACGCTTCTA GCCCTCTGTC TGAAAAACCG
GCAGATAAAA AAACGTCTCA AGCTTCGGGT AGTCCAAACC TTGATGAGAT GGATCTAAGT
GCTGAAGATT CTTTAAGTAA TGAAGATGAA TTAAATAATG AAAATGAGTT AATTGATGAT
CTGCCTGCCA GCTCGAATGT GCCTTTGTCC GAAGAAACAA ATTTAAGCTC TGATGATGAG
GATTTTTTAG ATCCTGCGGG TACGGCTCCC GACTCCCTTA TTCCTTCTTC CTTTGATGAT
GATTTGGATA CTTCCGATTC TGTTGATGAA ATTGTAAATA AACTTTCCGA TAAACCGTAT
CTTTCACTTC TGCCTCACAG CTTAAAAGAA TCTCCTCGAT TTGAAGAAGA GTTGGATATT
ATCGAGGGTT ATGAAATAGT TAATCTCTTT GTCTATTTGC GTGACTTGAT GGATAATCTT
CCTTCTATCG AATTAAAAGA TTTTCTTATA AGTAATGAGC GTATTCAAAT GGAATATGTT
ATAAATAAAT TGTCCGGAGA GGTCGGGTTA AAACGCAGGA TGATTCTCTT AAATGTAAAA
GGTGCTTTGA AAAAAACAAT AGACCCTAAA ACTTCTACGG ATAAAACTTT AAAGGATGTT
TTGGGCTATC TTAGAATAAT TGCATCTCAG TTGCCTGATA AGGGATTTGC TTCTGCCTGT
ATAGGAAAGA TCAATAGCCT GATAGAACAA ATCGAATAA
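
The gene sequence above is printed in blocks of 10 bases, so the spacing has to be removed before any analysis. The following is a minimal sketch of how the sequence could be cleaned, translated, and checked for GC content. It is not the database's own method. The helper names (clean, translate, gc_fraction) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for elongation codons. Alternative start codons permitted by table 11 are not handled, which is sufficient here because this gene begins with ATG.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on codons with ambiguous bases such as N.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence listed above.
first_line = "ATGTCGAGTA ATTTGAACAT TATAATTGAG CAGGCTAATG CAGCTATTTT ATCTCGTGAC"
last_line = "ATAGGAAAGA TCAATAGCCT GATAGAACAA ATCGAATAA"

print(translate(first_line))  # MSSNLNIIIEQANAAILSRD
print(translate(last_line))   # IGKINSLIEQIE
```

The two translated fragments match the start and end of the listed protein sequence. Passing the full gene sequence to gc_fraction would give a value to compare against the GC content field.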
 
Protein sequence
MSSNLNIIIE QANAAILSRD YAFAEKILLN QLKKQKNTPD EYYQLKNLLG KLYIRSGDMK 
KGLEIYKELN SLNPNNSDIL NNMGVIYRRL NMFNESIVIL EKAKAIDSKN ETTLYNLGNT
YKQNGDYKHA IQCFTDVLDI KPDDALAYNH LGSVYFLCKD YPKALETYKI GLKVDPNHPF
LNFNLAELYK EEKHYKEAIN SYQTAMKTKP NWYEALAAIA DCYVEMEELG KAIETYKMII
GSTGQSEENF TKLAKLYEKI HEDKDAEDFY KKAVSINGNF LPAVLGYANM LKAQKRYFDA
YNILINNKEK YPNNKELLLS TAEVCLMLED YAKAKEILNH LSKEIKGDKD VLKMQGKLYS
VLGDTKKAEL IFEHLLQLSP SEINMRAELA DLYFHNDKYK EAANELIKYL NEKPQEISAR
LKLGKAYEQM KRYDLAKHEY NKIIKNDANN TEALAAILEL NKNEGNTVEA VRLANKIVDI
QTDKIENDDI GSLSKSVQLY EDAVKSYGDD GILNKNLDKL RPPQEELDIS PEPDLEDLGA
VNFEDDDEIT LSESFEDMPD LEMPFDDLME LADDEVFDRG EDEDSLDNLV YVDAPIDDSP
DIGAEYDPLE LGKPGPNRNR KNDIPEEELQ LQDTSSKGDV PAKETTPTKD FAPSQSKPYQ
SPSSESAPLQ SPPYQVPPNE PASYQAQPQQ QSSIPESDYP QDLSSAAKSG KPDTADMPFS
AAKPDTADSF GSEDEGIEME PELEAVPESR DASSPLSEKP ADKKTSQASG SPNLDEMDLS
AEDSLSNEDE LNNENELIDD LPASSNVPLS EETNLSSDDE DFLDPAGTAP DSLIPSSFDD
DLDTSDSVDE IVNKLSDKPY LSLLPHSLKE SPRFEEELDI IEGYEIVNLF VYLRDLMDNL
PSIELKDFLI SNERIQMEYV INKLSGEVGL KRRMILLNVK GALKKTIDPK TSTDKTLKDV
LGYLRIIASQ LPDKGFASAC IGKINSLIEQ IE