Gene TDE0071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTDE0071 
Symbol 
ID2741686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTreponema denticola ATCC 35405 
KingdomBacteria 
Replicon accessionNC_002967 
Strand
Start bp78742 
End bp80982 
Gene Length2241 bp 
Protein Length746 aa 
Translation table11 
GC content42% 
IMG OID637158941 
ProductU32 family peptidase 
Protein accessionNP_970688 
Protein GI42525590 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACATAG AACTTTTAGC TCCGGCAGGG AATACTGAGG CCCTTGATGC TGCCGTAAAC 
GAAGGAGCCG ATGCCGTATA TTTAGGCTTA AAAACTTTTA ATGCGCGTAT GCGTTCCTCC
AATTTTGCAT GGAGCCAATT TGAAGCAGCC GTAGACAGCC TTCACAAGCG GAATAAAAAG
GTCTATGTTA CCGTTAATAC GGTTTTAACC GAAGAAGAAA CCGAAAAGAT GTACCGCTTT
TTAAAATACC TTGATTCAGT TTCTCCCGAC GGCATCATCG TTCAAGATTT GGGCATAGTT
CAAATGGCTA AAACACACTT TCCAAAATTA AAGCTTCACG CCTCAACCCA GATGAATATA
GCCTCAGCCA AGGCTGCAAA TGCTTTAAGC CGATTCGGAA TATCAAGGGT TGTTTTAGCA
AGGGAGCTTT CATTAAAAGA AATCGAAGCG GTCAACTTAA ACACATCCTG CGAGCTTGAG
GTTTTTGTGC ACGGAGCCCT ATGCGTCTGC CAATCGGGGC TATGTATGTT TTCTTCATAC
CTTGGAGGAA AATCAGCAAA CCGAGGGATG TGTGCCCAAG CCTGCCGAAG ACTTTACACG
GCCCACACAC CTCAAGGCGA CAAAGACGGC TATTATTTTT CTCCCCATGA CTTGCAGCTC
ATCGATTATG TTCCCGACCT TATAAAAGCG GGAGTTTCAT CCTTTAAAAT TGAAGGAAGA
ATGAAGAGTG CAGAATATGT AGGGACGGTA GTTTCAGCCT ACCGCCATGT AATAGACAAC
TGGGAAAAGA ATAAAAAGGA AGCTGTCGAA ACAGGCCGCC GCATTCTAGC CGGAGACTTT
GCCCGAAAAA AGACTAGCTT TTTATTTGTT TCGTCAAAGG CTGAGGAAAT ATTAAACCCG
AATCAGGCGG GCGGCACCGG TATTTTTTTA GGAACGATAG ACAAAACCGC CCAATTTAAG
ATTAAAGAAG TTGAAGGGAA AACAAAAGAA GAGCCCATGC GCAAGGTTCA CTATGTGCTT
TTAAAGGGCG GCTCCTATAC TCCCGATTTT GGGGATTCGA TTCGGCTTCA TACAAAGGAT
GACCGCGGAA GAGAAAGCTG GAAAATTCAA GATATAAGGA TTGAAAAAGC CTCTTCAAAA
AAATCAGGTT CGGCCGATGA AGTATGGATT CAGGTCCCTG CCGATTTCGG CATAGGCGAC
AGCGTTTACC TTTTGCAAAC TAAAAGTATG AGTAAGCGTT ATACTCCGGT GCTGCCCAAA
ACTTTGAGTC CTTTTAGAAA AAGGCCCGGA GACGACAAGC TTCCCGAACT TAAACTTTAT
CCCGAAGGGA ATCCTGCCGG TAATGAGAAT GTAAAGGAAG GAGCAAAAAA CAAACAGGAC
GACAAGCGGG CTAAAAAAAT TTTGAGTAAA TCCAATATTG ATATCTTCCC TGACGGTTTT
TATGTTCAAG TATCTTCAGT AAACAGCCTC CACACCATTC TTTCCGACAA ACCTGTGCGG
GTCATTATAA ACCTAAACGA AGATACAAGA GAAGCCTTAG CCGGTACGGG AAAAAGCGGC
AAACCTCTCC CTTTCTCAAA ACGGGAAATC TTTATTTCAC TTGATCCTTT TATGGGACAA
GCCGAAAGCG ATGAGCTTGA ACTCTATCTT ATGAGCCTAA TCGAAAAGGG TTATACTCAA
TTTGTAATCA ACAATCCTGC CCATATTACA ATGCTAAAAA ATAAAAATCT AAACCTTGTT
GCCGGCCCCT ATCTTTACAG CTTTAACCGC TGGGCTGTAA AATGGCTTCA AGAAAATAAT
ATTTTAAAGT TTATATCTCC CATCGAAAAC TCTCAAAAAA ACCTTGAAGA GGTTTATGCT
CCCGGTATGA GAAAACAGGT ACTGATCCCT GTTTTTGCCT ATCCTGCCCT ATTTAGAATG
AGGTTCACAC TTCCAAAAAC TTATGATTTC TTATATTTTT CCGATAAACA AGGAGAGGCT
TTCAAGGCTT TTTCAACACC GTCTGCTTCA TTTGTGCTAC CGGAAAAACC TTTTTCGATA
ACCGACCGCA TCGGCTCTTT GGAAAAAAAA GGCTTTGATA AATTCCTTCT TGATTTTTCT
CACACGGAAA TCGAACGCGG TGAGTACAGA CATATTGTAA ACTCATGCAG AAAAGGAATT
TTTCTCGAAG ATACTTCACG CTTCAACTGG AAAGAAGGTT TTTATGACCC TGTGAAAATT
GAGGCGAGAA AACAAAAGTA A
 
Protein sequence
MNIELLAPAG NTEALDAAVN EGADAVYLGL KTFNARMRSS NFAWSQFEAA VDSLHKRNKK 
VYVTVNTVLT EEETEKMYRF LKYLDSVSPD GIIVQDLGIV QMAKTHFPKL KLHASTQMNI
ASAKAANALS RFGISRVVLA RELSLKEIEA VNLNTSCELE VFVHGALCVC QSGLCMFSSY
LGGKSANRGM CAQACRRLYT AHTPQGDKDG YYFSPHDLQL IDYVPDLIKA GVSSFKIEGR
MKSAEYVGTV VSAYRHVIDN WEKNKKEAVE TGRRILAGDF ARKKTSFLFV SSKAEEILNP
NQAGGTGIFL GTIDKTAQFK IKEVEGKTKE EPMRKVHYVL LKGGSYTPDF GDSIRLHTKD
DRGRESWKIQ DIRIEKASSK KSGSADEVWI QVPADFGIGD SVYLLQTKSM SKRYTPVLPK
TLSPFRKRPG DDKLPELKLY PEGNPAGNEN VKEGAKNKQD DKRAKKILSK SNIDIFPDGF
YVQVSSVNSL HTILSDKPVR VIINLNEDTR EALAGTGKSG KPLPFSKREI FISLDPFMGQ
AESDELELYL MSLIEKGYTQ FVINNPAHIT MLKNKNLNLV AGPYLYSFNR WAVKWLQENN
ILKFISPIEN SQKNLEEVYA PGMRKQVLIP VFAYPALFRM RFTLPKTYDF LYFSDKQGEA
FKAFSTPSAS FVLPEKPFSI TDRIGSLEKK GFDKFLLDFS HTEIERGEYR HIVNSCRKGI
FLEDTSRFNW KEGFYDPVKI EARKQK