Gene TDE1195 details


Gene Information

Locus tag: TDE1195
Symbol:
ID: 2741156
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Treponema denticola ATCC 35405
Kingdom: Bacteria
Replicon accession: NC_002967
Strand:
Start bp: 1227724
End bp: 1229781
Gene Length: 2058 bp
Protein Length: 685 aa
Translation table: 11
GC content: 41%
IMG OID: 637160073
Product: prolyl endopeptidase
Protein accession: NP_971802
Protein GI: 42526704
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1505] Serine proteases of the peptidase family S9A
TIGRFAM ID:
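
The coordinate and length fields above are related by simple arithmetic, which can serve as a consistency check on the record. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1227724
END_BP = 1229781
GENE_LENGTH_BP = 2058
PROTEIN_LENGTH_AA = 685


def span_length(start, end):
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def coding_codons(length_bp):
    """Amino-acid codons in a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 2058, matches Gene Length
print(coding_codons(GENE_LENGTH_BP))   # 685, matches Protein Length
```

Under those assumptions the listed gene length and protein length both agree with the coordinates.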


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00494828
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAATATA AAAAATCGGA TGTTTCCGAC AATTATTTTG GAACCATCGT GCCTGATCCG 
TACCGATGGC TTGAAGACGA TAATGCACCC GAAGTCATAG CTTGGGTTAA AGAAGAAAAT
AAAAAAACCG AAGATTTTTT ATCCAAAATC TCTTTCAGAG GAGAGCTAAA AAAACGGCTT
GAAGAAATTT GGGATTATGA AAAACGTTCA GGTCTTTTTA AGGCAGGAAA TTTCTATTAT
TTTTTTAGAA CGGAAGGCTT ACAAAATCAA AGCATTATGT GCCGCCAAAG CGGAAACATA
AAGGCGGAAA GCTCTCCTGA AGTCTTTTTT GATCCGAATA AGCTAAGCGC GGACGGAACT
ACGGCCTTAA AAAATCTTGC CTTTTCCAAG GATGGAAAAT ACATGGCCTA CTCCGTATCG
GGAAGCGGCT CCGACTGGGA AGAAATCTTT GTCTTTGATG CCGAAAAAAA AGCCGATACG
GGAGAACACA TCCACTGGGT AAAATTTTCC AATATTGCAT GGTATAAGGA CGGTTTTTTT
TACAGCTCAT ACGATACTCC CGATAAAGGA AAATCTTTAA CCGAAAAAAA CGAATTCCAA
AAGTTAAAAT ACCATAAACT TGGAACAAAA GAAAGCGATG ACCTTCTCAT TTTTGAGGAC
AAGGATCATC CCCTGCGCTC TTTTTCTGCA AGTACAACTG AAGACGAGAA AACCCTCCTT
CTTACCGCTT TTGAAGTAGG AAGTGAGGGC AATATGCTCT TTGTTGCGGA TCTAAGCGAA
GGTCTTCCGA AATGTTCACA CTGCTTTAAA CAATACAACA CTCATTTTAA TGACAGTGTC
TGGCCCCTTG AAACCGAAAA CGGCTTTTTA TATTTATTAA CAAATAAACA AGCTCCATTT
TACCGAGTTG TAAAGACATC TTTAAACAAT ATAAGTGAAA AGTCCATCGA TGAAGTAATC
CCTCAAAAAG ACTGCCTTTT ATCAAGCGCG GCCCTTTGCG GAGGAAAACT TCTTACGGTT
TACTTGAGGG ATGTTCAGGA TGAGGCCTTT ATCTGCGGCC TTGACGGAAA AAATAGCACA
AAAATAAATC TGCCTGCAAA TGGAAGTATT TCTTTTTCAG GAACACGAAA AAATGAAGAC
TCTTTATTTT TCAATTTTAC CTCTTATACA ACTCCCAACA AAATCATACG CTATGATATA
AAAACAAACA GTTTAACCGA CTTTTTTGTT CCTGCCATTC CAATCAACAC AGGAGATTTT
AAATGCGAAC AGGTCTTTTT TAAGAGCAAG GACGGAACAA AAATTCCTAT GCACATTGTT
TCAAAAAAAG ATATTAAACT CGATGGAAGT AACCCTACGA TTATGTATGG GTACGGAGGC
TTTGCTATTT CTCTTCCACC TGCCTTTTCT GCAGCCAGAA TGGCCTTTTT GGAAAAAGGA
GGCATCTTTG CCTGCGTAAA TTTACGCGGC GGCCTTGAAT ACGGAGAAGC ATGGCACTCG
GCAGGAAAAA AGATGAAAAA ACAAAACGTC TTCGACGATT TTATTGCAGC CGGAGAATAT
TTGATAGAAC ACAAATATAC TTCAAGCAAA AAACTTGCAA TTCAAGGAGG CTCAAACGGA
GGCCTTTTAA TAGGAGCCGT AACAAACCAA CGCCCCGATC TTTTTGCCGT TGCAATCCCT
CAGGTTGGAG TCTTGGACAT GCTCCGCTAC CAGCATTTTA CCATAGGCTG GGCTTGGGTC
GATGAATACG GAAGCAGCGA GGACAGTAAG GAGATGTTTG AATATCTTTA TGCTTACTCG
CCCCTCCATA ACGTAAAAGA AGGAGTCAAT TATCCTTCCA TTATGGTATG TACGGGAGAC
CATGATGACA GGGTTGTTCC TGCACACTCC TTTAAGTATG CTCAAGCCTT GCACGATACT
TACAAGGGAG AAAACCCTAT CCTCATCCGT ATAACCGAAA AAGCGGGCCA CGGAGCCGGC
AAACCCACTG CAAAGATAAT AGAAGAAACG GCGGATATCT ACGCCTTTAT CTTTAAGCAA
ACCGGTCATA TAATCTAA
 
Protein sequence
MQYKKSDVSD NYFGTIVPDP YRWLEDDNAP EVIAWVKEEN KKTEDFLSKI SFRGELKKRL 
EEIWDYEKRS GLFKAGNFYY FFRTEGLQNQ SIMCRQSGNI KAESSPEVFF DPNKLSADGT
TALKNLAFSK DGKYMAYSVS GSGSDWEEIF VFDAEKKADT GEHIHWVKFS NIAWYKDGFF
YSSYDTPDKG KSLTEKNEFQ KLKYHKLGTK ESDDLLIFED KDHPLRSFSA STTEDEKTLL
LTAFEVGSEG NMLFVADLSE GLPKCSHCFK QYNTHFNDSV WPLETENGFL YLLTNKQAPF
YRVVKTSLNN ISEKSIDEVI PQKDCLLSSA ALCGGKLLTV YLRDVQDEAF ICGLDGKNST
KINLPANGSI SFSGTRKNED SLFFNFTSYT TPNKIIRYDI KTNSLTDFFV PAIPINTGDF
KCEQVFFKSK DGTKIPMHIV SKKDIKLDGS NPTIMYGYGG FAISLPPAFS AARMAFLEKG
GIFACVNLRG GLEYGEAWHS AGKKMKKQNV FDDFIAAGEY LIEHKYTSSK KLAIQGGSNG
GLLIGAVTNQ RPDLFAVAIP QVGVLDMLRY QHFTIGWAWV DEYGSSEDSK EMFEYLYAYS
PLHNVKEGVN YPSIMVCTGD HDDRVVPAHS FKYAQALHDT YKGENPILIR ITEKAGHGAG
KPTAKIIEET ADIYAFIFKQ TGHII
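
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by computing its GC fraction. The following is a minimal sketch, not the method used to produce this record. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables are understood to differ only in permitted start codons), and the helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G.
# Assumption: translation table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(text):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last blocks of the gene sequence listed above.
first_block = clean("ATGCAATATA AAAAATCGGA TGTTTCCGAC")
last_block = clean("ACCGGTCATA TAATCTAA")

print(translate(first_block))  # MQYKKSDVSD
print(translate(last_block))   # TGHII
```

Pasting the full gene sequence into `clean` and passing the result to `translate` and `gc_fraction` would allow comparison against the listed protein sequence and the GC content field.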