Gene TDE1966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTDE1966 
SymbolhtrA-1 
ID2741529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTreponema denticola ATCC 35405 
KingdomBacteria 
Replicon accessionNC_002967 
Strand
Start bp1980413 
End bp1981690 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content42% 
IMG OID637160856 
Producttrypsin domain/PDZ domain-containing protein 
Protein accessionNP_972569 
Protein GI42527471 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000431561 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTTT ATAGTAGAAG ACAGACCCTC GTATTTTCGC TCATTGCAGC GGTTATTTTT 
GCAAGTGCAG GTTTTTTTGC CGGTATAAAA TATAGTACAG GAAACGCCGG CTCGACTGGA
ATTCAAAGCG GAACCTCAAG CAACCCTGCC GATTTTGAAG AAAGTGCAGA AAACGGATTT
GCTCAGACGG AAAATTCGCA CAATTTAAAT ATGCAGCAGC ATGGAAATAC GGCAGCTTTA
AACACTGCAA ATGAAGCAGG ATACATGGGC TATACTCCTG CCGAATCACA GAATATTCGT
GTATATGAAT CGACCAATGA AGCTGTCGTA AACATAACCA CCGAAACTAT GGGAGCAAAC
TGGTTTTTTG AGCCTGTTCC GGTTGAAGGC AGTTCGGGTT CAGGCTCCAT AATCGACGAA
AGCGGATTGG TACTGACCAA TGCACATGTA ATTTCAGAAG CTTCAAAGAT TTATATTTCT
CTTTCTGACG GAAGTCAGTA CGAGGCAAAA GTAGTAGGAA CGGATGCCGA AAACGATTTG
GCTGTTTTAA AATTTGATCC GCCTAAAAAT ATTAAACTTA CGGTAATAAA ATTAGGAGAC
TCAACCAATT TAAAAGTCGG CCAAAGAGTT TTAGCTATCG GAAACCCTTT CGGATTGGAA
AGAACTCTTA CAGACGGAAT AGTCTCGGCA CTGAAACGCC CGATTCAAAA CGATAAAAAC
ATTATCATCA AAAATATGAT TCAAACCGAT ACGGCAATTA ACCCCGGAAA CTCAGGCGGT
CCTCTTTTAG ACACTCAAGG AAGAATGATA GGAATAAATA CCATGATCTA TTCCACATCG
GGAAGCTCAG CCGGAGTAGG CTTTGCTGTT CCCGTAAATA CGGCTAAAAG AGTTGTTGCA
GATATCTTAA AATACGGAAA GGTTATCCGC GGTTCCATCG ATGCCGATTT GGTTCAAGTT
TCAGGAAGAC TAGCCTCTTA TGCAAAACTC CCCGTTTCTT ACGGTCTCCT TGTTTCCGAA
GTAAAAAAAG GAAGCAATGC GGCAAAGGCC GGCCTTCGCG GAGGAAATGA AGCTGTGCGG
TCAGGAGTGG GCAGATACAG TTCCGTCTTT TACATAGGCG GCGATATCAT TGTCGAAATA
GCCGGACAAA AGATAAATAA CATAACAGAT TATTATTCGG TACTGGAGGA TAAAAAACCC
GGTGAAACGG TAAAGGTTAA AATTGTCAGA GGGAAAAAAC TTGTCGATTT AAGCTTAACC
TTATCGGAAC GAAACTAA
 
Protein sequence
MKLYSRRQTL VFSLIAAVIF ASAGFFAGIK YSTGNAGSTG IQSGTSSNPA DFEESAENGF 
AQTENSHNLN MQQHGNTAAL NTANEAGYMG YTPAESQNIR VYESTNEAVV NITTETMGAN
WFFEPVPVEG SSGSGSIIDE SGLVLTNAHV ISEASKIYIS LSDGSQYEAK VVGTDAENDL
AVLKFDPPKN IKLTVIKLGD STNLKVGQRV LAIGNPFGLE RTLTDGIVSA LKRPIQNDKN
IIIKNMIQTD TAINPGNSGG PLLDTQGRMI GINTMIYSTS GSSAGVGFAV PVNTAKRVVA
DILKYGKVIR GSIDADLVQV SGRLASYAKL PVSYGLLVSE VKKGSNAAKA GLRGGNEAVR
SGVGRYSSVF YIGGDIIVEI AGQKINNITD YYSVLEDKKP GETVKVKIVR GKKLVDLSLT
LSERN