Gene TDE0654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTDE0654 
Symbol 
ID2739202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTreponema denticola ATCC 35405 
KingdomBacteria 
Replicon accessionNC_002967 
Strand
Start bp689746 
End bp690918 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content43% 
IMG OID637159530 
ProductM20/M25/M40 family peptidase 
Protein accessionNP_971267 
Protein GI42526169 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0191847 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGATATAT TAAAAAAGGT AAAAGAAATA GAAAAAGACA TAATTTCTTG GCGCCGCCAT 
TTGCATCAAA ATCCTGAGGT TGGCTTTGAA CTTCCTAATA CAATAGATTT TGTATGTAAA
AAATTGGATG AGTTCGGGAT TAAGTATGAC AGAAATGCGG CAAAAAGTGC CGTTATAGGT
TATATTCACG GTGCAGAAAA AGGAGATGTT ATTGCTCTGC GTGCAGACAT GGATGCCCTT
CCTGTTTGCG AAGCTACCGG GCTTGACTTT GCTTCTAAGA ATTCCTTTAT GCACGCTTGC
GGTCATGATG CTCATACTTC GATATTGCTT GGGGCTGCAA AGGTACTAAA CGATTTAAAG
GGCAGTTTTA AAGGAACCGT TAAGCTTATC TTCCAACCTG CGGAAGAACT GGGAACAGGC
TCTGTAGACA TCTGTGAAAA AGGAATTCTT GATGACGTAA AAGAAATCAT CGGTCTTCAT
GTAGGCTGTA TAAGCGATGA AGCAAAACCC GGCGAATTCC TTTTTTCAAA GGGCTCGATG
ATGGCCTGTA TGGATAAATT TTCAATTAAG GTTAAGGGCG TAGGAGCTCA CGGAGCTTAT
CCATCACTTT CAGTAGACCC CGTTGTAATT GGGTCTCACA TAGTTGTCGC CATACAGGAA
ATCTTAGGCC GAGAGGTACA TCCTACGGAG CCGGCTGTAA TAACGGTTGG ACAATTCCAT
TCAGGCTCGG CATTCAATAT AATTCCGCCT GAAGCTTATC TTGAAGGAAC CGTACGGGCC
GTAACAAATG AGACGAGGGA ATTGATAGCA AAACGGATTG AAGAAGTTGC CTCCAATATT
GCAAAAGCTT TTAGAGGTTC AATTGAATAC CAATTCTTTA GACAGCCGCC TCCTCTTATA
AACGATGCGA AAGTTACGGA TAAGGCTATG GGAGCCGCCA AGGAGCTTTT CCCGAATGAC
GTTAAGCTTA TGCAGCGGCC GGTCATGGGA GGAGAAGATT TTGCATGGTA CTTAGAAAAA
GTTCCGGGTT CATTTATCTT CTTATCGACT CCATCCCCCA TTGAAGGAAA AGTCTGGCCC
CACCACAATC CCAAATTTGC CTTAGATGAA TCGCAGTTTT ACAAAGGTAC TGCTCTTTTT
GTAGCTTATG TAATGCAGGA GCTTGGTAAA TAA
 
Protein sequence
MDILKKVKEI EKDIISWRRH LHQNPEVGFE LPNTIDFVCK KLDEFGIKYD RNAAKSAVIG 
YIHGAEKGDV IALRADMDAL PVCEATGLDF ASKNSFMHAC GHDAHTSILL GAAKVLNDLK
GSFKGTVKLI FQPAEELGTG SVDICEKGIL DDVKEIIGLH VGCISDEAKP GEFLFSKGSM
MACMDKFSIK VKGVGAHGAY PSLSVDPVVI GSHIVVAIQE ILGREVHPTE PAVITVGQFH
SGSAFNIIPP EAYLEGTVRA VTNETRELIA KRIEEVASNI AKAFRGSIEY QFFRQPPPLI
NDAKVTDKAM GAAKELFPND VKLMQRPVMG GEDFAWYLEK VPGSFIFLST PSPIEGKVWP
HHNPKFALDE SQFYKGTALF VAYVMQELGK