Gene TDE2228 details

Gene Information

Locus tag: TDE2228
Symbol:
ID: 2741333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Treponema denticola ATCC 35405
Kingdom: Bacteria
Replicon accession: NC_002967
Strand:
Start bp: 2267377
End bp: 2268822
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 40%
IMG OID: 637161117
Product: aminoacyl-histidine dipeptidase, putative
Protein accession: NP_972828
Protein GI: 42527730
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01893] aminoacyl-histidine dipeptidase
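
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: the function name `check_gene_record` is hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check coordinates, gene length and protein length.

    Assumes 1-based inclusive coordinates and a CDS whose length
    includes one terminal stop codon (both are assumptions).
    """
    span = end_bp - start_bp + 1
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_length": span == gene_length_bp,
        "length_is_whole_codons": remainder == 0,
        # One codon is the stop codon and yields no amino acid.
        "protein_matches_codons": codons - 1 == protein_length_aa,
    }


# Values taken from the Gene Information table above.
result = check_gene_record(2267377, 2268822, 1446, 481)
print(result)
```

Under these assumptions all three checks pass for TDE2228: the span is 1446 bp, which is 482 codons, or 481 amino acids plus a stop codon.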


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.371488
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCCAT TACAGAACAC TGAACCTAAG GAAGTATTTA AATGGTTTTA CGAAATCTCT 
CAAGTGCCGA GAGGTTCGGG AAACGAAAGA GCTATTAGCG ATTTTCTTGT AAAATTTGCA
AAAGATAGAA ATCTTGAAGT ACATCAAGAT AAGGCTATGA ATGTTATCAT AAAGAAGCCC
GGAACTGCCG GCTATGAAAA ATCTCCGACA GTTATTATTC AGGGACACAT GGATATGGTT
TGTGAAAAGG ATGCTTCCTC AAATCATGAT TTTTTAAAGG ATCCTATTAA ATTCGTTGTA
AAGGGAGAAA TGCTCTATGC CGATAAGACA ACCCTTGGAG GAGATGACGG TATAGCAGTC
GCATACGCTC TTACCGTCCT TGACTCAAAG GATATTCCCC ATCCGCCGCT CGAAGTTTTG
ATTACGACAG AAGAGGAAAC AGGGATGGGC GGAGCTATGG CTCTTACCGA TGAACACCTG
CAAGGAACAC GCCTTTTAAA TATAGATTCG GAAGAAGAAG GCGTCTTTTT GGTAAGCTGC
GCAGGCGGAT CCAATATTAA TATTTTTTTC GATATAAAGA AAGAAGCAGC CAAGGGAACA
TTCTTAAAAA TCACTGTCGG AGGTCTTCTC GGAGGACATT CGGGTATCGA AATAAACAAG
CAGAGAGCCA ACTCAATTAA ACTTTTGGGA AGAATTCTGT ATAACATCAA GCAAAACGAA
AAAATCAATA TAGTAGAAAT TTCAGGCGGT TCAAAACACA ATGCTATTGC AAAGGATGCC
CATGCTGTTA TAGCGGTTGA AAATAAGGAA GCCGTTTTGA AAATTGTCGA AAAACTTGCT
GCCGATTTTA AGGGCGAATA CAGAGCTGTT GATAAACTTT TAACTGTTAC TGCAAATGAA
ACGCAGAATT CTTCAGGCCA AATGTTTACA AAAGAGCTTA CCTTAAATCT AATTGATTTT
ATGGCAAGTA TTCCCAATGG TGTTCAATAT ATGAGCATGG AGATTCACGG CCTTGTTCAA
ACAAGTTTAA ATAACGGAGT TTTGGAAGAA ATTGATGGAA GAATCAAATT TACAACCTCT
GTACGAAGCA GTGTAAAGAG TGCCTTGGAT GAAATTGTGG ACATACTTAG AATCCAAGCC
GAGCGCTGCG GAGCCGAATT CAAAAAGGTT TCGGAGTATC CCGCTTGGGA GTACAGTCCC
GATTCTCCTG TACGCGATGC TGCCGTCAAT GTTTACAAAA AGCTTAACAA AAAAGAACCC
GTTATTACGG CCATCCACGC AGGGCTTGAA TGCGGTCTTT TAAAGAAAAC CCTTCCCAAT
GTAGATGCTG TAAGTTTCGG GCCCAATTTG TATGATGTTC ATACTCCTAA CGAACATATG
GACATTGCCT CTGTAGAACG TGTATGGAAG TTCTTGCTGG CTTATTTGGC CGAATTAAAG
AATTAA
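
The gene sequence above is printed in space-separated blocks of ten bases. A minimal sketch of how such a listing could be joined into one string and its GC fraction computed is shown here; the helper names `clean_sequence` and `gc_content` are hypothetical, and the demonstration uses only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(text):
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in a cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used only as a short demonstration.
demo = clean_sequence("ATGAATCCAT TACAGAACAC")
print(len(demo), round(gc_content(demo), 2))
```

Applying the same two functions to the complete listing gives a value that can be compared with the GC content reported in the Gene Information table.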
 
Protein sequence
MNPLQNTEPK EVFKWFYEIS QVPRGSGNER AISDFLVKFA KDRNLEVHQD KAMNVIIKKP 
GTAGYEKSPT VIIQGHMDMV CEKDASSNHD FLKDPIKFVV KGEMLYADKT TLGGDDGIAV
AYALTVLDSK DIPHPPLEVL ITTEEETGMG GAMALTDEHL QGTRLLNIDS EEEGVFLVSC
AGGSNINIFF DIKKEAAKGT FLKITVGGLL GGHSGIEINK QRANSIKLLG RILYNIKQNE
KINIVEISGG SKHNAIAKDA HAVIAVENKE AVLKIVEKLA ADFKGEYRAV DKLLTVTANE
TQNSSGQMFT KELTLNLIDF MASIPNGVQY MSMEIHGLVQ TSLNNGVLEE IDGRIKFTTS
VRSSVKSALD EIVDILRIQA ERCGAEFKKV SEYPAWEYSP DSPVRDAAVN VYKKLNKKEP
VITAIHAGLE CGLLKKTLPN VDAVSFGPNL YDVHTPNEHM DIASVERVWK FLLAYLAELK
N
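
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the standard codon assignments, which translation table 11 shares with the standard code for amino acids (the tables differ in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The `translate` function and the compact table construction are illustrative, not the database's own method.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGAATCCAT TACAGAACAC TGAACCTAAG"))
```

The first thirty bases translate to MNPLQNTEPK, which matches the first block of the protein sequence, and the final six bases (AATTAA) give N followed by a stop codon, which matches its last residue.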