Gene TDE2101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTDE2101 
Symbol 
ID2740340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTreponema denticola ATCC 35405 
KingdomBacteria 
Replicon accessionNC_002967 
Strand
Start bp2125639 
End bp2126709 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content46% 
IMG OID637160991 
Producthypothetical protein 
Protein accessionNP_972702 
Protein GI42527604 
COG category[S] Function unknown 
COG ID[COG1262] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00076564 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT TTTTTGTTTT ATTTTTGGCG ATACTTTTTG TATCGGTTTC GGCTTGTAAA 
AACCCGTTTT TTAAAAACAT GCTGGATAAG GATTCGGGGA GTGAAGGGAC CGGAGATTGG
AATTCTCAAA GTTCCGATGT AGGTTCCTTT GAAGACGCAG GGGACTTTGT AAAAATAATA
CCTCCTGCAA ACGGCATCGT AGGCGTTGCT CCTAACTACG CCTTACCCGG AAATCATGAT
TATTGGAAAG GTGTATTTAT TGCAGGGCGC ACGGTAAAAC TGAGCCCCTA TAAGATCGGC
AAAATGGAGG TAACCTATGA GCTATGGTAT AGTGTACTAA AATGGAATAC TGATAATGGT
AGGGGATACA TCTTTGCCAA TCAGGGAAGA GAAGGCAGTA ATGGAGGTGA AGGAGTAGCC
CCCACAGGTG CAAAAAAAGA GCCTGTAACA ATGATAAGCT GGCGAGACTG CATAGTGTGG
TGTAATGCGT ATACTGAAAA AGAAAAAGGA ATAGGCGAAT GCGTCTACCG CAAAAAGGAC
AATCATACGG TTGTATTAAA AGATGCGACG GCAACAGCTG CTTGTGATTC AGCCTATGCC
GATATGAATA AAAAAGGCTT TAGACTTCCG ACGGAAGCCG AGTGGGAATA TGCTGCCCGC
AGGCAGGGAA GCAATACTGA AAATGCGGCA CAATACGGCG ATGTATGGCT GACCAAATTA
AACAGTGCAA GCGGAGCCAA AGATAAATGG GATACGGCTG AAACAGGAGA GGTTGCATGG
TATAAAGGTA ATTCAGGAAA TAAAACTCAT CCGGTAGGAA AAAAGCGGGC AAATGCTCTA
GGTTTATACG ACATGTCGGG GAATGTCACC GAATGGTGTT TTGATTGGGA TGACACCATA
GCAGCAGAAA ATGTTACCGA TCCTCAAGGT GCCGCGTCGG GCTCTGCCCG TGTTGAACGC
GGCGGCAGCT GGCTCAACTA CGCGTACGGC TGCACTGTAG GCGTACGGTA CTGCGTCACT
CCTGGCAGCA GGAGCGACAA TCTTGGCTTC CGCCTGGCTT GTCGGCCATA G
 
Protein sequence
MKKFFVLFLA ILFVSVSACK NPFFKNMLDK DSGSEGTGDW NSQSSDVGSF EDAGDFVKII 
PPANGIVGVA PNYALPGNHD YWKGVFIAGR TVKLSPYKIG KMEVTYELWY SVLKWNTDNG
RGYIFANQGR EGSNGGEGVA PTGAKKEPVT MISWRDCIVW CNAYTEKEKG IGECVYRKKD
NHTVVLKDAT ATAACDSAYA DMNKKGFRLP TEAEWEYAAR RQGSNTENAA QYGDVWLTKL
NSASGAKDKW DTAETGEVAW YKGNSGNKTH PVGKKRANAL GLYDMSGNVT EWCFDWDDTI
AAENVTDPQG AASGSARVER GGSWLNYAYG CTVGVRYCVT PGSRSDNLGF RLACRP