Gene Information |
Locus tag | TDE1731 |
Symbol | |
ID | 2739189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Treponema denticola ATCC 35405 |
Kingdom | Bacteria |
Replicon accession | NC_002967 |
Strand | - |
Start bp | 1785622 |
End bp | 1787025 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637160614 |
Product | hypothetical protein |
Protein accession | NP_972335 |
Protein GI | 42527237 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2271] Sugar phosphate permease |
TIGRFAM ID | |
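The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon; neither convention is stated in the record. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1785622, 1787025)
print(length, protein_length_aa(length))  # 1404 467
```

Under these assumptions the result agrees with the Gene Length (1404 bp) and Protein Length (467 aa) fields.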
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00706224 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTAAAA ACACGGATTT AAAAAAATAT GCTCTTATCG GTACAAAGCG CTTTTGGTTT TTAATGTGGG GCTTGGGCCT TGCAGGGCAG TTATGCTGGA ACATCGAAAA TCAGTGGTTT AATACCTTCG TATATGCTAA AATAGCAAAG GACTCTTCGA TAGTAACCCT TATGGTTATT ACGAGTGCAT TAGTAACAAC CTTTTCAACT TTTTTATTCG GTACCTTATC CGATAGGATA GGCTCAAGGC GACGATTTGT TTCGATAGGC TACATCGTCT GGGGGCTTAC CACCATCCTT TTCGGCTTGA CCGAATTTGT AGGAAGAGGA CAGGTCGGAA CAGGGGCTAA GGTATCTGTA TGGGCTGCCG TTCTTGTTAT CCTTGCCGAT GATTTTATGA GTTTTTTCGG TTCGATGGGA AACGATTCGG GCTACAACGC ATGGAGCAAC GATATGACAA CGGATAAAAA CCGCGGACAG GTTGGGGCTG TGCTTGCAAT TCAGCCCGTT ATAGGTACTA TAGTCGGCAC TGTGCTGGGC GGTCTTTTAA TAGGAGCCGA AAATAATTAC CAGCGTCTTT TTTGGTCTAT GGGTTTGTTT GTCATAGGTA CCGGTCTTAT TTCACTTTTT TTGTTAAAGG ATGCTCCCGA TTTAAAACCG CATAAGGAGG GTTCTTTTGG CAAACAGTTT GCTGCGATTT TTAAGGCTGA AGGCTTTTTT TCGCATAAAG AACTGATGCT CGCCTGTATA ACGACAGCCG TCTTTTTTAT TTCGTTTAAT GTTTATTTTG TTCACATGGG AAACTGGATG ATTTACCGCA TGGGCTTTGA TGCTGCCCGC ATGGGAATAA TTCAGGGTTT AAGTTTGCTT GCTGCCTCCT TATCGGTTAT TCCTGCAATA GGCCTTATAA ATAAAAGCCG AACGCCGCGC CTTGCCGCAT TTGCAATTAT TCTTAACAGT TTGGGCCTTT GTATTTTATC TCTTTTTATA AAGCCGGCCT CGGTTAACCC TGATGCAGTG TTCAGCCTGA AAAATATTTC TCTTTTTTTG TCCGTATTTT TAGCCGGTAC GGGACAGATT CTTGTAACCC AATCGATGAC TATGTGGGTA AAGGAGCTAT ACCCCGAAAC CTCTCGAGGA CAGTTTGAAG GTATGCGTAT TCTCTTTTTT GTGTTGACTC CGATGATTAT CGGAACAATA ATAGGAAATT TTTTGGTAAA AAATGGGGCA GGCTCGGTGG TAAACGAATT CGGCATAACC GAAAATATTC CCGTAGAGTC TATCTATGTA TGTGGTGCCG TTTTAGCCCT CTGCGCCTTT ATTCCACTTT ACTTTGCATC TAAGCTTTAC AATAAGCGAA AAAACAATGA GCCTAAGGAC ACTGAATCCT ATGAGAAAAA ATAA
Protein sequence | MLKNTDLKKY ALIGTKRFWF LMWGLGLAGQ LCWNIENQWF NTFVYAKIAK DSSIVTLMVI TSALVTTFST FLFGTLSDRI GSRRRFVSIG YIVWGLTTIL FGLTEFVGRG QVGTGAKVSV WAAVLVILAD DFMSFFGSMG NDSGYNAWSN DMTTDKNRGQ VGAVLAIQPV IGTIVGTVLG GLLIGAENNY QRLFWSMGLF VIGTGLISLF LLKDAPDLKP HKEGSFGKQF AAIFKAEGFF SHKELMLACI TTAVFFISFN VYFVHMGNWM IYRMGFDAAR MGIIQGLSLL AASLSVIPAI GLINKSRTPR LAAFAIILNS LGLCILSLFI KPASVNPDAV FSLKNISLFL SVFLAGTGQI LVTQSMTMWV KELYPETSRG QFEGMRILFF VLTPMIIGTI IGNFLVKNGA GSVVNEFGIT ENIPVESIYV CGAVLALCAF IPLYFASKLY NKRKNNEPKD TESYEKK
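The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field might be normalized before analysis, and how a GC percentage could be computed from it, is shown below. The helper names are hypothetical, and the record does not say how its GC content field was calculated or rounded, so this is only one plausible approach.

```python
def normalize_sequence(grouped: str) -> str:
    """Remove the display whitespace from a block-grouped sequence and uppercase it."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence, used as a short sample.
sample = "ATGCTTAAAA ACACGGATTT"
seq = normalize_sequence(sample)
print(len(seq), gc_percent(seq))  # 20 30.0
```

Applying the same two functions to the full Gene sequence field would give a value to compare against the GC content field.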