Gene Information |
Locus tag | TDE1933 |
Symbol | |
ID | 2740173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Treponema denticola ATCC 35405 |
Kingdom | Bacteria |
Replicon accession | NC_002967 |
Strand | - |
Start bp | 1949690 |
End bp | 1952701 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637160821 |
Product | hypothetical protein |
Protein accession | NP_972536 |
Protein GI | 42527438 |
COG category | |
COG ID | |
TIGRFAM ID | |
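
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1949690, 1952701)
print(length, protein_length_aa(length))  # 3012 1003
```

Under those assumptions the result agrees with the reported gene length of 3012 bp and protein length of 1003 aa.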
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAC CTTTAAATAT TTTAATAGCG GCTGTTGCCG TAATTTTACT TTTTACAGCT TGTCAACAAT TTCTGGAAGA TCCGGAAGAT TTTTTAAGCT ATTGGGCGAG CGAGGCTTTT GTCAAAGACC ATAGCATAGG TTCGGCACAT AGGCCGGATA ATGCAGGTGT GCCGTGTGTC GGCTCTTCGG AGCCCGTAGA TATTACGCTC TCAGTGCATA ACCCGAAAGG CTTTTCGTTT GTTATGCCTG CTTCTTCGGC ACCGGCAGGC ATTGTCGAAT TTAAAGAACT TTCTTCTCAG CCTACGGCAG AAACCGACTA TGCACTGGAG CGAACCGGTT TCGGCGCGCT GAAGCTTACA TATAAACCGT CCCTTTTACA AAAATACGAA CAAGGTTCAG GCAGTTTAAA CCCGACCATC ACCCTAAAAG CTAAAGACGG CAGGGTATTT AAGCAAACCT ACACCTTCGG CATAAAATCG AACACCCCGC CGCCAAAACC GGCAGTCGTG CTTGCAAAAC AGACTAATGC TTCTCCTGAG CCTTCGTATT ACGTGCTCTG CATAGATACA AAAGATTTGG TTGATACTTC TGCCTTTGTA GGCGGAAAAT ATATCCATGA TGATATTGCA TACGTTACCG TAAACGGAAT ACGATACAAC TTAAAAATGA ATGATGCCCA TAACGGTTTT AGCAAAAAAC CGGCGGCACC CTCATTTCTT GAAAACGGTA CAGGGCTTAT GGCGGTCGGC AGTGAATCGC TGCCTACAGC GAGCCATTGG GTTTTATACT ACAAAACGAA CATAAGAATA GGAGACGGCA ACCCGCCTAC AACATACCAT ATAACCTTGA TCGACAACGA AGGCGTTACT TCGGATGAGG CTGTCAAAAC GATAGAAGCA AGCGGTAAGA CTCACACCGT AACATTCAGC GTTGTAGACG GAACTGGAGG AACGCTTACA GCAAAAGTTG ATGGTAGCAA TATTCAGTCG GGCAGTGAGG TTGGACGCGG AAAGACGGTA ACCTTCACTG CGAATCCCGA TACGGTAAAC GGTTATGAGG TTGAAAAATG GACGCTTGAC GACAACGAAG TATCCGGTCA TACAAGTACG GAGTATACGC TTTACAACAT AACCGACAAT GCAACCGTAA CGGTAAAGTT TAAAAAGAAG ATTTACACCG TAACCTACCG CGTAGAAATC GTAGACGGAG AAGCGGGAGG AAAGATTAAA GCCGATTCAG GTAACTTTGT AGAAAACGGC AGCACATCAG TTGAATACGG CGGAAGTGTA ACCTTTACCG CGCACCCAAC CAATACAGAC TGGAAAGTTG CGGAATGGAA AAGGGACAAT GCCGAAGAAA ACGGAACGAA TGGCACCTAT ACGATTTCTG TAACCGCTGC TACAACCGTA ACGGTGAAGT TTTACCAGTC GACCTTAAAG AATCCGGCAA CGTGGAAAGA CCTTGCGCGT GCGGTTAAAA GCGCACCCGA TAACGCTGTC ATCACCATAA ACGGTGAAAT ACAGGCGACA GATGTCGCAA ACAATGCGAG CAGAATTAAG GTCCAAAAAA ACCTTACCAT AAAAGGCGAA AACAACGCCA TTTTGAACGC AGTCGGCAAG CAGGGCATCT TTGATGTGTT TAGAACATTC ACCCTTCAAG ACATAACGCT TAAAAATAGT GCACTGCCCT CCGAGTACTC AGGCGGTGCC GGCGTATACG TAAACCCGTC CGGTACACTC ATTATGAAAG GCTCAAGCGT TATCACCAAG TGCTCGGCGG CGAATTCCGG CGGTGGCGTG 
TATGTAGGCG GCGGAACCTT TGAAATGCAC GACACAAGCG CTATCACCGG CTGCACGGCA AGTAAAGGCG GCGGCGTTTA TGTAAGCGGC GGAATCTTTA AAATGCAAGA TTCCGCAATC GTTACCCCCT CTACCGGATC TGAGCAATAT ACGGCGGGTA AAAACGATGT GTATTTGGAG AGCGGAAAGA TGATAACCGT TGACGGCACA TTGTCAAACA ACCCCGCCGC GCGCATTACG GTACCGGACA ACAAATACCA GTCGACTACC AAAGTGCTTG ACGGAAGCGC AGTCGGTTCG GAACACGCCA AGTTTGCCGT AACGCCGGAG AAAGTTACGG AGGACTCGGA GAACTGGAAT GTATTTTGGT ATGTTGACGC TGGCGGTAGA TTGAAAGCAG AATTTGAGGA TCCTTCACTG TTGCAGGAAG TAATTAGTAG TAGACACGAC AATACGCCCT TTATTATAAA GCTGGGGAAT ATCAGCAACC TTACAAATGT TGGAATACCA GGTAATAAAA AGATTATGCT CAAGGCTGAT AGAGAGGTAA CCTTAACATG TCCTCAGAAG TGGAACGACC TTAAACACTT ACAGGTATAC GAAAATGCCT CATTAACATT AAAAGGGCCG ATAACACTGC AAGGCAAGGA CTACGGCATT AATTCTCAAT ACGCGCTTTA TGTAGAAAAG AACGGTAAGG CAGAAATCAA AGACGGCGTA ACAATTACCG GGTTTAAAAA CGCCGGCAGA GGTACGGTAT TTGCAGACGG AGACCTTACG ATGACAGGCG GAACAATTAC CGGCAACAAA GCCGACAAAG GCGGCGGCGT ATATATAGCT CCCAACAGAA GTTTTACGAT GAAAGGCGGA AGCATTAAGG GAAATACGGC TGGAAATGGC GGCGGCGTAT TTGTGGACTT TGACAGCGAC TATTATATGT ACGGAACTTT TACGATGTCA GGCGGAAGGA TTGAAGGCAA CACAGCCGAC CATGGCGGCG GCGTATTTAC ACATGGAAGT TTTACGATGT CAGGCGGAAT AATTACCAAG AATAAAGCAA ATACTGACGG CAAGGCAGTG ATGCTTGATC ACTACTTTGA TTGGGAAGAC GGCGAGATAA AGGACAACAA AGAAGGCAAC GGCGCGGTAA TTGGAGGCAA TGTCTCTCGC TATCTTAGAA AAGCCCCCGG TAATAGTAAT ACGGAAAGCT AA
|
Protein sequence | MKRPLNILIA AVAVILLFTA CQQFLEDPED FLSYWASEAF VKDHSIGSAH RPDNAGVPCV GSSEPVDITL SVHNPKGFSF VMPASSAPAG IVEFKELSSQ PTAETDYALE RTGFGALKLT YKPSLLQKYE QGSGSLNPTI TLKAKDGRVF KQTYTFGIKS NTPPPKPAVV LAKQTNASPE PSYYVLCIDT KDLVDTSAFV GGKYIHDDIA YVTVNGIRYN LKMNDAHNGF SKKPAAPSFL ENGTGLMAVG SESLPTASHW VLYYKTNIRI GDGNPPTTYH ITLIDNEGVT SDEAVKTIEA SGKTHTVTFS VVDGTGGTLT AKVDGSNIQS GSEVGRGKTV TFTANPDTVN GYEVEKWTLD DNEVSGHTST EYTLYNITDN ATVTVKFKKK IYTVTYRVEI VDGEAGGKIK ADSGNFVENG STSVEYGGSV TFTAHPTNTD WKVAEWKRDN AEENGTNGTY TISVTAATTV TVKFYQSTLK NPATWKDLAR AVKSAPDNAV ITINGEIQAT DVANNASRIK VQKNLTIKGE NNAILNAVGK QGIFDVFRTF TLQDITLKNS ALPSEYSGGA GVYVNPSGTL IMKGSSVITK CSAANSGGGV YVGGGTFEMH DTSAITGCTA SKGGGVYVSG GIFKMQDSAI VTPSTGSEQY TAGKNDVYLE SGKMITVDGT LSNNPAARIT VPDNKYQSTT KVLDGSAVGS EHAKFAVTPE KVTEDSENWN VFWYVDAGGR LKAEFEDPSL LQEVISSRHD NTPFIIKLGN ISNLTNVGIP GNKKIMLKAD REVTLTCPQK WNDLKHLQVY ENASLTLKGP ITLQGKDYGI NSQYALYVEK NGKAEIKDGV TITGFKNAGR GTVFADGDLT MTGGTITGNK ADKGGGVYIA PNRSFTMKGG SIKGNTAGNG GGVFVDFDSD YYMYGTFTMS GGRIEGNTAD HGGGVFTHGS FTMSGGIITK NKANTDGKAV MLDHYFDWED GEIKDNKEGN GAVIGGNVSR YLRKAPGNSN TES
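
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 for elongation. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Codons enumerated in TCAG order (first base varies slowest), paired with
# the standard amino-acid assignments; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence as listed in this record.
print(translate("ATGAAAAGAC CTTTAAATAT TTTAATAGCG"))  # MKRPLNILIA
```

The first ten residues match the start of the listed protein sequence, and translating the final twelve bases (`ACGGAAAGCT AA`) gives `TES`, matching its end.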
|
| |