Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TDE1637 |
Symbol | polA |
ID | 2739746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Treponema denticola ATCC 35405 |
Kingdom | Bacteria |
Replicon accession | NC_002967 |
Strand | + |
Start bp | 1686212 |
End bp | 1689022 |
Gene Length | 2811 bp |
Protein Length | 936 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637160520 |
Product | DNA polymerase I |
Protein accession | NP_972243 |
Protein GI | 42527145 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGATA CAATCTATGT TTTGGATGCC TACGGGCTTA TTTACCGCTC TTATTTTGCC TTTATTTCAA GACCTCTTAC AAACTCTAAG GGCGAAAATG TTTCGGCTAT CTTCGGCTTT TTTAAAAGCC TTCATTCCAT ATTTACCGAA TATAATCCTA AGCTCTTTGT TACTGCCCTC GATTCCCTTA CACCTACTTT TAGGCATGAG ATGTACAAAG AATACAAGGC TACAAGGGAT AAAACGCCCG ATGACCTCCA TGCGCAAATC GACAAAATAG AAGAAATTTT AAAAACATTT AAATTGCCTA CAGTCCGCTG TAACGGCTTT GAGGCCGATG ATGTAATAGC TTCCATAGCC GCCCTTGCAG AAAAGGAAGG CAGGGAATGT GTGGTTATTT CGGGCGACAA GGATTTAATG CAGCTTGTTT CAAAAACCAC GACAATGCTT AAACCGGGGA AGATTAAGGC TTGGGAGGGC TTCGATGCCG AAAATGTAAA AGAAGAATGG GGCGTTTATC CCGACGGAAT GTTAGATCTT CTTTCCCTCA TAGGCGACAG TGCCGACAAT GTTCCCGGCA TAAAGGGGGT CGGCCCGAAA ACAGCCGTAA AACTCCTTGA AGAGTATAAA AGCCTCGACG GCATCTACGC AAATACAGGA AACTTAAAGG GGGCTTTAAA AACAAAAATA GAAGAAGGAA AGGAATCGGC TTATTTTTCT AAAGAGCTTA TAAGGCTCCG CTTCGATGTT CCTGTCGAAA AAGACTTAAA CGCCTACTCT ACTTCGCAAA TGGACTATGA GGCGGCTGCC CGTCTTTTTA TAAGCGAAGA GCTTCCCAAT ATTGCAAAAC TATATTCCGA AAAAATAATT GCCGAAAAAA ATGCTCCATC TTCAAAAGAA AACTTAAAAC AAGAGACCGG TCTTTTTGAA AATTCGGAAC AAACTTCTCC CGAAATGCTC CCGCAAGAAT TAGGAACAGG AGAAGAAATC TCCCTTCCTC AAAATAAAGG TGATTACAAA CTTGTAGACG AAGCGGAAGA ACTTTTTAAA ATAGTAGACG AAGCCTTAAA ACAGGGCCTT GCATCATACG ACTGTGAAAC CACAAGTGAA GATCCCCTAA ATGCAGAAGT CTGCGGCTTC TCCCTTGCCC TAAAAGAAGG AGAGGCCTAT TATTTCCCCT TAAAGGCACC CTGTCCCGAA CTTGGAGAGG AAGCTCCAAA ACTCATAGCC TTTAAGGATG CTAAAAAGGC CGTAACAAAG CTCTTTGACT CAAAGATGAC CCTCATAATG CATAACGGCA AATTCGATAT TCAGGCAGCC CTTTCATCAA AACTTGCGAG CGGCATTTCG GCAAATCTTT TTGACACGAT GATAGCCGCA TGGCTTTTAG ACCCTGCCCG CTCTTCTTAC GGAATGGATA AACTTGCAGA AAGTATTTTA GGCGTAAAAA CCATAAGGTT TAAAGACCTT GTAAAGCAGG GACAAAACTT TTCGGATATT CCATTAAAAG AAGCCTGTCC CTATGCTGCA GAAGATGCCG ACATAACATT CCGCTTTTAT AAAAAATTCT TACCCCTCTT AAAAAAGAAT AATTTGGAAA AACTTTTCTT TGACCTTGAA ATGCCCATCA CAAAACTTTT AACCGAGATG GAAATAAAGG GTATCTTTTT AAAGGGAGAA GAACTTACGG CCTACTCAAA AGAATTGGGA AAAGAACTTG AAGATTGCGA AAAGGATATT TACCGCCTCG TGGGCCATGA GTTTAATATA GCCTCCCCTA AACAGCTTCA AGAAGTTTTA TTTGAAGAAA GAAAACTTAC CCCCGGCAAA AAAACTAAGA CGGGGTATTC TACGGATACT TCGGTCCTTG AAAACCTTGC TTCGGAAGAT CCCGTACCTG CAAAAATCTT GGATTACAGG GCTCTTGCAA AGTTAAAATC CACATACACC GATACCCTTC CCAAGATGAC GGACAAAAAC GGAAGAATCC ATACAAGTTT TATTCAAACG GGAACAGCCA CAGGCCGTCT TTCAAGCCGA GACCCCAATT TACAAAATAT CCCCATACGC GGAAACGAGG GGCGGAAGAT AAGGGAAGCC TTTCAGGCGG AAAAGGGGCG GGTTCTTATT TCTGCAGATT ATTCGCAGAT TGAGCTTGTA ATCCTTGCTC ATCTTTCAAA AGATCAAAAC CTAGTAGAAG CCTTTAATAC GGGAATAGAC GTTCACGCCA AGACTGCAAG CCTAATCTTT GCCGTAGACA TAAAAGATGT AAGTCAGGAT ATGAGACGCA TAGCAAAGAC CATAAACTTC GGCGTAATGT ACGGCATGAG CGCCTTCCGC CTTGCTTCTT CTTTAAGAAT TCCTCGAAAA AGAGCTGATG AGTTTATAAA GGCTTATTTT GCCACATACT CGGGCGTATC CGGCTTTATG GCAAATGTTT GTCAAGAAGC CGAACAAAGA GGCTATGTAG AAACCTTAAT GGGAAGAAGG CGTTATCTTC CGGCTATAAA CAGTAAAAAC AAGGTAGAAA AGGCGGGGGC CGAACGCATT GCGGTAAACA CCCCGATTCA GGGCACGGCC GCCGATATAG TAAAACTTGC AATGCTTGAA GTCGATAAAG CCTTAAAAAA ACAAAAACTT GATGCTTCCA TACTTTTGCA GGTTCATGAT GAGCTTATAA TAGAGGCAGC CGAATCTGAA AGAGAAAAAG TCATGTCCCT CGTAAAAGAA AAAATGGAAG GCGTAATCAA ACTTTCGGTA CCCTTAAGGG TAAGTATTGA ATCGGGAATG AGCTGGGGAG AGTTCCACTA A
|
Protein sequence | MKDTIYVLDA YGLIYRSYFA FISRPLTNSK GENVSAIFGF FKSLHSIFTE YNPKLFVTAL DSLTPTFRHE MYKEYKATRD KTPDDLHAQI DKIEEILKTF KLPTVRCNGF EADDVIASIA ALAEKEGREC VVISGDKDLM QLVSKTTTML KPGKIKAWEG FDAENVKEEW GVYPDGMLDL LSLIGDSADN VPGIKGVGPK TAVKLLEEYK SLDGIYANTG NLKGALKTKI EEGKESAYFS KELIRLRFDV PVEKDLNAYS TSQMDYEAAA RLFISEELPN IAKLYSEKII AEKNAPSSKE NLKQETGLFE NSEQTSPEML PQELGTGEEI SLPQNKGDYK LVDEAEELFK IVDEALKQGL ASYDCETTSE DPLNAEVCGF SLALKEGEAY YFPLKAPCPE LGEEAPKLIA FKDAKKAVTK LFDSKMTLIM HNGKFDIQAA LSSKLASGIS ANLFDTMIAA WLLDPARSSY GMDKLAESIL GVKTIRFKDL VKQGQNFSDI PLKEACPYAA EDADITFRFY KKFLPLLKKN NLEKLFFDLE MPITKLLTEM EIKGIFLKGE ELTAYSKELG KELEDCEKDI YRLVGHEFNI ASPKQLQEVL FEERKLTPGK KTKTGYSTDT SVLENLASED PVPAKILDYR ALAKLKSTYT DTLPKMTDKN GRIHTSFIQT GTATGRLSSR DPNLQNIPIR GNEGRKIREA FQAEKGRVLI SADYSQIELV ILAHLSKDQN LVEAFNTGID VHAKTASLIF AVDIKDVSQD MRRIAKTINF GVMYGMSAFR LASSLRIPRK RADEFIKAYF ATYSGVSGFM ANVCQEAEQR GYVETLMGRR RYLPAINSKN KVEKAGAERI AVNTPIQGTA ADIVKLAMLE VDKALKKQKL DASILLQVHD ELIIEAAESE REKVMSLVKE KMEGVIKLSV PLRVSIESGM SWGEFH
|
| |