Gene TDE1637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTDE1637 
SymbolpolA 
ID2739746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTreponema denticola ATCC 35405 
KingdomBacteria 
Replicon accessionNC_002967 
Strand
Start bp1686212 
End bp1689022 
Gene Length2811 bp 
Protein Length936 aa 
Translation table11 
GC content42% 
IMG OID637160520 
ProductDNA polymerase I 
Protein accessionNP_972243 
Protein GI42527145 
COG category[L] Replication, recombination and repair 
COG ID[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGATA CAATCTATGT TTTGGATGCC TACGGGCTTA TTTACCGCTC TTATTTTGCC 
TTTATTTCAA GACCTCTTAC AAACTCTAAG GGCGAAAATG TTTCGGCTAT CTTCGGCTTT
TTTAAAAGCC TTCATTCCAT ATTTACCGAA TATAATCCTA AGCTCTTTGT TACTGCCCTC
GATTCCCTTA CACCTACTTT TAGGCATGAG ATGTACAAAG AATACAAGGC TACAAGGGAT
AAAACGCCCG ATGACCTCCA TGCGCAAATC GACAAAATAG AAGAAATTTT AAAAACATTT
AAATTGCCTA CAGTCCGCTG TAACGGCTTT GAGGCCGATG ATGTAATAGC TTCCATAGCC
GCCCTTGCAG AAAAGGAAGG CAGGGAATGT GTGGTTATTT CGGGCGACAA GGATTTAATG
CAGCTTGTTT CAAAAACCAC GACAATGCTT AAACCGGGGA AGATTAAGGC TTGGGAGGGC
TTCGATGCCG AAAATGTAAA AGAAGAATGG GGCGTTTATC CCGACGGAAT GTTAGATCTT
CTTTCCCTCA TAGGCGACAG TGCCGACAAT GTTCCCGGCA TAAAGGGGGT CGGCCCGAAA
ACAGCCGTAA AACTCCTTGA AGAGTATAAA AGCCTCGACG GCATCTACGC AAATACAGGA
AACTTAAAGG GGGCTTTAAA AACAAAAATA GAAGAAGGAA AGGAATCGGC TTATTTTTCT
AAAGAGCTTA TAAGGCTCCG CTTCGATGTT CCTGTCGAAA AAGACTTAAA CGCCTACTCT
ACTTCGCAAA TGGACTATGA GGCGGCTGCC CGTCTTTTTA TAAGCGAAGA GCTTCCCAAT
ATTGCAAAAC TATATTCCGA AAAAATAATT GCCGAAAAAA ATGCTCCATC TTCAAAAGAA
AACTTAAAAC AAGAGACCGG TCTTTTTGAA AATTCGGAAC AAACTTCTCC CGAAATGCTC
CCGCAAGAAT TAGGAACAGG AGAAGAAATC TCCCTTCCTC AAAATAAAGG TGATTACAAA
CTTGTAGACG AAGCGGAAGA ACTTTTTAAA ATAGTAGACG AAGCCTTAAA ACAGGGCCTT
GCATCATACG ACTGTGAAAC CACAAGTGAA GATCCCCTAA ATGCAGAAGT CTGCGGCTTC
TCCCTTGCCC TAAAAGAAGG AGAGGCCTAT TATTTCCCCT TAAAGGCACC CTGTCCCGAA
CTTGGAGAGG AAGCTCCAAA ACTCATAGCC TTTAAGGATG CTAAAAAGGC CGTAACAAAG
CTCTTTGACT CAAAGATGAC CCTCATAATG CATAACGGCA AATTCGATAT TCAGGCAGCC
CTTTCATCAA AACTTGCGAG CGGCATTTCG GCAAATCTTT TTGACACGAT GATAGCCGCA
TGGCTTTTAG ACCCTGCCCG CTCTTCTTAC GGAATGGATA AACTTGCAGA AAGTATTTTA
GGCGTAAAAA CCATAAGGTT TAAAGACCTT GTAAAGCAGG GACAAAACTT TTCGGATATT
CCATTAAAAG AAGCCTGTCC CTATGCTGCA GAAGATGCCG ACATAACATT CCGCTTTTAT
AAAAAATTCT TACCCCTCTT AAAAAAGAAT AATTTGGAAA AACTTTTCTT TGACCTTGAA
ATGCCCATCA CAAAACTTTT AACCGAGATG GAAATAAAGG GTATCTTTTT AAAGGGAGAA
GAACTTACGG CCTACTCAAA AGAATTGGGA AAAGAACTTG AAGATTGCGA AAAGGATATT
TACCGCCTCG TGGGCCATGA GTTTAATATA GCCTCCCCTA AACAGCTTCA AGAAGTTTTA
TTTGAAGAAA GAAAACTTAC CCCCGGCAAA AAAACTAAGA CGGGGTATTC TACGGATACT
TCGGTCCTTG AAAACCTTGC TTCGGAAGAT CCCGTACCTG CAAAAATCTT GGATTACAGG
GCTCTTGCAA AGTTAAAATC CACATACACC GATACCCTTC CCAAGATGAC GGACAAAAAC
GGAAGAATCC ATACAAGTTT TATTCAAACG GGAACAGCCA CAGGCCGTCT TTCAAGCCGA
GACCCCAATT TACAAAATAT CCCCATACGC GGAAACGAGG GGCGGAAGAT AAGGGAAGCC
TTTCAGGCGG AAAAGGGGCG GGTTCTTATT TCTGCAGATT ATTCGCAGAT TGAGCTTGTA
ATCCTTGCTC ATCTTTCAAA AGATCAAAAC CTAGTAGAAG CCTTTAATAC GGGAATAGAC
GTTCACGCCA AGACTGCAAG CCTAATCTTT GCCGTAGACA TAAAAGATGT AAGTCAGGAT
ATGAGACGCA TAGCAAAGAC CATAAACTTC GGCGTAATGT ACGGCATGAG CGCCTTCCGC
CTTGCTTCTT CTTTAAGAAT TCCTCGAAAA AGAGCTGATG AGTTTATAAA GGCTTATTTT
GCCACATACT CGGGCGTATC CGGCTTTATG GCAAATGTTT GTCAAGAAGC CGAACAAAGA
GGCTATGTAG AAACCTTAAT GGGAAGAAGG CGTTATCTTC CGGCTATAAA CAGTAAAAAC
AAGGTAGAAA AGGCGGGGGC CGAACGCATT GCGGTAAACA CCCCGATTCA GGGCACGGCC
GCCGATATAG TAAAACTTGC AATGCTTGAA GTCGATAAAG CCTTAAAAAA ACAAAAACTT
GATGCTTCCA TACTTTTGCA GGTTCATGAT GAGCTTATAA TAGAGGCAGC CGAATCTGAA
AGAGAAAAAG TCATGTCCCT CGTAAAAGAA AAAATGGAAG GCGTAATCAA ACTTTCGGTA
CCCTTAAGGG TAAGTATTGA ATCGGGAATG AGCTGGGGAG AGTTCCACTA A
 
Protein sequence
MKDTIYVLDA YGLIYRSYFA FISRPLTNSK GENVSAIFGF FKSLHSIFTE YNPKLFVTAL 
DSLTPTFRHE MYKEYKATRD KTPDDLHAQI DKIEEILKTF KLPTVRCNGF EADDVIASIA
ALAEKEGREC VVISGDKDLM QLVSKTTTML KPGKIKAWEG FDAENVKEEW GVYPDGMLDL
LSLIGDSADN VPGIKGVGPK TAVKLLEEYK SLDGIYANTG NLKGALKTKI EEGKESAYFS
KELIRLRFDV PVEKDLNAYS TSQMDYEAAA RLFISEELPN IAKLYSEKII AEKNAPSSKE
NLKQETGLFE NSEQTSPEML PQELGTGEEI SLPQNKGDYK LVDEAEELFK IVDEALKQGL
ASYDCETTSE DPLNAEVCGF SLALKEGEAY YFPLKAPCPE LGEEAPKLIA FKDAKKAVTK
LFDSKMTLIM HNGKFDIQAA LSSKLASGIS ANLFDTMIAA WLLDPARSSY GMDKLAESIL
GVKTIRFKDL VKQGQNFSDI PLKEACPYAA EDADITFRFY KKFLPLLKKN NLEKLFFDLE
MPITKLLTEM EIKGIFLKGE ELTAYSKELG KELEDCEKDI YRLVGHEFNI ASPKQLQEVL
FEERKLTPGK KTKTGYSTDT SVLENLASED PVPAKILDYR ALAKLKSTYT DTLPKMTDKN
GRIHTSFIQT GTATGRLSSR DPNLQNIPIR GNEGRKIREA FQAEKGRVLI SADYSQIELV
ILAHLSKDQN LVEAFNTGID VHAKTASLIF AVDIKDVSQD MRRIAKTINF GVMYGMSAFR
LASSLRIPRK RADEFIKAYF ATYSGVSGFM ANVCQEAEQR GYVETLMGRR RYLPAINSKN
KVEKAGAERI AVNTPIQGTA ADIVKLAMLE VDKALKKQKL DASILLQVHD ELIIEAAESE
REKVMSLVKE KMEGVIKLSV PLRVSIESGM SWGEFH