Gene Information |
Locus tag | Cthe_0886 |
Symbol | |
ID | 4810504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1061395 |
End bp | 1064079 |
Gene Length | 2685 bp |
Protein Length | 894 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106302 |
Product | DNA polymerase I |
Protein accession | YP_001037313 |
Protein GI | 125973403 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
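The length fields in this table can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the reported protein length excludes the stop codon. Neither convention is stated in the record, but both are consistent with the listed values.

```python
# Values copied from the Gene Information table.
start_bp = 1061395
end_bp = 1064079

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2685
print(protein_length)  # 894
```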
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAGC AAAAATTGAT GGCCATAGAC GGAAACAGTA TTCTTAACAG GGCTTTTTAC GGGCTTCCCG AACTTCTGAC AACATCCGAC GGGATATATA CCAACGGAAT TTATGTTTTT TTAAATATAA TGCATAAATT TATTGAAGAG GAAAATCCCG AGTACATTTG CGTTGCGTTC GACCTTAAGG CTCCGACTTT CAGGCACAAT AAATACGAAG GTTACAAGGC AAACAGGAAA GGAATGCCGG AAGAGCTCCG GGTTCAGGTT CCCCTGCTTA AGGAAGTTTT GGATGCAATG AATATAAAAA GACTTGAGAT GGAGGGGTTC GAGGCTGACG ACATACTTGG TTCCGTTTCC TTGTGCGCCG AAAAAAAGGG TCTGGAAGTA ATACTGGTTA CAGGGGACAG GGATGCTTTT CAATTGATCG GTCCTTCCAC AAGGCTGAAA CTTCCGCGGA CGAGAGGCGG GAAAACAGAG GTTGAGGAAT ATGACTACAA CAAGATTGTG GAGGTCTACG GAATCAAGCC GGAACAGTTT GTTGACGTCA AGGCTTTGGC GGGAGATACT TCCGACAATA TTCCCGGTGT TCCGGGTATC GGCGAAAAGA CGGCCCTGGC TCTCATAAAA GAATACAACA ATCTTGAAAA CCTTTATAAT TCATTGGACA GCATTAAAAA GAAAGGACTT AGGGAAAAGC TTGAAACTTT TAAGGAGCAG GCTTTTCTGA GCAGGGAGCT TGCCCTGATT GAAAGAAACA TGCCGTCCCT TTGTGATATT GAAGAGCTGA AAAGAGTGGA GATTGACAGG GAAAAAACCT ATGAGATATT TAAGAGGCTG GAATTTAGAA GCTTTATTGA CAAGTTTGGA TTGAACGATG TCCAAATCCA AAATACCGTG GAACTGAATG TGAAAATCGC AAAAAACGCC AGTGAACTTG AGAGTTTGAA AAACAATATT CTCAAGTCCA GAAAAGTTTG TATTTATCAT TTGATTGACA AAACGGGCAG CTTTTCTCAA AAGCTTGCCG CCATTGCAAT TTCGCCCGTG GAGGATGAAG CATGGTATTT GGATTTTACC AATAATATTG ATGAAGATGA GTTTTTCAGG CAGTTTAAGG ACGTTTTGGA GGATGGAAAT ATAAAGAAAT ACGGGCATGA TTTGAAAAAT TTTATAGTAT ATTTAAATAA TCGGGGAATT GATTTTAACG GTTTGGCTTT TGACACAATG ATTGGAGCTT ATATAATAAA CCCGTCAAAG GAGACCTATA CGATATCCGA GCTGGCACAG GAGTATTTAA ACTTGAGTGT AAAGGCGGTT GAGGAACTTG CGGGCAAGGG CAAAAGCTTT ACTTTGTTTA AGGACATGCA GCCTGACGTT CTTTCAAAGA CTGTTGGTGT TTATCCTCAT GTTATAAGCA AAGTAAGCCG GAAAATTGAC AGCCTTCTTA AAGAAAACAA CCAGGAGAGG CTTTATTATG ACATTGAGCT TCCGCTGGTG CGGACCTTGG CGGATATGGA GTATTACGGA TTCAAGGTTA ATGTCGATGC TCTTGTGGAA TTTTCGAAAG AGCTTCAGGA AAAGATAGAT GTTGTAACAA AAGAAATATA CACTTTGGCG GGAGAAGAGT TCAATATCAA TTCTCCGAAA CAGCTGGGAG TTATTTTGTT TGAGAAACTG GGTCTTCCCA TTATTAAGAA AACAAAAACC GGATATTCAA CCGATGCTGA AGTATTGGAA GAGCTTTCCG ACAGGCATGA AATAGTGGAA AAAATACTGG AATACAGACA GCTTGTAAAG 
CTGAAATCCA CTTATGCGGA AGGCCTTTTG GCGGTTATAA ATCCTTACAC GGGAAAGATT CATTCAAGTT TCAACCAGAC AGTGACGGCT ACGGGAAGAA TAAGCAGTAC AGAGCCAAAT CTTCAGAATA TACCGATAAA ACTTGAAATG GGCAGGAAAA TACGAAAAGT TTTTATACCT TCGGATGAAA ACTATCTGCT TCTTGATGCG GACTATTCCC AGATAGAGCT TCGGGTTCTG GCCCACATAA CCAATGACGA AAACATGATA AATGCGTTTT TAAACAACGA AGACATTCAT ACTTCCACGG CTGCATCGGT CTTTGGAATA CCAAAAGAGG AAGTTACCCC TCTCATGAGG TCCAGAGCGA AAGCTGTCAA TTTCGGTATT GTATACGGTA TAGGGGACTT CAGTCTTGCA AAGGATCTTA AGATAAGCAG AAAGGAAGCC AGAGCATATA TAGACGGTTA TCTGGACAGA TATCCAAATG TAAAGAAATA TATGCATGAT ATTGTGGAAG AGGGAAAAGA AAAAGGTTTT GTAACCACCA TGTTCATGAG AAGAAGGTAC CTTCCTGAGC TTAAATCGCG CAACTTCAAC ATACGGTCTT TTGGAGAACG GGTTGCGATG AACACCCCGA TACAGGGAAG TGCCGCGGAT ATAATCAAGA TTGCCATGGT AAAGGTGCAT GGAGAGCTTA AAAAAAGAAA GCTTAAATCC AGGCTGATAC TTCAGGTTCA CGATGAACTT ATTGTAGAGA CGTTCAAGGA TGAAAAAGAA GAGGTGGAAA AGATTTTACT TGAAGGCATG CAAAATGCCG TAAGTCTGAA AGTGCCGCTG GTTGTGGAGA TTAAATCGGG CAGCAACTGG TATGAGACAA AGTAA
|
Protein sequence | MSKQKLMAID GNSILNRAFY GLPELLTTSD GIYTNGIYVF LNIMHKFIEE ENPEYICVAF DLKAPTFRHN KYEGYKANRK GMPEELRVQV PLLKEVLDAM NIKRLEMEGF EADDILGSVS LCAEKKGLEV ILVTGDRDAF QLIGPSTRLK LPRTRGGKTE VEEYDYNKIV EVYGIKPEQF VDVKALAGDT SDNIPGVPGI GEKTALALIK EYNNLENLYN SLDSIKKKGL REKLETFKEQ AFLSRELALI ERNMPSLCDI EELKRVEIDR EKTYEIFKRL EFRSFIDKFG LNDVQIQNTV ELNVKIAKNA SELESLKNNI LKSRKVCIYH LIDKTGSFSQ KLAAIAISPV EDEAWYLDFT NNIDEDEFFR QFKDVLEDGN IKKYGHDLKN FIVYLNNRGI DFNGLAFDTM IGAYIINPSK ETYTISELAQ EYLNLSVKAV EELAGKGKSF TLFKDMQPDV LSKTVGVYPH VISKVSRKID SLLKENNQER LYYDIELPLV RTLADMEYYG FKVNVDALVE FSKELQEKID VVTKEIYTLA GEEFNINSPK QLGVILFEKL GLPIIKKTKT GYSTDAEVLE ELSDRHEIVE KILEYRQLVK LKSTYAEGLL AVINPYTGKI HSSFNQTVTA TGRISSTEPN LQNIPIKLEM GRKIRKVFIP SDENYLLLDA DYSQIELRVL AHITNDENMI NAFLNNEDIH TSTAASVFGI PKEEVTPLMR SRAKAVNFGI VYGIGDFSLA KDLKISRKEA RAYIDGYLDR YPNVKKYMHD IVEEGKEKGF VTTMFMRRRY LPELKSRNFN IRSFGERVAM NTPIQGSAAD IIKIAMVKVH GELKKRKLKS RLILQVHDEL IVETFKDEKE EVEKILLEGM QNAVSLKVPL VVEIKSGSNW YETK
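The relationship between the two sequences can be spot-checked by translating short stretches of the gene sequence. The sketch below is a minimal illustration, not the database's own method. The `translate` helper and the variable names are hypothetical, and the codon table is built by hand from the standard amino acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons). The gene sequence as listed begins with ATG, so it appears to be given in coding orientation already, even though the gene lies on the minus strand; no reverse complement is applied here.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 and last 15 bases of the gene sequence, copied from the record.
first_30 = "ATGAGCAAGC AAAAATTGAT GGCCATAGAC"
last_15 = "TATGAGACAA AGTAA"

print(translate(first_30))  # MSKQKLMAID
print(translate(last_15))   # YETK
```

The two outputs match the first ten and last four residues of the protein sequence in the record.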