Gene Cthe_0886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0886 
Symbol 
ID4810504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1061395 
End bp1064079 
Gene Length2685 bp 
Protein Length894 aa 
Translation table11 
GC content41% 
IMG OID640106302 
ProductDNA polymerase I 
Protein accessionYP_001037313 
Protein GI125973403 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAGC AAAAATTGAT GGCCATAGAC GGAAACAGTA TTCTTAACAG GGCTTTTTAC 
GGGCTTCCCG AACTTCTGAC AACATCCGAC GGGATATATA CCAACGGAAT TTATGTTTTT
TTAAATATAA TGCATAAATT TATTGAAGAG GAAAATCCCG AGTACATTTG CGTTGCGTTC
GACCTTAAGG CTCCGACTTT CAGGCACAAT AAATACGAAG GTTACAAGGC AAACAGGAAA
GGAATGCCGG AAGAGCTCCG GGTTCAGGTT CCCCTGCTTA AGGAAGTTTT GGATGCAATG
AATATAAAAA GACTTGAGAT GGAGGGGTTC GAGGCTGACG ACATACTTGG TTCCGTTTCC
TTGTGCGCCG AAAAAAAGGG TCTGGAAGTA ATACTGGTTA CAGGGGACAG GGATGCTTTT
CAATTGATCG GTCCTTCCAC AAGGCTGAAA CTTCCGCGGA CGAGAGGCGG GAAAACAGAG
GTTGAGGAAT ATGACTACAA CAAGATTGTG GAGGTCTACG GAATCAAGCC GGAACAGTTT
GTTGACGTCA AGGCTTTGGC GGGAGATACT TCCGACAATA TTCCCGGTGT TCCGGGTATC
GGCGAAAAGA CGGCCCTGGC TCTCATAAAA GAATACAACA ATCTTGAAAA CCTTTATAAT
TCATTGGACA GCATTAAAAA GAAAGGACTT AGGGAAAAGC TTGAAACTTT TAAGGAGCAG
GCTTTTCTGA GCAGGGAGCT TGCCCTGATT GAAAGAAACA TGCCGTCCCT TTGTGATATT
GAAGAGCTGA AAAGAGTGGA GATTGACAGG GAAAAAACCT ATGAGATATT TAAGAGGCTG
GAATTTAGAA GCTTTATTGA CAAGTTTGGA TTGAACGATG TCCAAATCCA AAATACCGTG
GAACTGAATG TGAAAATCGC AAAAAACGCC AGTGAACTTG AGAGTTTGAA AAACAATATT
CTCAAGTCCA GAAAAGTTTG TATTTATCAT TTGATTGACA AAACGGGCAG CTTTTCTCAA
AAGCTTGCCG CCATTGCAAT TTCGCCCGTG GAGGATGAAG CATGGTATTT GGATTTTACC
AATAATATTG ATGAAGATGA GTTTTTCAGG CAGTTTAAGG ACGTTTTGGA GGATGGAAAT
ATAAAGAAAT ACGGGCATGA TTTGAAAAAT TTTATAGTAT ATTTAAATAA TCGGGGAATT
GATTTTAACG GTTTGGCTTT TGACACAATG ATTGGAGCTT ATATAATAAA CCCGTCAAAG
GAGACCTATA CGATATCCGA GCTGGCACAG GAGTATTTAA ACTTGAGTGT AAAGGCGGTT
GAGGAACTTG CGGGCAAGGG CAAAAGCTTT ACTTTGTTTA AGGACATGCA GCCTGACGTT
CTTTCAAAGA CTGTTGGTGT TTATCCTCAT GTTATAAGCA AAGTAAGCCG GAAAATTGAC
AGCCTTCTTA AAGAAAACAA CCAGGAGAGG CTTTATTATG ACATTGAGCT TCCGCTGGTG
CGGACCTTGG CGGATATGGA GTATTACGGA TTCAAGGTTA ATGTCGATGC TCTTGTGGAA
TTTTCGAAAG AGCTTCAGGA AAAGATAGAT GTTGTAACAA AAGAAATATA CACTTTGGCG
GGAGAAGAGT TCAATATCAA TTCTCCGAAA CAGCTGGGAG TTATTTTGTT TGAGAAACTG
GGTCTTCCCA TTATTAAGAA AACAAAAACC GGATATTCAA CCGATGCTGA AGTATTGGAA
GAGCTTTCCG ACAGGCATGA AATAGTGGAA AAAATACTGG AATACAGACA GCTTGTAAAG
CTGAAATCCA CTTATGCGGA AGGCCTTTTG GCGGTTATAA ATCCTTACAC GGGAAAGATT
CATTCAAGTT TCAACCAGAC AGTGACGGCT ACGGGAAGAA TAAGCAGTAC AGAGCCAAAT
CTTCAGAATA TACCGATAAA ACTTGAAATG GGCAGGAAAA TACGAAAAGT TTTTATACCT
TCGGATGAAA ACTATCTGCT TCTTGATGCG GACTATTCCC AGATAGAGCT TCGGGTTCTG
GCCCACATAA CCAATGACGA AAACATGATA AATGCGTTTT TAAACAACGA AGACATTCAT
ACTTCCACGG CTGCATCGGT CTTTGGAATA CCAAAAGAGG AAGTTACCCC TCTCATGAGG
TCCAGAGCGA AAGCTGTCAA TTTCGGTATT GTATACGGTA TAGGGGACTT CAGTCTTGCA
AAGGATCTTA AGATAAGCAG AAAGGAAGCC AGAGCATATA TAGACGGTTA TCTGGACAGA
TATCCAAATG TAAAGAAATA TATGCATGAT ATTGTGGAAG AGGGAAAAGA AAAAGGTTTT
GTAACCACCA TGTTCATGAG AAGAAGGTAC CTTCCTGAGC TTAAATCGCG CAACTTCAAC
ATACGGTCTT TTGGAGAACG GGTTGCGATG AACACCCCGA TACAGGGAAG TGCCGCGGAT
ATAATCAAGA TTGCCATGGT AAAGGTGCAT GGAGAGCTTA AAAAAAGAAA GCTTAAATCC
AGGCTGATAC TTCAGGTTCA CGATGAACTT ATTGTAGAGA CGTTCAAGGA TGAAAAAGAA
GAGGTGGAAA AGATTTTACT TGAAGGCATG CAAAATGCCG TAAGTCTGAA AGTGCCGCTG
GTTGTGGAGA TTAAATCGGG CAGCAACTGG TATGAGACAA AGTAA
 
Protein sequence
MSKQKLMAID GNSILNRAFY GLPELLTTSD GIYTNGIYVF LNIMHKFIEE ENPEYICVAF 
DLKAPTFRHN KYEGYKANRK GMPEELRVQV PLLKEVLDAM NIKRLEMEGF EADDILGSVS
LCAEKKGLEV ILVTGDRDAF QLIGPSTRLK LPRTRGGKTE VEEYDYNKIV EVYGIKPEQF
VDVKALAGDT SDNIPGVPGI GEKTALALIK EYNNLENLYN SLDSIKKKGL REKLETFKEQ
AFLSRELALI ERNMPSLCDI EELKRVEIDR EKTYEIFKRL EFRSFIDKFG LNDVQIQNTV
ELNVKIAKNA SELESLKNNI LKSRKVCIYH LIDKTGSFSQ KLAAIAISPV EDEAWYLDFT
NNIDEDEFFR QFKDVLEDGN IKKYGHDLKN FIVYLNNRGI DFNGLAFDTM IGAYIINPSK
ETYTISELAQ EYLNLSVKAV EELAGKGKSF TLFKDMQPDV LSKTVGVYPH VISKVSRKID
SLLKENNQER LYYDIELPLV RTLADMEYYG FKVNVDALVE FSKELQEKID VVTKEIYTLA
GEEFNINSPK QLGVILFEKL GLPIIKKTKT GYSTDAEVLE ELSDRHEIVE KILEYRQLVK
LKSTYAEGLL AVINPYTGKI HSSFNQTVTA TGRISSTEPN LQNIPIKLEM GRKIRKVFIP
SDENYLLLDA DYSQIELRVL AHITNDENMI NAFLNNEDIH TSTAASVFGI PKEEVTPLMR
SRAKAVNFGI VYGIGDFSLA KDLKISRKEA RAYIDGYLDR YPNVKKYMHD IVEEGKEKGF
VTTMFMRRRY LPELKSRNFN IRSFGERVAM NTPIQGSAAD IIKIAMVKVH GELKKRKLKS
RLILQVHDEL IVETFKDEKE EVEKILLEGM QNAVSLKVPL VVEIKSGSNW YETK