Gene Cthe_0460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0460 
Symbol 
ID4808388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp574145 
End bp576253 
Gene Length2109 bp 
Protein Length702 aa 
Translation table11 
GC content42% 
IMG OID640105874 
ProductDNA topoisomerase I 
Protein accessionYP_001036891 
Protein GI125972981 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.159815 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGACA AATTGATTAT TGTTGAGTCT CCCGCAAAGG CACATACTAT TGGGAAATTC 
TTGGGAAAAG ACTATAAGAT AGTTGCTTCT GTCGGACATG TGAGAGACCT TCCCAAAAGT
CAGATGGGGG TCGATATAGA GAATGATTTT ACTCCCAAGT ACATTACAAT AAGAGGTAAG
GGTGAAATAA TTTCAAAACT TAAAAAAGAA GCAAAGAACG CAAGCACTAT CTATCTTGCA
ACCGACCCTG ACCGTGAGGG TGAGGCTATT TCGTGGCATT TGGCTACTTT GCTTAACATA
GATAAAAATG AGAAATGCAG GATAACTTTT AATGAGATAA CCAAAAATGC TGTTAAAAAC
GCCATAAAAT CACCCAGGGA AATCAACATG GACCTGGTTG ATGCCCAACA GGCAAGAAGG
GTATTGGACA GGATTGTGGG ATACAAGATA AGTCCCCTGC TTTGGAAAAA AGTTAAAAAA
GGATTGAGTG CAGGAAGGGT TCAGTCGGTT GCGACAAGGC TTATCTGCGA CAGGGAAGAA
GAAATTGAAA AGTTTGTACC TGAGGAATAC TGGACCATAA CTGCAAAACT CTTAAAAGGT
GGAGTAAAGG CTCCTTTTGA GGCCAAATTT TACGGTCTGG ACAATAAAAA GACTGAACTT
AAAAGCGAGG AAGAAGTAAA TAAAGTTCTG GATGAGATAA AAGACGCCGT GTTTGTGGTT
CAGAAAGTGA AAAAAGGGGA GAAAAAGAAA AATCCTAACG CACCGTTTAC CACCAGTACG
ATGCAGCAGG AAGCTTCCAG GAAACTGGGA TTTTCCACAA AAAAGACAAT GATGGTGGCA
CAGCAGCTCT ATGAAGGAAT TGAAGTAAAA GGTGTCGGGG CTGTGGGTCT TGTCACTTAT
ATTCGTACCG ATTCCACGAG AATTTCCGAG GAGGCGCAAA ATCAAGCTGC AAAGTATATT
AAAGAAAAGT TTGGTGAAAG TTATCTTCCC AAAGAAAAAA ATGTTTATAA AAACAAATCT
GCCTCTCAGG ATGCCCATGA GTGTATAAGA CCGACATCGG TTGAAATGGA TCCTGAGTCT
GTTAAGGATT CACTTACCAA GGAGCAGTAC CGTCTGTATA AGCTTATATG GGATAGGTTC
GTGGCCAGTC AGATGGCGCC GGCCGTATAC GACACTATAA ATGCGGATAT TGAAGCCGGA
AAATATCTTT TCAAGGCAAG CGGCTCCACT GTAAAGTTTC CTGGATTTAC GGTCCTGTAT
CAGGAAGACA AGGACGATGA AACGGAAGAA GGAGAGGTCA TTGTTCCGGA GCTTGCGGAA
GGGGAAAGCT TAAAGCTTAA AAAGCTTGAA CCCAGACAAC ATTTCACCCA GCCGCCGCCA
AGGTATACGG AAGCAAGCCT GGTTAAGGCT TTGGAAGAAA AAGGCATAGG AAGACCGAGT
ACTTACGCCC CCATCATTAC AACCATTTTG GCCCGGGGTT ATGTGGTGAA GGAAGGCAAG
ACATTGGTTC CGACCGAGCT CGGGAAAATC GTTACGGATA TTATGAAAAA CTATTTTCAG
GATATCGTGG ATGTAGAGTT TACGGCTCAA ATGGAGAAAA CGCTTGATGA AGTGGAAGAG
GGCGAAAAAA GATGGGTAGA TGTCATGAGA AGCTTTTACT CCCAATTTGT TGATGTTCTG
AAAAATGCGG AGGAAAAGAT AGGCAATATT GAAGTTCCCG AGGAAGTAAC CGATGAGATT
TGTGAAAAGT GCGGAAGAAA CATGGTAATT AAAGTTGGAA AGAAAGGAAG ATTTTTGGCG
TGTCCCGGTT TTCCGGAGTG CAGAAACGCC AAGCCTATTT TGGAGGATGC GGGTGTAACC
TGTCCTAAAT GCGGCGGTAA AGTGTACATT AAAAAGACGC GGAAAGGCAG GAAATATCTT
GGTTGTGAGA ATAACAACAG CGATCCCAAA TGCGATTTTA TGACTTGGGA TATGCCGTCA
AAAGAAAACT GCCCGAAATG CGGAAGTTTT TTGCTTAAAA AGTATTCCGG CAGGAAGGTA
CAGCTAAAAT GCAGCAATGA AAACTGTGAT TATGTAAAAA CGGGGAAAGA AAAAAAGGAA
GATGAATAA
 
Protein sequence
MADKLIIVES PAKAHTIGKF LGKDYKIVAS VGHVRDLPKS QMGVDIENDF TPKYITIRGK 
GEIISKLKKE AKNASTIYLA TDPDREGEAI SWHLATLLNI DKNEKCRITF NEITKNAVKN
AIKSPREINM DLVDAQQARR VLDRIVGYKI SPLLWKKVKK GLSAGRVQSV ATRLICDREE
EIEKFVPEEY WTITAKLLKG GVKAPFEAKF YGLDNKKTEL KSEEEVNKVL DEIKDAVFVV
QKVKKGEKKK NPNAPFTTST MQQEASRKLG FSTKKTMMVA QQLYEGIEVK GVGAVGLVTY
IRTDSTRISE EAQNQAAKYI KEKFGESYLP KEKNVYKNKS ASQDAHECIR PTSVEMDPES
VKDSLTKEQY RLYKLIWDRF VASQMAPAVY DTINADIEAG KYLFKASGST VKFPGFTVLY
QEDKDDETEE GEVIVPELAE GESLKLKKLE PRQHFTQPPP RYTEASLVKA LEEKGIGRPS
TYAPIITTIL ARGYVVKEGK TLVPTELGKI VTDIMKNYFQ DIVDVEFTAQ MEKTLDEVEE
GEKRWVDVMR SFYSQFVDVL KNAEEKIGNI EVPEEVTDEI CEKCGRNMVI KVGKKGRFLA
CPGFPECRNA KPILEDAGVT CPKCGGKVYI KKTRKGRKYL GCENNNSDPK CDFMTWDMPS
KENCPKCGSF LLKKYSGRKV QLKCSNENCD YVKTGKEKKE DE