Gene Information |
Locus tag | Cthe_0460 |
Symbol | |
ID | 4808388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 574145 |
End bp | 576253 |
Gene Length | 2109 bp |
Protein Length | 702 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640105874 |
Product | DNA topoisomerase I |
Protein accession | YP_001036891 |
Protein GI | 125972981 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA; [COG0551] Zn-finger domain associated with topoisomerase type I |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
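
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the convention that reproduces the listed gene length) and that the unspliced CDS ends in a single stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 574145
END_BP = 576253


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2109 702
```

Both results match the Gene Length (2109 bp) and Protein Length (702 aa) rows of the record.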
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.159815 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGACA AATTGATTAT TGTTGAGTCT CCCGCAAAGG CACATACTAT TGGGAAATTC TTGGGAAAAG ACTATAAGAT AGTTGCTTCT GTCGGACATG TGAGAGACCT TCCCAAAAGT CAGATGGGGG TCGATATAGA GAATGATTTT ACTCCCAAGT ACATTACAAT AAGAGGTAAG GGTGAAATAA TTTCAAAACT TAAAAAAGAA GCAAAGAACG CAAGCACTAT CTATCTTGCA ACCGACCCTG ACCGTGAGGG TGAGGCTATT TCGTGGCATT TGGCTACTTT GCTTAACATA GATAAAAATG AGAAATGCAG GATAACTTTT AATGAGATAA CCAAAAATGC TGTTAAAAAC GCCATAAAAT CACCCAGGGA AATCAACATG GACCTGGTTG ATGCCCAACA GGCAAGAAGG GTATTGGACA GGATTGTGGG ATACAAGATA AGTCCCCTGC TTTGGAAAAA AGTTAAAAAA GGATTGAGTG CAGGAAGGGT TCAGTCGGTT GCGACAAGGC TTATCTGCGA CAGGGAAGAA GAAATTGAAA AGTTTGTACC TGAGGAATAC TGGACCATAA CTGCAAAACT CTTAAAAGGT GGAGTAAAGG CTCCTTTTGA GGCCAAATTT TACGGTCTGG ACAATAAAAA GACTGAACTT AAAAGCGAGG AAGAAGTAAA TAAAGTTCTG GATGAGATAA AAGACGCCGT GTTTGTGGTT CAGAAAGTGA AAAAAGGGGA GAAAAAGAAA AATCCTAACG CACCGTTTAC CACCAGTACG ATGCAGCAGG AAGCTTCCAG GAAACTGGGA TTTTCCACAA AAAAGACAAT GATGGTGGCA CAGCAGCTCT ATGAAGGAAT TGAAGTAAAA GGTGTCGGGG CTGTGGGTCT TGTCACTTAT ATTCGTACCG ATTCCACGAG AATTTCCGAG GAGGCGCAAA ATCAAGCTGC AAAGTATATT AAAGAAAAGT TTGGTGAAAG TTATCTTCCC AAAGAAAAAA ATGTTTATAA AAACAAATCT GCCTCTCAGG ATGCCCATGA GTGTATAAGA CCGACATCGG TTGAAATGGA TCCTGAGTCT GTTAAGGATT CACTTACCAA GGAGCAGTAC CGTCTGTATA AGCTTATATG GGATAGGTTC GTGGCCAGTC AGATGGCGCC GGCCGTATAC GACACTATAA ATGCGGATAT TGAAGCCGGA AAATATCTTT TCAAGGCAAG CGGCTCCACT GTAAAGTTTC CTGGATTTAC GGTCCTGTAT CAGGAAGACA AGGACGATGA AACGGAAGAA GGAGAGGTCA TTGTTCCGGA GCTTGCGGAA GGGGAAAGCT TAAAGCTTAA AAAGCTTGAA CCCAGACAAC ATTTCACCCA GCCGCCGCCA AGGTATACGG AAGCAAGCCT GGTTAAGGCT TTGGAAGAAA AAGGCATAGG AAGACCGAGT ACTTACGCCC CCATCATTAC AACCATTTTG GCCCGGGGTT ATGTGGTGAA GGAAGGCAAG ACATTGGTTC CGACCGAGCT CGGGAAAATC GTTACGGATA TTATGAAAAA CTATTTTCAG GATATCGTGG ATGTAGAGTT TACGGCTCAA ATGGAGAAAA CGCTTGATGA AGTGGAAGAG GGCGAAAAAA GATGGGTAGA TGTCATGAGA AGCTTTTACT CCCAATTTGT TGATGTTCTG AAAAATGCGG AGGAAAAGAT AGGCAATATT GAAGTTCCCG AGGAAGTAAC CGATGAGATT TGTGAAAAGT GCGGAAGAAA CATGGTAATT AAAGTTGGAA AGAAAGGAAG ATTTTTGGCG 
TGTCCCGGTT TTCCGGAGTG CAGAAACGCC AAGCCTATTT TGGAGGATGC GGGTGTAACC TGTCCTAAAT GCGGCGGTAA AGTGTACATT AAAAAGACGC GGAAAGGCAG GAAATATCTT GGTTGTGAGA ATAACAACAG CGATCCCAAA TGCGATTTTA TGACTTGGGA TATGCCGTCA AAAGAAAACT GCCCGAAATG CGGAAGTTTT TTGCTTAAAA AGTATTCCGG CAGGAAGGTA CAGCTAAAAT GCAGCAATGA AAACTGTGAT TATGTAAAAA CGGGGAAAGA AAAAAAGGAA GATGAATAA
|
Protein sequence | MADKLIIVES PAKAHTIGKF LGKDYKIVAS VGHVRDLPKS QMGVDIENDF TPKYITIRGK GEIISKLKKE AKNASTIYLA TDPDREGEAI SWHLATLLNI DKNEKCRITF NEITKNAVKN AIKSPREINM DLVDAQQARR VLDRIVGYKI SPLLWKKVKK GLSAGRVQSV ATRLICDREE EIEKFVPEEY WTITAKLLKG GVKAPFEAKF YGLDNKKTEL KSEEEVNKVL DEIKDAVFVV QKVKKGEKKK NPNAPFTTST MQQEASRKLG FSTKKTMMVA QQLYEGIEVK GVGAVGLVTY IRTDSTRISE EAQNQAAKYI KEKFGESYLP KEKNVYKNKS ASQDAHECIR PTSVEMDPES VKDSLTKEQY RLYKLIWDRF VASQMAPAVY DTINADIEAG KYLFKASGST VKFPGFTVLY QEDKDDETEE GEVIVPELAE GESLKLKKLE PRQHFTQPPP RYTEASLVKA LEEKGIGRPS TYAPIITTIL ARGYVVKEGK TLVPTELGKI VTDIMKNYFQ DIVDVEFTAQ MEKTLDEVEE GEKRWVDVMR SFYSQFVDVL KNAEEKIGNI EVPEEVTDEI CEKCGRNMVI KVGKKGRFLA CPGFPECRNA KPILEDAGVT CPKCGGKVYI KKTRKGRKYL GCENNNSDPK CDFMTWDMPS KENCPKCGSF LLKKYSGRKV QLKCSNENCD YVKTGKEKKE DE
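
The protein sequence follows from the gene sequence through translation table 11, as listed in the Gene Information table. The minimal sketch below builds the codon table from the amino-acid string that NCBI publishes for table 11 and translates the first three 10-base blocks of the gene sequence. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration; a maintained library such as Biopython would normally be preferable for real work.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, in TCAG codon order ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a + strand CDS, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks of the record can be
    pasted in directly. Ambiguous bases such as N raise a KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in the record.
print(translate("ATGGCTGACA AATTGATTAT TGTTGAGTCT"))  # MADKLIIVES
```

The printed residues match the first block of the Protein sequence row. The sketch does not handle the alternative start codons that table 11 permits; that does not matter here because this gene begins with ATG.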
|
| |