Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0927 |
Symbol | |
ID | 4811220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1112854 |
End bp | 1116426 |
Gene Length | 3573 bp |
Protein Length | 1190 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106346 |
Product | condensin subunit Smc |
Protein accession | YP_001037354 |
Protein GI | 125973444 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG1196] Chromosome segregation ATPases |
TIGRFAM ID | [TIGR02168] chromosome segregation protein SMC, common bacterial type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.177469 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCATCTGA AGAGACTTGA AATTCAAGGC TTTAAATCCT TTGCCGACAG GATACAGCTT GAATTTAATT CCGGCATTAC TGCTGTGGTG GGACCGAACG GAAGCGGTAA AAGCAACATT TCCGATGCGA TAAGGTGGGT ACTGGGAGAA CAAAGTGCCA AAACCTTAAG AGGCGGAAAG ATGGAAGATG TAATATTTGC CGGTACCGAA CACAGAAAGC CCATGGGATT TGCGGAAGTT TCCCTTACTT TTGACAATTC TGACGGCGTG CTTCCCATTG ATTTTAGTGA GGTTACGGTA ACCAGGAGGG TGTATCGCTC CGGGGAAAGT GAGTATATGA TAAACAAAAC CCCGTGCCGT CTTAAGGATA TATATGAGTT GTTCCTTGAT ACGGGAATTG GCAAAGACGG TTATTCCATA ATCGGACAAG GAAGAGTTGA TGAAATCTTA AGCTCAAAGT CCGAAGACAG AAGGGCTATT TTTGAAGAAG CCTCCGGGAT TATGAAGTAT AAGGTACGAA AGCAGGAAGC GGAAAAAAAG CTTGAAATGA CAAGGCAGAA TCTGCTTAGA ATAAATGATA TTATTGCTGA GCTTGAAAAC CAGCTGGAGC CTTTAAGAGA GCAGTCCGAG GTGGCAAAAA GGTACCTTGG CTTGAGAGAG ACTCTAAAAG TACTGGAAGT AAATGTATAT ATAGAAAATA TCGCCAGATA CAAGGAAAAG ATAAAGGAAT TGGAAGAAAA CTATGCGTCG GTAAAAGATA ACATAGACAG TGAAAATAAG AGGCTGGAGG AAATTACAAG CCTCAATCAG ACGAACTTGA GCATTTTAAA AGATATGGAA GGAAGGCTTG AAGCTGCAAA ACAAGAGTAT TATGCAATTG ACGGTAATCT TGAAAAATCC AATTCCGAAA TCAGGCTAAA CCAGGAAAAA ATCAACAATC TGTTTTCAAA TATTGAACGC CTTGACGGGG AAATAGCCGA GATTGATGAG AAAATCAAAA CTATTTTGGA GGAAGAAGCT TCAAAGAATA GTAAAATCGT CTATCTTCAG GAGAGATACA ATGAGTATTC GGCAAAGCTT GAAGAGGCTG AAAAAAAGCT TCAGGCCATT ATTGCCACAC TGAATGAGAA TGAGAGACAT ATTGAAAACT TAAAGACAGA GATAATGGAA ATGCTTGACA TTCAGTCGGA CAAGAAGACG CAGATCAACA ATATAAAAAA CCACATCGAA GGCATCAAAA AGAGACAGGC CAATATTGAC AAAGAAGTAT ATCAGCTGAC TCTTGAAAAA GACAAGGAGT GCATGAAAAA AGAGGAGCTG TCCGAGAGTA TCTACAAGAC AAACGAACTT ATAAAAAATA TAAAAGACTT GCTTCAGGAA TTAACTGAAA AAAGAAAAGA CCTTGGTATA AAGCTCGAAG AGGAAAAGAA AAAGCAGAAT AACGTCAGGT CTCAAATACA GATAATGACT TCGAGACAAA AGATGTTAAT AGACATGGAG CGAAACCTGG AAGGCTATAA CCGGACTGTA AGGGTTATTT TACAGGCGTG CCGTGAGTCT CATGAGTTTG GAAAGGGCAT TCATGGTGCG CTGGCTCAAC TGTTCACAGT GGACAAAAGG TATGAGACAG CGATAGAAAT GGCGCTGGGT GGAGCTTTGC AGAATATTGT CACCACCAGT GAGGAGGATG CCAAGAGGGT AATAGAATAT TTAAAGAAAA ACAATTTGGG AAGGGCAACT TTTCTTCCCA TATCCTCTGT AAAAGGAAAA TATCTTGACG ACAGCATTTT AAACCAGTTA AAAGATCATG AAGGCTTTGT CGGGGTTGCA TCCGACCTTA TTGAATATGA CGAGCAGTAC AGGGGAATAA TTTTGAGCCT TCTGGGCAAG GTAGTGGTGG TAGAGAGCCT GGATGCCGGA ATAAGGATGG CAAGAAAGTT TGGATACGGC TTTAGAATAG TATCCCTGGA CGGAGACATA TTAAGCACCA CCGGTTCCAT ATCCGGAGGA AGCAAGGAAA AAAGAGAGTC GGGAATTTTG AGCAGAAATC GGGAAATCTC CGAACTTGGG GAAAGCATTG CAAGGCTTAA GGAAGACGAT GAAGCAATAG AAAAAAATGT TGAGGGATTA ATCAGAGAGC TTGAAGAGAT TACGGACAAA ATATCCTTTG AGGAAAGAAG CCTTAAAGAC AATGAACTTG TAAAAATAAG AGACGAGAGC CATTTGGCCC AGATTGAGGA AAATATCAAA AGAAGCCTTG CAAGAATTGA TATGCTAAAG CAGGAGAAAG AGCAGCTCAT AAGACAGGAA AAGGATACTT GTCTGGAGCT TTCAAAATAC GAAGATGAAC TGTCTGAAAT AGAAAGGGAT ATTGCAGAGA AAAAAGAGGT TGTGGCAAGG TACCAGGAAA AGAACAAAGA AGAGCAGTCG GTAAGAGATG CATTGCACAA CGATATTACC GACTACAGGA TTTCTGTAAA CTCCATTCTG GAGAGCATGG AAGGCGTAAA GGAAACATTG GAGAGACTGG TAAATGAGAA AAACAGTCTT GTAAAGGCAA TGGAAAGAAA AAAAGCCGAG AAAGCCAGAA ACGAGCAGGA GATTAAAGCT TTGCAGGAAA AAAATGAAGG TCTTGATAAA CTAATTAAAA AGTATGAGGA GGAGAAATCT GGAAAGACTT TTGAAATAGA CAGGATTACG GAGGAGAAAA AAATCCGGGA GGAAGAGTCT GCGGGTATTA TAGATCAGAT AACCGAAATA AACAAAAATA TATTGCTGCT TCAGGAGGAA TACAGTAGAA TTGAGGTTAA GAAGGCAAAG CTCGAGTCTG AGATGGAGTC TATTCAAAAC AGAATGTGGG ATGAGTATGA GTTGACTTAT ACCAACGCAC TTGAGCTTAA AAAAGATATT GGAAGCATGG CGCAGGCGCA AAAAAGGATT GCCGAAATAA GAAATGAGAT AAAGGAACTG GGGCCTGTCA ATGTGGCCGC CATTGACGAA TATATAAAGA CAAAAGAGCG TTTTGAGTTT ATGTCGGCAC AGAAAAGCGA TATGGAACAG GCTGAGAAAA AGCTTCAGAA AGTAATAAAT GAAATGATGA CCATAATGAA GCGCCAGTTT ATGGAGAAAT TTAAACTTAT AAATGAAAAT TTTAACTTGG TGTTCAGAGA GTTGTTTGAC GGCGGAAGGG CGGAATTGAT TCTGGTAGAC AAGGAGAATG TGCTTGAAAG CGGCATTGAG ATAGAAGTAC AACCTCCGGG GAAGAAACTT CAGAATCTTA TGCTTCTTTC GGGAGGAGAA AGAGCGTTTA CTGCAATTGC GCTGCTTTTT GCCATTCTAA GGCTTAATCC GACGCCCTTC TGTGTGCTGG ATGAGATTGA AGCAGCTTTG GATGATGCAA ATGTGTACAA GTTTGCACAA TATTTAAAGA AATATTCCAA TGTAACTCAG TTTGCGGTTA TTACTCACAG AAAGGGCACC ATGGAAGCTG CCGATACGCT GTATGGTGTA ACAATGCAGG AGCATGGTGT TTCAAAAGTG GTTTCCCTTA AAATGGGGGA AAAAGTGGGT TAA
|
Protein sequence | MHLKRLEIQG FKSFADRIQL EFNSGITAVV GPNGSGKSNI SDAIRWVLGE QSAKTLRGGK MEDVIFAGTE HRKPMGFAEV SLTFDNSDGV LPIDFSEVTV TRRVYRSGES EYMINKTPCR LKDIYELFLD TGIGKDGYSI IGQGRVDEIL SSKSEDRRAI FEEASGIMKY KVRKQEAEKK LEMTRQNLLR INDIIAELEN QLEPLREQSE VAKRYLGLRE TLKVLEVNVY IENIARYKEK IKELEENYAS VKDNIDSENK RLEEITSLNQ TNLSILKDME GRLEAAKQEY YAIDGNLEKS NSEIRLNQEK INNLFSNIER LDGEIAEIDE KIKTILEEEA SKNSKIVYLQ ERYNEYSAKL EEAEKKLQAI IATLNENERH IENLKTEIME MLDIQSDKKT QINNIKNHIE GIKKRQANID KEVYQLTLEK DKECMKKEEL SESIYKTNEL IKNIKDLLQE LTEKRKDLGI KLEEEKKKQN NVRSQIQIMT SRQKMLIDME RNLEGYNRTV RVILQACRES HEFGKGIHGA LAQLFTVDKR YETAIEMALG GALQNIVTTS EEDAKRVIEY LKKNNLGRAT FLPISSVKGK YLDDSILNQL KDHEGFVGVA SDLIEYDEQY RGIILSLLGK VVVVESLDAG IRMARKFGYG FRIVSLDGDI LSTTGSISGG SKEKRESGIL SRNREISELG ESIARLKEDD EAIEKNVEGL IRELEEITDK ISFEERSLKD NELVKIRDES HLAQIEENIK RSLARIDMLK QEKEQLIRQE KDTCLELSKY EDELSEIERD IAEKKEVVAR YQEKNKEEQS VRDALHNDIT DYRISVNSIL ESMEGVKETL ERLVNEKNSL VKAMERKKAE KARNEQEIKA LQEKNEGLDK LIKKYEEEKS GKTFEIDRIT EEKKIREEES AGIIDQITEI NKNILLLQEE YSRIEVKKAK LESEMESIQN RMWDEYELTY TNALELKKDI GSMAQAQKRI AEIRNEIKEL GPVNVAAIDE YIKTKERFEF MSAQKSDMEQ AEKKLQKVIN EMMTIMKRQF MEKFKLINEN FNLVFRELFD GGRAELILVD KENVLESGIE IEVQPPGKKL QNLMLLSGGE RAFTAIALLF AILRLNPTPF CVLDEIEAAL DDANVYKFAQ YLKKYSNVTQ FAVITHRKGT MEAADTLYGV TMQEHGVSKV VSLKMGEKVG
|
| |