Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1433 |
Symbol | |
ID | 7310204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 1739138 |
End bp | 1741084 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643608357 |
Product | DNA mismatch repair protein MutS domain protein |
Protein accession | YP_002505767 |
Protein GI | 220928858 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00229473 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAT ATGCCGTAAC ACTTGAATTT AACAAAATAA TTGAAATGCT TATGGAAAAT ACCGTTTCTC AAAGAGCTAA AGAGAATCTT TCGATGCTTC AACCTTTTCT GAAGGAGGGA GAATGCAGAA GAAGAATGAA GGAAACTACC AACGCAAAAA GAATTCTTGA AAGTCTGGGG ACTCCTCCTC TCTCTTCAAT GAACGAGCTA GACAGAATTC TTGATCTTTG TTCAAAGGGT GCAATGCTTG TTCCTGAACA ATTAGAAGAG GTTTCACAGT TTCTTCTTGC CTGCAGACGT ATGAAATCAT ATCTCAAAAG GACCGAGAGC CTTGAGAATG ATATTGCTTA TTATGGAAGC TCAATAGATG CACTTAATGA CCTCTGTGAA GAAATTGAAC ACTCAATCAG AAATGGACGG GTTGATGATT CTGCTTCACC AAAGCTGAAA GATATACGAA GAAGGAAGGA TAATTCCCAC CAACAAGTCC GTACCAAGCT TGAAAGTATA TTAAGGAGTA AAAAAGAATG GTTTGCGGAC AGTTTTGTTT CAATGAGAAA TGGTCATTTC GTACTGCCCG TAAGAAAAGA ATATAAGAAC ATGGTAGTCG GCTCGCTGAT AGAAACCTCC GCCACAGGAG GAACTTATTT TATTGAGCCT TCCGCTGTAC GCAGACTTCA AGAGGAAATA TCCGCCCTTA CAGTGGAAGA AGAAAATGAA GAAAGAAAAA TCTTATATAC TCTGACCTCA TTAGTGGATG AACATTCAGC TGTATTTAAA ACTAACGTAG ATATAATGGA AACTCTTGAT GCAGTTTTTG CTAAGGCTAA GCTTTCAGTT CAGATGAAGG CCTGTGAAGC TGATATTCAT ATGGACCGCA GGATTGTTAT CAAAGCCGGA CGCCACCCTC TTCTTAATCA ATCGGAATGT GTTCCTCTTG ACTTTGAAAT CGGTAGTGGT ATAAGAGGAG TTGTTATAAC TGGGCCAAAT ACAGGAGGAA AAACAGTGGC CCTTAAAACT GTAGGGTTGC TTTCAATCAT GGCACAAAGC GGACTTCATG TTCCCGCCGC CATGGGAAGT GAATTTTCAA TGCATAATAT GATTTTATGT GATATCGGAG ACGGACAGAG TATTACCGAG AACTTGTCAA CTTTTTCGGC ACACATAACA ACCATAAATG ATATTCTAAG GCAATCCACG GGAGAAAGCC TTGTCCTGCT TGACGAAGTA GGCTCTGGAA CCGACCCGGC AGAAGGTATG GGTATAGCTA CCGCAATTCT GGAAGATTTG AAAAACAAGG GCTGCCTGTT TGTAGCAACT ACTCATTATC CTGAAATAAA GGACTATGCC AAAAAAACAC CCGGTCTTGT AAATGCCAGA ATGGCTTTTG ACAAGGAAAG TTTGAAACCT TTGTATAAAC TTGAAATTGG TGAGGCCGGG GAAAGCTGTG CCCTGTATAT TGCAAAAAGG CTTGGCTTTC CCCCACATCT TCTCAGGCTT GCTCATGATG CCGCCTATAA AGAGTACCCT ACTAAGGTTA ATTCAGAGAA TAATGAATCC TTGTTAAGCC CTATTGGAGC GGACAGCCAG TCCGGAATTC CACCTGACTT AAATGCACCT GTTCTTATAA AGGAAAACCC AAAGAAAAAT GTACCACAAA ACAGAAGCAG CAGATTTAAC ATTGGAGATA GTGTTATGGT CTTTCCTCAG AAAGAAATAG GGCTGGTTTA TCAAAAAGCC AACGAACATG GTGAAATAGG TGTTCAGATT AAAGGACAAA AAAAACTTCT TTCCCACAAA AGAATAAAAC TACACGTACC AGCCAGCGAG TTATACCCAG CTGACTACGA CTTTTCAATA ATATTTGATA CTGTGGACAA TCGCAAAGCA AGACACAAGA TGGGGAAAAG ACATAATCCT GAGTTGTTTA TAGAGTATGA AAATTAG
|
Protein sequence | MNKYAVTLEF NKIIEMLMEN TVSQRAKENL SMLQPFLKEG ECRRRMKETT NAKRILESLG TPPLSSMNEL DRILDLCSKG AMLVPEQLEE VSQFLLACRR MKSYLKRTES LENDIAYYGS SIDALNDLCE EIEHSIRNGR VDDSASPKLK DIRRRKDNSH QQVRTKLESI LRSKKEWFAD SFVSMRNGHF VLPVRKEYKN MVVGSLIETS ATGGTYFIEP SAVRRLQEEI SALTVEEENE ERKILYTLTS LVDEHSAVFK TNVDIMETLD AVFAKAKLSV QMKACEADIH MDRRIVIKAG RHPLLNQSEC VPLDFEIGSG IRGVVITGPN TGGKTVALKT VGLLSIMAQS GLHVPAAMGS EFSMHNMILC DIGDGQSITE NLSTFSAHIT TINDILRQST GESLVLLDEV GSGTDPAEGM GIATAILEDL KNKGCLFVAT THYPEIKDYA KKTPGLVNAR MAFDKESLKP LYKLEIGEAG ESCALYIAKR LGFPPHLLRL AHDAAYKEYP TKVNSENNES LLSPIGADSQ SGIPPDLNAP VLIKENPKKN VPQNRSSRFN IGDSVMVFPQ KEIGLVYQKA NEHGEIGVQI KGQKKLLSHK RIKLHVPASE LYPADYDFSI IFDTVDNRKA RHKMGKRHNP ELFIEYEN
|
| |