Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_2469 |
Symbol | |
ID | 7311135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2987222 |
End bp | 2989102 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643609398 |
Product | DNA mismatch repair protein MutS domain protein |
Protein accession | YP_002506777 |
Protein GI | 220929868 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAACAC CGGAAGAAAA GTACAAAAAA AGAGTAGATG TTTATAAAAG AAAGCTTGAA TTATACACCG CAAAAAGCAG TAATACAGGT AATTATAAGC TATTGGTGTT TTTTACGGGC TTGTTAGGAG CAGCAGTTTT CTTTTTCTTA AAGCTATATA TACTAATGGC AGCAGTAATA CTGATTTTTG GGGGATTGTT TGTGTACCTT TCGACAATTC ACAATACACT TATCAAAAGT AAAAACTACT ATCAGGCGAT GCTTCAAATA AACCAAATGT GCTTGAAAAG GGCTATTGGT GAATGGAATG AATTCACAGA CAAAGGTGAA GAATTTCTTA ATCCTCAGCA TGACTACACT TATGATCTTG ATATATTTGG GAAAGGTTCT CTTTTCCAGA TGCTTAATAT GACAGCAAGT TATTCAGGTA GACACAAACT GGCTGAATTG TTTTTAAATC CTTTAAAGCA AAAGGAAGAA ATATACAATA GACAGGAAGC TTTACAGGAG TTGGCTAAAA AGCTTCTTTT CAGGCATCGG CTTTTTTCAA ACGGCTTAAT TCTTAATAAA AATACAATAT TGACTGATGG TACTGACAGT GATAAAAAAA GAAAAAAGAC TCTTTTAGAC ACCATGAATA AGCTGGATGA TGTATATTCG TGGGTAAAAA AGGAAAAAAG CCTGTACAGC TCCCCTAAAT TTAAGCTGTT CATATTTGCA ATGCCGGCTT TTTCATTCAT TATGCTTATA CTTGGGATCA TGGGCCTTGT ACCTGTTTAT GTTCCTGTAG CACTTTATAT TTTACAGTTT ATTTTAATCG GGTTCAGAGC TGAAAGCAGA AACAAGACCT TTGAATTGGT AGAAAAGTAC TCCGACACCC TAAAGGTGTA TAAAAGCTTG TTAAAGAAAT TTGAAACAGA GAAATTCTCA AGCGGTTACA TAAATAGTTT AAAAAATAAC TTAAAAGATG ATTACGGAAA TCCTGCATGG AGACAAATTG AAAAGCTATC AAAGATATGG GAGCTTATTG CAAACAGATA CAACCTGATG CATGCTATTA TAAATATTGC TACACTTTGG GACTTCCATT GCCTTGTAGC TCTTGAAAAC TGGAAAAAAG GCGGAGGTAA ATATGTTGAA AAGTGGTTTG ACATAATAGG AGAGGTTGAA GCTTTATGTA GCCTGTCTCT TATGTGCCAT GACAATCCTG AATACGTAAT GCCCCGCATT TGTGATGAAA ATAATCTACG GATAGAAGCT TTACACTTAG GACACCCGTT GCTGTCAAAG GGTAGAAAAT GCAATGATAT AAAAATCAAT TCAGCTGAAC CAATACTGCT TATAACAGGT TCCAATATGT CGGGCAAAAG TACATTTCTC AGGACGGTAG GTGTAAGCCT TGTATTAACT TATCTGGGCC TACCCGTTTG TGCAGAATCC TTTACCTGCC CGGTATTAAA GGTATATGCA TGTATGAGAA CCAGTGACAA TCTTGGACAA AGTGTTTCCT CATTCTATGC CGAATTGCTG AGAGTCAAAA TGATTGTCGA AGCCGTACAA AGGGGAGAAA AGGTGTTTTT CCTCCTGGAC GAGATTTTTA AGGGAACAAA CTCCGCGGAC AGACATACCG GTGCTAAAAT GCTGATAAAC CAATTAGATA AAAAAGGGGC CTGGGGCCTT GTTTCCACCC ACGACCTTGA ACTTGCCGAT ATGGAAAATG AGAGTAAAGG CCGCATAAGG AACTACCATT TTAAGGAGTA CTATAAAGAT GATCAGATAT TCTTTGATTA TCAGCTCAGA AAAGGTGTAT CAGATACGAA AAATGCAATT TATCTTATGA AAATGGCTGG CGTCAATGTG GAAGATATAA GTCCGCAGTA A
|
Protein sequence | MRTPEEKYKK RVDVYKRKLE LYTAKSSNTG NYKLLVFFTG LLGAAVFFFL KLYILMAAVI LIFGGLFVYL STIHNTLIKS KNYYQAMLQI NQMCLKRAIG EWNEFTDKGE EFLNPQHDYT YDLDIFGKGS LFQMLNMTAS YSGRHKLAEL FLNPLKQKEE IYNRQEALQE LAKKLLFRHR LFSNGLILNK NTILTDGTDS DKKRKKTLLD TMNKLDDVYS WVKKEKSLYS SPKFKLFIFA MPAFSFIMLI LGIMGLVPVY VPVALYILQF ILIGFRAESR NKTFELVEKY SDTLKVYKSL LKKFETEKFS SGYINSLKNN LKDDYGNPAW RQIEKLSKIW ELIANRYNLM HAIINIATLW DFHCLVALEN WKKGGGKYVE KWFDIIGEVE ALCSLSLMCH DNPEYVMPRI CDENNLRIEA LHLGHPLLSK GRKCNDIKIN SAEPILLITG SNMSGKSTFL RTVGVSLVLT YLGLPVCAES FTCPVLKVYA CMRTSDNLGQ SVSSFYAELL RVKMIVEAVQ RGEKVFFLLD EIFKGTNSAD RHTGAKMLIN QLDKKGAWGL VSTHDLELAD MENESKGRIR NYHFKEYYKD DQIFFDYQLR KGVSDTKNAI YLMKMAGVNV EDISPQ
|
| |