Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1470 |
Symbol | |
ID | 7310239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1782650 |
End bp | 1785028 |
Gene Length | 2379 bp |
Protein Length | 792 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643608396 |
Product | MutS2 family protein |
Protein accession | YP_002505804 |
Protein GI | 220928895 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.471316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGAAA AAACTCACAG AGTACTTGAA TTTGATAAAA TTCTCGATAA ATTAAAAGGC TTAACCGCAT CTGAATTGGG GAGGGAGCTT GTTTTAGAGC TAACTCCTCA AACTGATTAC AGAGTGGTTG AAAAAATGCT GTCAGAAACA AATGACGGTG TAAGTTGTAT AATGAGAAGG GGTTCACCAC CTCTTGGAGG AATTACAGAT ATCAGGATGA GCCTAAAAAG GCTTGATATG GGAGGCGTAC TGAATCCCGG AGAATTACTG CGGCTGGCAG GAGTTTTAAG AGCCGCCAGA AGGTTAAAGG GCTATATCAA TGATAAACTG GACGAAAATA ATGCCAGTGT GGTTAAGGAA CTTATATCCT GTCTTGAATC AAATCAGAGG TTGGAACAGA AAATAGATAA CTGCATACTG AGCGAAGATG AGATTGCTGA TAATGCAAGT CCTGCACTAA GCAGCATCAG ACGTCAGATT AAGGAGCAGC AGGCGTCTAT TAAGGATAAG CTGAATTCTA TTATTCGTTC CACAAAATAT CAGAAATATA TTCAGGAATC AGTTGTAACC ATGAGGGGTG ACAGGTATGT AATCCCGGTA AAGCAGGAGC ATAAGGGAGA TATACCCGGA TTGGTTCATG ATTCTTCTGC CAGCGGAGCC ACCTTGTTTA TTGAGCCAAT GGCAGTTGTT GAAGCAAACA ACAGCATAAA ACAGCTCAGA GTGAAGGAAC AAACAGAGAT AGACAGAATT CTTGCCGAGC TTTCTCAGGA TGCTTCTCTA GTATTACCCC AATTGAATGC TAACATGAGT ATAATGGCAA GACTTGATTT TATTTTTGCA AAGTCAAAAC TGGCAATAGA CTATAACTGT ATATGTCCTA AAATTAATGA TACAGGCAAA ATAATTATTA AAAAGGGGAG GCATCCTCTT CTTGACCCTA AAATTGTAGT TCCTATTGAT TTCTGGATTG GTGAAAAATT CAGTTCATTG ATTGTTACAG GACCCAATAC AGGGGGTAAG ACCGTTTCAC TTAAAACAGT AGGACTATTT ACTCTTATGA TGCAGTCAGG GCTTCTGGTT CCCGCAAATG ACGGAACAGA GATGAGCGTA TTTGAAAAAA TCTATGCAGA CATTGGCGAC GAGCAGAGTA TTGAACAAAG TCTTAGTACG TTTTCTTCTC ATATGAAGAA TATTGTGGAC ATTCTAAGTG GTGTCAACAA TAAGTCACTG ATACTTCTGG ATGAATTGGG AGCGGGGACA GACCCTACCG AAGGGGCGGC TCTTGCTATG TCCATTCTTG AATGTCTGCA CCAGATGGGA GCTACTACCC TTGCTACAAC CCACTATAGT GAACTCAAGG TTTATGCTAT TTCAACAACG GGAGTTGAAA ATGCCTCCTG TGAATTTGAT GTTGAAACCC TTAGGCCTAC CTACCGACTG CTTATAGGCG TACCGGGAAA GAGTAATGCA TTTGCCATTT CTAAAAGACT TGGCCTGACA GATGACATAA TAGAAAGGTC TAAGGAATTC TTGTCACAAG AGGACATCAG GTTTGAGGAC ATATTATTAA GTATTGAAAA GAACCGTAGT GAGGCCGAAA AAGAGAAAAT GCGTGCTGAA AGCTATCGTC AGGAAGCGGA GAGGCTTAAA AAGGACCTGG AAGATCAGAA GCGAAGATTG GCCGCTCAGA AGGAAAGCGA GCTTCGCAAG GCCCGTGAGG AGGCACGCCG TATTCTTACC GATTCAAAGC GTCAGGCCGA TGAACTTGTT TCTGAAATGA AAAGACTGGC AAAGGAACAG GAGGAAGCCG AAGTTCGAAG GCAAACAGAA GAGCTCCGTC AAAAACTTAA TAAGAGTATA AATAATCTGG ATGATTCGCT GGTTGAATCA ATTATGCCCA GACAAGGACT GGTTAAACCA CCCAAAAACC TCAAGCCGGG TGATACTGTA TTGATAGTAA ACCTCAACCA GAAGGGAACG GTTTTAACAC TCCCGGACAA GAATGGTGAA GCACAGGTTC AGGCAGGAAT TATGAAAATA AATGTGCATA TTTCAAACCT CAAACTGGTA GATGAGCAGA AACAGCAGAT TCAAAGAACC GGAATGGGAA AAATAGGTGT TTCTAAAGCA CAGAATATGT CAACTGAAAT TGATCTCCGT GGAATGATGC TCAGCGAAGC TGTTGACGTT GTCGACAAGT ATCTTGACGA TGCAAGTATA GCAGGTATGG GGGGAGTAAC ACTTATTCAC GGAAAAGGAA CAGGAGCACT GCGGGCTGGG TTGCATCAGC ATTTGAAGCA TAATCCTCAT ATAAAAAGCT TCAGACTTGG TAAGCTGGGT GAGGGCGAAA ACGGGGTTAC AGTTGTTGAG CTTAAATAG
|
Protein sequence | MNEKTHRVLE FDKILDKLKG LTASELGREL VLELTPQTDY RVVEKMLSET NDGVSCIMRR GSPPLGGITD IRMSLKRLDM GGVLNPGELL RLAGVLRAAR RLKGYINDKL DENNASVVKE LISCLESNQR LEQKIDNCIL SEDEIADNAS PALSSIRRQI KEQQASIKDK LNSIIRSTKY QKYIQESVVT MRGDRYVIPV KQEHKGDIPG LVHDSSASGA TLFIEPMAVV EANNSIKQLR VKEQTEIDRI LAELSQDASL VLPQLNANMS IMARLDFIFA KSKLAIDYNC ICPKINDTGK IIIKKGRHPL LDPKIVVPID FWIGEKFSSL IVTGPNTGGK TVSLKTVGLF TLMMQSGLLV PANDGTEMSV FEKIYADIGD EQSIEQSLST FSSHMKNIVD ILSGVNNKSL ILLDELGAGT DPTEGAALAM SILECLHQMG ATTLATTHYS ELKVYAISTT GVENASCEFD VETLRPTYRL LIGVPGKSNA FAISKRLGLT DDIIERSKEF LSQEDIRFED ILLSIEKNRS EAEKEKMRAE SYRQEAERLK KDLEDQKRRL AAQKESELRK AREEARRILT DSKRQADELV SEMKRLAKEQ EEAEVRRQTE ELRQKLNKSI NNLDDSLVES IMPRQGLVKP PKNLKPGDTV LIVNLNQKGT VLTLPDKNGE AQVQAGIMKI NVHISNLKLV DEQKQQIQRT GMGKIGVSKA QNMSTEIDLR GMMLSEAVDV VDKYLDDASI AGMGGVTLIH GKGTGALRAG LHQHLKHNPH IKSFRLGKLG EGENGVTVVE LK
|
| |