Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1014 |
Symbol | |
ID | 4811308 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1212794 |
End bp | 1215175 |
Gene Length | 2382 bp |
Protein Length | 793 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106432 |
Product | MutS2 family protein |
Protein accession | YP_001037439 |
Protein GI | 125973529 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0625051 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGAAA AAACACTTAA AATATTGGAG TTTAATAAAA TAATTGACAA ACTGGTAAGC CTTGCAACAT CTTCTTTGGG AAAAGAACTG GCGGAAAAGC TGGTGCCAGA TACTGATCTT AACAGGGTTG AAAGGGCACA AAAGGAAACC AGTGACGCCG TCGCTTTTAT TGCAAGAAGG GGAACCCCGC CCATGGGGGG AATTCATGAT ATACGGGACA GTTTAAAAAG AGTTGAAATC GGAGCGATTC TAAACCCTGG AGAGCTTTTA AAAACTGCCG ACGTTTTAAG GGCTGTAAGA AATCTTAAAA GCTATGCGAG CAATGACAGA ATCAAGACCG ATGAAGACAA TATTGTAAGT GAGCTTATAG GATGCCTTGA ATCCAATAAG CGGATTGAAG ACAGGATTTA CATGTCAATA CTAAGTGAGG ATGAAATAGC TGACAATGCA AGCCCGACCC TTGCCAACAT AAGAAGGCAG ATACGAAATG CCCAGGAATC AATAAAAGAT AAGCTCAATG ATATCATAAG GTCGTCAAGA TATCAGAAAT ATATACAGGA GCCCATAGTT ACTTTAAGAG GAGACAGATA TGTAATACCT GTCAAGCAGG AGTACAGAAC CGAAATACCC GGACTTATAC ATGATTCATC CGCCAGCGGG GCGACCATTT TTATTGAGCC TATGGCGGTT GTGGAGGCCA ACAACCACAT ACGGGAGCTT AAAATTAAAG AGCAGGCTGA AATTGAGAAA ATACTGGGGG AATTGACCGG GGAGATAAGA GGAATTGTTG ATTCCTTAAA GTCGAATGTT TCAATTTTAG GCCGTTTGGA TTTCATATTT GCCAAGGCAA GGCTCAGCCT TGATTATAAC TGTGTTTGCC CTGTACTTAA CGATGAACAT AAAATATTAA TAAAAAAGGG AAGACATCCT CTTTTAGACA AAAAAACCGT TGTTCCCATC GATTTTTGGA TCGGGGAAGA CTTCAACACC CTTGTAGTGA CCGGACCCAA TACCGGAGGT AAAACGGTTA CTTTGAAAAC TGTGGGCCTG TTTACCCTTA TGACGCAGGC AGGGCTTCAT ATTCCGGCAA ATGAGGGAAC CAAAATGAGT ATTTTCAAAA AAGTCTATGC CGACATAGGG GATGAGCAGA GTATCGAACA GAGCCTTAGT ACTTTTTCTT CGCATATGAA GAATATAGTT GGAATATTAA AGGATGTGGA TGAAGATTCC CTTGTTCTGT TTGATGAGCT TGGAGCGGGA ACAGACCCTA CCGAGGGTGC CGCCCTTGCA ATGTCAATAC TTGAGTATTT AAGAAACAAG GGCAGTACAA CGGTTGCCAC CACCCATTAC AGCCAGCTGA AAGCGTATGC CGTTACCACA AAATTTGTGG AAAATGCCTG CTGCGAGTTT AATGTGGAGA CACTAAGGCC CACTTACAGG CTATTGATTG GAGTTCCCGG AAAAAGCAAC GCCTTTGCAA TATCAAAAAG GCTGGGGCTT TTTGATGACA TTATTGAGAA GGCCAAAGAA TTTTTAACCC AGGACGATAT AAAGTTTGAA GACATGCTTA TGTCGATTGA GAAAAACTTA AATCAGTCCG AAAATGAAAA AATGAAAGCT GAAAGCTATC GACTCGAAGC CGAAAAGCTA AAAAAAGAGC TGGAGGAGCA AAAAAGAAAG CTTGCTGAAA ATCGGGAAAG ATTAATACAG GAAGCAAGAG CTGAAGCGAG AAAAATTCTT CTTGAAGCAA GAAAAGAGGC GGAAGAGATT ATTTCTAAAA TGAGGAGGCT TGAACAGGAA GTCCATAACG CGCAGAGGCA AAAGGAGGCG GAAGAGCTTA GGCTCAAGCT TAAAAGAAAG GTTGATTCCA TTGAGGAAAC ACTGGAATTG CCCCTTGCTC CGAAAAACGC TTTGGTAAAA CCCCCGGAGA ATTTAAAGCC CGGTGACAGT GTTCTAATCG TCAATTTGGA CCAGAAAGGA ACGGTTATCA CTCCTCCGGA CAAGGACGGA GAAGTGGTGG TTCAGGCCGG AATTATGAAA ATAAACGTTC ATATATCAAA TTTAAAACTG GTGGACGAAC AAAAAATTGT GTTAAACAAT TCCGGAATTG GCAAAATAGG TATGTCAAAA GCAAAAAGCA TATCAACTGA AATTGATGTA AGGGGATACA ACTTGGAAGA GGCCATTGAA AGTGTCGACA AGTATTTGGA TGATGCTTAT CTTTCCGGGC TTACGGAGGT ATCTATTATT CACGGCAAGG GAACCGGAGT ACTCAGAAGT GGCATACAGA AATTTTTAAA ATCAGATTCC AGGGTTAAAT CTTTCAGGCT TGGAAAGTAC GGAGAAGGTG AATCGGGAGT TACAATAGTC GAACTTAGGT GA
|
Protein sequence | MNEKTLKILE FNKIIDKLVS LATSSLGKEL AEKLVPDTDL NRVERAQKET SDAVAFIARR GTPPMGGIHD IRDSLKRVEI GAILNPGELL KTADVLRAVR NLKSYASNDR IKTDEDNIVS ELIGCLESNK RIEDRIYMSI LSEDEIADNA SPTLANIRRQ IRNAQESIKD KLNDIIRSSR YQKYIQEPIV TLRGDRYVIP VKQEYRTEIP GLIHDSSASG ATIFIEPMAV VEANNHIREL KIKEQAEIEK ILGELTGEIR GIVDSLKSNV SILGRLDFIF AKARLSLDYN CVCPVLNDEH KILIKKGRHP LLDKKTVVPI DFWIGEDFNT LVVTGPNTGG KTVTLKTVGL FTLMTQAGLH IPANEGTKMS IFKKVYADIG DEQSIEQSLS TFSSHMKNIV GILKDVDEDS LVLFDELGAG TDPTEGAALA MSILEYLRNK GSTTVATTHY SQLKAYAVTT KFVENACCEF NVETLRPTYR LLIGVPGKSN AFAISKRLGL FDDIIEKAKE FLTQDDIKFE DMLMSIEKNL NQSENEKMKA ESYRLEAEKL KKELEEQKRK LAENRERLIQ EARAEARKIL LEARKEAEEI ISKMRRLEQE VHNAQRQKEA EELRLKLKRK VDSIEETLEL PLAPKNALVK PPENLKPGDS VLIVNLDQKG TVITPPDKDG EVVVQAGIMK INVHISNLKL VDEQKIVLNN SGIGKIGMSK AKSISTEIDV RGYNLEEAIE SVDKYLDDAY LSGLTEVSII HGKGTGVLRS GIQKFLKSDS RVKSFRLGKY GEGESGVTIV ELR
|
| |