Gene Information |
Locus tag | Cthe_0247 |
Symbol | |
ID | 4808595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 300315 |
End bp | 302120 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640105659 |
Product | DNA mismatch repair protein MutS-like protein |
Protein accession | YP_001036679 |
Protein GI | 125972769 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
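The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is what the stated gene length implies) and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 300315
end_bp = 302120
gene_length_bp = 1806
protein_length_aa = 601

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: a single unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = span // 3
residues = codons - 1

print(span, span % 3, residues)  # 1806 0 601
```

Under these assumptions the span matches the listed gene length, is a whole number of codons, and yields the listed protein length.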
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATACAA AGCAAATGTA TTTAAAGCGT ATGCGCCAAT ATAGTGATAA TGCGGAAAAG CTTGAAAGAC GTTCGGGAAT GTACAGCACA ATAAGGCTTG TAACATTTGC AGCAGGTACA TTCTTTACGG TTCTTGCTTT TATGTATTTA AGCAATGTCC ATGGGTTTAT TAGTATGGGA GTATTTTTTG TACTGTTTCT CTTTTTGGTT GCGAAGCATC AAAAAGTAAT CGATGAAACT TTAAAATATC GCACTCTTGC GGAAATAAAC AAAAGATGCG TAGCAAGAAT GGAAGGCACG TGGACAGAGT TTGAGGACAA GGGAGAGGAA TATGCAGACC CAAATCATAT GTATTCCAAA GACCTTGATG TTTTTGGGCA TGGTTCTCTT TTTCAGTGGA TCAATACGAC AAATACATTT TTTGGCAGGG AAAGGCTAAG GCGTTTGCTG GAATTCCCTG AAAAGGAGGC CGGACAAATC AGAAAGAGGC AGAATGCGGT AAAGGAGCTT TCAAAGAAAA TTGACTTTTG CCAAAGTTTG CAATGTGAAG GAATGATGGC TCAAAATGCT TCGAAAAATC CTGACAAGCT ACTTGACTTT TGTGAGGACG GCACCAAACT TTTTCGTAAC AAGCTTGCAG AACAGTTGTT TTATATTCTT CCGGAGCTTA CAATAGTTTT TCTGGTTATA TGCTGGCTTG ATTCTTCGGT TTCGCTGTAC ATACCTTTCT TTCTTCTTGG TGTGCAGGCT GTCATAAATA TTGTGTTGCA TGGCAGGGTA AGCAGTATTT TAGGTCCGGT AAGCAGGTAC AAGAATGAGA TAAAGGTATT TTATAAAATG ATTGAACTGA TTGAGAAGAA AGAGTTTGAG GATGAATACC TGATAGAATT AAAATCCCGG CTGTTTGACA AGAAGAAACC GGCATCGGAG CAGATAAAGG ATTTGGAGAA AATTGTAGAG GCCACTGACA TCGGGAAGGG TTATATAGTT GAGATTTTAT TGAATTTTTT CTTGTTCTGG AATATTCACT GTGTGTTTGC ATTGGAAAGA TGGAAAGCAA AGGCCGGTAA GGCAATACGA ATCTGGCTTG AGACTATAGG TGATTTTGAA GCTTTGGCAA GTTTGGCTCT TGTTGCGCAG ATGAACCCCG AATGGGCTTT CCCTGAAATA TCGGACCGGA AGGTTTGCTT TAATGCCGTT GACATGGGAC ATCCTTTGAT AAATGAGGGT AAACGTGTGT GCAACAGTAT CAATATGGAC AATAAAATCT GTATTGTTAC AGGGTCGAAT ATGTCCGGGA AAACCACGCT TTTGAGGACT GTGGGGGTAA ATCTTCTGCT TGCTTATGCG GGAACCGCCG TTTGTGCCAA AAAAATGACA TGCTCCGTAA TGGATATATG CACTTCAATG AGGGTTGTAG ATGATTTGAA CGAGGGAATA TCCACTTTTT ATGCGGAGCT TTTAAGAATA AAGATGATAA TAGACCACTC AAGAATGAAA AAGCCGATGA TATTCCTTAT TGACGAAGTG TTCCGGGGAA CCAATTCCCT TGACAGGGTC ACAGGTGCAC GAAATGTTTT GCTGAATCTT GACAAAGACT GGGTTATCGG AATGATTTCA ACCCATGATT TTGAACTCTG CAATTTGGAA AAAGGACGCG AAGGAAGGAT TGTCAACTAC CACTTTGCAG AAACCTACAC CAACAACGAA ATAAAATTTG ATTATATTTT AAGGCGGGGA CAGTGTAAAA AAAGCAATGC AAGATATTTG ATGAAAATGG TGGGAATAGA GCTTTTGGAT 
GAGTGA
|
Protein sequence | MNTKQMYLKR MRQYSDNAEK LERRSGMYST IRLVTFAAGT FFTVLAFMYL SNVHGFISMG VFFVLFLFLV AKHQKVIDET LKYRTLAEIN KRCVARMEGT WTEFEDKGEE YADPNHMYSK DLDVFGHGSL FQWINTTNTF FGRERLRRLL EFPEKEAGQI RKRQNAVKEL SKKIDFCQSL QCEGMMAQNA SKNPDKLLDF CEDGTKLFRN KLAEQLFYIL PELTIVFLVI CWLDSSVSLY IPFFLLGVQA VINIVLHGRV SSILGPVSRY KNEIKVFYKM IELIEKKEFE DEYLIELKSR LFDKKKPASE QIKDLEKIVE ATDIGKGYIV EILLNFFLFW NIHCVFALER WKAKAGKAIR IWLETIGDFE ALASLALVAQ MNPEWAFPEI SDRKVCFNAV DMGHPLINEG KRVCNSINMD NKICIVTGSN MSGKTTLLRT VGVNLLLAYA GTAVCAKKMT CSVMDICTSM RVVDDLNEGI STFYAELLRI KMIIDHSRMK KPMIFLIDEV FRGTNSLDRV TGARNVLLNL DKDWVIGMIS THDFELCNLE KGREGRIVNY HFAETYTNNE IKFDYILRRG QCKKSNARYL MKMVGIELLD E
|
| |
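The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below builds the codon table from the conventional TCAG ordering; it assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. The function name `translate` and the short test fragments are illustrative choices; only the first 21 and last 6 nucleotides of the gene sequence are used.

```python
from itertools import product

# Codon table in the conventional TCAG order ('*' marks a stop codon).
# Assumption: table 11 shares these codon-to-amino-acid assignments
# with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 21 nucleotides and last 6 nucleotides of the gene sequence above.
first = translate("ATGAATACAA AGCAAATGTA T")
last = translate("GAGTGA")
print(first, last)  # MNTKQMY E*
```

The first seven residues agree with the start of the listed protein sequence, and the final two codons give the terminal glutamate followed by a stop.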