Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0777 |
Symbol | |
ID | 4810395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 939329 |
End bp | 941941 |
Gene Length | 2613 bp |
Protein Length | 870 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106194 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001037205 |
Protein GI | 125973295 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.235285 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTCAT TGACACCTAT GATGCAACAG TATCTTGAAA TAAAGGAGCA GTACAAGGAC TGTATCCTTT TTTTCAGATT GGGAGACTTT TATGAAATGT TTTTCTCTGA CGCGGAAGTA GCTTCAAGGG AACTGGAAAT AACGCTTACG GGAAAAGATT GCGGCCTTGA AGAGAGAGCT CCCATGTGCG GAGTGCCTTT TCATTCAGCG GATTCATATA TTGCGAAACT GATTAGCAAA GGTTATAAAG TTGCGATATG TGAGCAGATT GAAGATCCTG CGCTGGCAAA AGGATTGGTC AAGAGGGACG TAATCAGGAT TGTTACTCCC GGTACAGTGA CCGATTCCGC TATGTTGGAT GAAAAAAAGA ATAACTATCT TATGTCGATT TATAAAAACA AAAACTATTA CGGTATTGCA TGTGTTGATC TTACCACAGG AGAGTTTTTG TCGACACACA TTACCTTCGG CAACACTTTC AACAAATTAA TGGATGAAAT AGCGAAATTC TCACCATCGG AAATTGTTGT AAACGGTGAG TTTTTTCATG ACGAAAACAT CAAAAAGACT TTAAAACAAA GGTTTGATGT TTATATCTCC GGTTTGGAGG ACAAATTTTT TGAAAAGGAG TTTTCCATAC AGAAGGTAAG AAATTATTTT AAGGATTATG TGATTGAAGA AAATGCGTTT GATTTATACA TAAATGCTTC CGGAGCTCTT TTGGAATATT TGGAGCAGAC GCAGAAAGTG AATTTAAGCC ATATTCAGAA TTTTAACGTC TACAGGATTG AAGAGTACAT GATTCTGGAC ATGGCCACCA GAAGAAACCT TGAACTTACG GAGACCATGA GGGAAAAGAA CAGAAAAGGC TCCCTTCTTT GGGTTTTGGA CAGAACCATG ACCTCCATGG GCGGAAGAAC CCTGAGAAAA TGGATTGAAC AGCCCCTTAT AAACCTCCAT GATATAAAGG ACAGGCTTGA TGCGGTAAAC GAGTTTAAAG AAAGGTTTAT GATAAGAAGC GAAGTACGGG AGCTTTTAAG AGCGGTTTAT GACATTGAGA GGCTGATGAC AAAGGTTATT CTGGGAAGTG CCAATTGCCG CGATTTGATA TCGATAAAAC ACTCTATCGG ACAGGTTCCG TATATAAAGG AGCTTCTTCG TGATTTAAAA GCCGATCTTA ATGTTTTAAG CTACAATGAA TTGGATACAC TAACTGATGT GTATGAAATT ATTGACAAGG CTATAGTCGA TGACCCGCCT GTTGCGGTTA AAGAAGGCGG AATCATAAAA GAAGGTTTTA ATGAGGAGGT GGACAGGCTA AGAAGCGCAT CAAAAGACGG AAAGAAGTGG ATTGCACATT TAGAGAGCAA AGAGAGGGAA AGAACCGGTA TTAAAAATTT GAAAGTGGGA TTTAACAAGG TTTTCGGCTA TTACATTGAA GTGACCAAAT CCTACTACTC ACAAGTACCG GATGATTATA TAAGGAAACA GACCTTGGCG AACTGCGAAA GGTACATAAC ACCGGAGCTG AAGGAAATTG AAAATACCGT ATTGGGTGCG GAGGACAGGC TTGTAGAGCT TGAATATCAG ATATTTGTGG ATGTGAGGAA CAAGGTTGCA AAAGAAATAA ACAGATTAAA GACCACGGCA AGAAGTCTTG CCAGAATCGA TGTTTTATGC TCACTGGCGG AAGTGGCGGA CAGGGAATCC TATACTATGC CGGAAGTGAC CGATGACGAC AAAATTGAAA TTAAAGACGG AAGGCATCCT GTGGTAGAGA AGATAATCGG GCAGGAAGCC TTTGTTCCCA ATGATACCTA TCTTGACATG GATGAAAACC AAATTGCTAT AATTACCGGG CCTAATATGG CGGGTAAGTC GACATATATG CGGCAGGTGG CGTTGATTGT GCTTATGGCC CAAATAGGAA GCTTTGTTCC CGCCAAAAGT GCCAAAATAG GCATAGTTGA CAGGATATTT ACAAGGGTAG GCGCTTCGGA TGACCTGGCA GCAGGCCAGA GTACCTTTAT GGTGGAAATG TCCGAAGTTG CGAATATCCT TGGCAATGCC ACGTCAAAAA GCCTGTTGGT ATTGGATGAG ATAGGCAGGG GTACAAGCAC CTATGACGGA CTGAGTATCG CATGGGCTGT AATTGAGTAT ATCGGTGAGA AGATAGGGGC AAGAACGCTC TTTGCCACCC ATTACCATGA ACTTACCGAA CTGGAAGAAA GAATTGAAGG AATAAAGAAT TATTGCATAT CCGTGGAGGA AAAGGGAGAA GATATAATTT TCTTAAGAAA AATACTCAGG GGCGGAGCCG ACAACAGCTA CGGCGTGCAG GTTGCAAGGC TTGCAGGCAT ACCGGATCCT GTTATTCACA GGGCAAAAGA AATATTAAAA AAATTGGAAG ATGCGGACAT AACCAGGAAG GAAAAACGTA TTACAAGGAG AAAACAGCCC ATTGAGGGGC AAATTGACGT ATTTACGTTT AACGCCGCCC AGAGAAGTTA TGATGAGGTT CTGAATGAAT TAAAGAGTCT TGATATTACG ACTTTGACAC CGCTTGATGC AATAAATGTA TTATACAATC TTCAAAAAAA GGTAAAAGGG TAG
|
Protein sequence | MASLTPMMQQ YLEIKEQYKD CILFFRLGDF YEMFFSDAEV ASRELEITLT GKDCGLEERA PMCGVPFHSA DSYIAKLISK GYKVAICEQI EDPALAKGLV KRDVIRIVTP GTVTDSAMLD EKKNNYLMSI YKNKNYYGIA CVDLTTGEFL STHITFGNTF NKLMDEIAKF SPSEIVVNGE FFHDENIKKT LKQRFDVYIS GLEDKFFEKE FSIQKVRNYF KDYVIEENAF DLYINASGAL LEYLEQTQKV NLSHIQNFNV YRIEEYMILD MATRRNLELT ETMREKNRKG SLLWVLDRTM TSMGGRTLRK WIEQPLINLH DIKDRLDAVN EFKERFMIRS EVRELLRAVY DIERLMTKVI LGSANCRDLI SIKHSIGQVP YIKELLRDLK ADLNVLSYNE LDTLTDVYEI IDKAIVDDPP VAVKEGGIIK EGFNEEVDRL RSASKDGKKW IAHLESKERE RTGIKNLKVG FNKVFGYYIE VTKSYYSQVP DDYIRKQTLA NCERYITPEL KEIENTVLGA EDRLVELEYQ IFVDVRNKVA KEINRLKTTA RSLARIDVLC SLAEVADRES YTMPEVTDDD KIEIKDGRHP VVEKIIGQEA FVPNDTYLDM DENQIAIITG PNMAGKSTYM RQVALIVLMA QIGSFVPAKS AKIGIVDRIF TRVGASDDLA AGQSTFMVEM SEVANILGNA TSKSLLVLDE IGRGTSTYDG LSIAWAVIEY IGEKIGARTL FATHYHELTE LEERIEGIKN YCISVEEKGE DIIFLRKILR GGADNSYGVQ VARLAGIPDP VIHRAKEILK KLEDADITRK EKRITRRKQP IEGQIDVFTF NAAQRSYDEV LNELKSLDIT TLTPLDAINV LYNLQKKVKG
|
| |