Gene Cthe_0777 details

Gene Information

Locus tag: Cthe_0777
Symbol:
ID: 4810395
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 939329
End bp: 941941
Gene Length: 2613 bp
Protein Length: 870 aa
Translation table: 11
GC content: 41%
IMG OID: 640106194
Product: DNA mismatch repair protein MutS
Protein accession: YP_001037205
Protein GI: 125973295
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
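
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The record does not state either convention, but both are consistent with the listed numbers. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 939329
END_BP = 941941
GENE_LENGTH_BP = 2613
PROTEIN_LENGTH_AA = 870


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


gene_len = span_length(START_BP, END_BP)
prot_len = protein_length(gene_len)

# Compare the derived values with the ones listed in the record.
print(gene_len == GENE_LENGTH_BP, prot_len == PROTEIN_LENGTH_AA)
```

Under these assumptions the derived gene length (2613 bp) and protein length (870 aa) agree with the record.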


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.235285
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTTCAT TGACACCTAT GATGCAACAG TATCTTGAAA TAAAGGAGCA GTACAAGGAC 
TGTATCCTTT TTTTCAGATT GGGAGACTTT TATGAAATGT TTTTCTCTGA CGCGGAAGTA
GCTTCAAGGG AACTGGAAAT AACGCTTACG GGAAAAGATT GCGGCCTTGA AGAGAGAGCT
CCCATGTGCG GAGTGCCTTT TCATTCAGCG GATTCATATA TTGCGAAACT GATTAGCAAA
GGTTATAAAG TTGCGATATG TGAGCAGATT GAAGATCCTG CGCTGGCAAA AGGATTGGTC
AAGAGGGACG TAATCAGGAT TGTTACTCCC GGTACAGTGA CCGATTCCGC TATGTTGGAT
GAAAAAAAGA ATAACTATCT TATGTCGATT TATAAAAACA AAAACTATTA CGGTATTGCA
TGTGTTGATC TTACCACAGG AGAGTTTTTG TCGACACACA TTACCTTCGG CAACACTTTC
AACAAATTAA TGGATGAAAT AGCGAAATTC TCACCATCGG AAATTGTTGT AAACGGTGAG
TTTTTTCATG ACGAAAACAT CAAAAAGACT TTAAAACAAA GGTTTGATGT TTATATCTCC
GGTTTGGAGG ACAAATTTTT TGAAAAGGAG TTTTCCATAC AGAAGGTAAG AAATTATTTT
AAGGATTATG TGATTGAAGA AAATGCGTTT GATTTATACA TAAATGCTTC CGGAGCTCTT
TTGGAATATT TGGAGCAGAC GCAGAAAGTG AATTTAAGCC ATATTCAGAA TTTTAACGTC
TACAGGATTG AAGAGTACAT GATTCTGGAC ATGGCCACCA GAAGAAACCT TGAACTTACG
GAGACCATGA GGGAAAAGAA CAGAAAAGGC TCCCTTCTTT GGGTTTTGGA CAGAACCATG
ACCTCCATGG GCGGAAGAAC CCTGAGAAAA TGGATTGAAC AGCCCCTTAT AAACCTCCAT
GATATAAAGG ACAGGCTTGA TGCGGTAAAC GAGTTTAAAG AAAGGTTTAT GATAAGAAGC
GAAGTACGGG AGCTTTTAAG AGCGGTTTAT GACATTGAGA GGCTGATGAC AAAGGTTATT
CTGGGAAGTG CCAATTGCCG CGATTTGATA TCGATAAAAC ACTCTATCGG ACAGGTTCCG
TATATAAAGG AGCTTCTTCG TGATTTAAAA GCCGATCTTA ATGTTTTAAG CTACAATGAA
TTGGATACAC TAACTGATGT GTATGAAATT ATTGACAAGG CTATAGTCGA TGACCCGCCT
GTTGCGGTTA AAGAAGGCGG AATCATAAAA GAAGGTTTTA ATGAGGAGGT GGACAGGCTA
AGAAGCGCAT CAAAAGACGG AAAGAAGTGG ATTGCACATT TAGAGAGCAA AGAGAGGGAA
AGAACCGGTA TTAAAAATTT GAAAGTGGGA TTTAACAAGG TTTTCGGCTA TTACATTGAA
GTGACCAAAT CCTACTACTC ACAAGTACCG GATGATTATA TAAGGAAACA GACCTTGGCG
AACTGCGAAA GGTACATAAC ACCGGAGCTG AAGGAAATTG AAAATACCGT ATTGGGTGCG
GAGGACAGGC TTGTAGAGCT TGAATATCAG ATATTTGTGG ATGTGAGGAA CAAGGTTGCA
AAAGAAATAA ACAGATTAAA GACCACGGCA AGAAGTCTTG CCAGAATCGA TGTTTTATGC
TCACTGGCGG AAGTGGCGGA CAGGGAATCC TATACTATGC CGGAAGTGAC CGATGACGAC
AAAATTGAAA TTAAAGACGG AAGGCATCCT GTGGTAGAGA AGATAATCGG GCAGGAAGCC
TTTGTTCCCA ATGATACCTA TCTTGACATG GATGAAAACC AAATTGCTAT AATTACCGGG
CCTAATATGG CGGGTAAGTC GACATATATG CGGCAGGTGG CGTTGATTGT GCTTATGGCC
CAAATAGGAA GCTTTGTTCC CGCCAAAAGT GCCAAAATAG GCATAGTTGA CAGGATATTT
ACAAGGGTAG GCGCTTCGGA TGACCTGGCA GCAGGCCAGA GTACCTTTAT GGTGGAAATG
TCCGAAGTTG CGAATATCCT TGGCAATGCC ACGTCAAAAA GCCTGTTGGT ATTGGATGAG
ATAGGCAGGG GTACAAGCAC CTATGACGGA CTGAGTATCG CATGGGCTGT AATTGAGTAT
ATCGGTGAGA AGATAGGGGC AAGAACGCTC TTTGCCACCC ATTACCATGA ACTTACCGAA
CTGGAAGAAA GAATTGAAGG AATAAAGAAT TATTGCATAT CCGTGGAGGA AAAGGGAGAA
GATATAATTT TCTTAAGAAA AATACTCAGG GGCGGAGCCG ACAACAGCTA CGGCGTGCAG
GTTGCAAGGC TTGCAGGCAT ACCGGATCCT GTTATTCACA GGGCAAAAGA AATATTAAAA
AAATTGGAAG ATGCGGACAT AACCAGGAAG GAAAAACGTA TTACAAGGAG AAAACAGCCC
ATTGAGGGGC AAATTGACGT ATTTACGTTT AACGCCGCCC AGAGAAGTTA TGATGAGGTT
CTGAATGAAT TAAAGAGTCT TGATATTACG ACTTTGACAC CGCTTGATGC AATAAATGTA
TTATACAATC TTCAAAAAAA GGTAAAAGGG TAG
 
Protein sequence
MASLTPMMQQ YLEIKEQYKD CILFFRLGDF YEMFFSDAEV ASRELEITLT GKDCGLEERA 
PMCGVPFHSA DSYIAKLISK GYKVAICEQI EDPALAKGLV KRDVIRIVTP GTVTDSAMLD
EKKNNYLMSI YKNKNYYGIA CVDLTTGEFL STHITFGNTF NKLMDEIAKF SPSEIVVNGE
FFHDENIKKT LKQRFDVYIS GLEDKFFEKE FSIQKVRNYF KDYVIEENAF DLYINASGAL
LEYLEQTQKV NLSHIQNFNV YRIEEYMILD MATRRNLELT ETMREKNRKG SLLWVLDRTM
TSMGGRTLRK WIEQPLINLH DIKDRLDAVN EFKERFMIRS EVRELLRAVY DIERLMTKVI
LGSANCRDLI SIKHSIGQVP YIKELLRDLK ADLNVLSYNE LDTLTDVYEI IDKAIVDDPP
VAVKEGGIIK EGFNEEVDRL RSASKDGKKW IAHLESKERE RTGIKNLKVG FNKVFGYYIE
VTKSYYSQVP DDYIRKQTLA NCERYITPEL KEIENTVLGA EDRLVELEYQ IFVDVRNKVA
KEINRLKTTA RSLARIDVLC SLAEVADRES YTMPEVTDDD KIEIKDGRHP VVEKIIGQEA
FVPNDTYLDM DENQIAIITG PNMAGKSTYM RQVALIVLMA QIGSFVPAKS AKIGIVDRIF
TRVGASDDLA AGQSTFMVEM SEVANILGNA TSKSLLVLDE IGRGTSTYDG LSIAWAVIEY
IGEKIGARTL FATHYHELTE LEERIEGIKN YCISVEEKGE DIIFLRKILR GGADNSYGVQ
VARLAGIPDP VIHRAKEILK KLEDADITRK EKRITRRKQP IEGQIDVFTF NAAQRSYDEV
LNELKSLDIT TLTPLDAINV LYNLQKKVKG
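
The relationship between the two sequences can be illustrated with a short translation sketch. Translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in which codons may act as initiators), so for this gene, which starts with ATG, a standard codon lookup is assumed to be sufficient. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical and not part of any database tool. Only the first line of the gene sequence is used here to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGGCTTCAT TGACACCTAT GATGCAACAG TATCTTGAAA TAAAGGAGCA GTACAAGGAC"
print(translate(first_line))  # MASLTPMMQQYLEIKEQYKD
```

The 20 residues produced from the first 60 bases match the start of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the listed GC content.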