Gene Cthe_0247 details

Gene Information

Locus tag: Cthe_0247
Symbol:
ID: 4808595
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 300315
End bp: 302120
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 11
GC content: 40%
IMG OID: 640105659
Product: DNA mismatch repair protein MutS-like protein
Protein accession: YP_001036679
Protein GI: 125972769
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID:
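
The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 300315
END_BP = 302120
GENE_LENGTH_BP = 1806
PROTEIN_LENGTH_AA = 601


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 302120 - 300315 + 1 gives 1806 bp, and 1806 / 3 - 1 gives 601 aa, matching the listed lengths.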


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATACAA AGCAAATGTA TTTAAAGCGT ATGCGCCAAT ATAGTGATAA TGCGGAAAAG 
CTTGAAAGAC GTTCGGGAAT GTACAGCACA ATAAGGCTTG TAACATTTGC AGCAGGTACA
TTCTTTACGG TTCTTGCTTT TATGTATTTA AGCAATGTCC ATGGGTTTAT TAGTATGGGA
GTATTTTTTG TACTGTTTCT CTTTTTGGTT GCGAAGCATC AAAAAGTAAT CGATGAAACT
TTAAAATATC GCACTCTTGC GGAAATAAAC AAAAGATGCG TAGCAAGAAT GGAAGGCACG
TGGACAGAGT TTGAGGACAA GGGAGAGGAA TATGCAGACC CAAATCATAT GTATTCCAAA
GACCTTGATG TTTTTGGGCA TGGTTCTCTT TTTCAGTGGA TCAATACGAC AAATACATTT
TTTGGCAGGG AAAGGCTAAG GCGTTTGCTG GAATTCCCTG AAAAGGAGGC CGGACAAATC
AGAAAGAGGC AGAATGCGGT AAAGGAGCTT TCAAAGAAAA TTGACTTTTG CCAAAGTTTG
CAATGTGAAG GAATGATGGC TCAAAATGCT TCGAAAAATC CTGACAAGCT ACTTGACTTT
TGTGAGGACG GCACCAAACT TTTTCGTAAC AAGCTTGCAG AACAGTTGTT TTATATTCTT
CCGGAGCTTA CAATAGTTTT TCTGGTTATA TGCTGGCTTG ATTCTTCGGT TTCGCTGTAC
ATACCTTTCT TTCTTCTTGG TGTGCAGGCT GTCATAAATA TTGTGTTGCA TGGCAGGGTA
AGCAGTATTT TAGGTCCGGT AAGCAGGTAC AAGAATGAGA TAAAGGTATT TTATAAAATG
ATTGAACTGA TTGAGAAGAA AGAGTTTGAG GATGAATACC TGATAGAATT AAAATCCCGG
CTGTTTGACA AGAAGAAACC GGCATCGGAG CAGATAAAGG ATTTGGAGAA AATTGTAGAG
GCCACTGACA TCGGGAAGGG TTATATAGTT GAGATTTTAT TGAATTTTTT CTTGTTCTGG
AATATTCACT GTGTGTTTGC ATTGGAAAGA TGGAAAGCAA AGGCCGGTAA GGCAATACGA
ATCTGGCTTG AGACTATAGG TGATTTTGAA GCTTTGGCAA GTTTGGCTCT TGTTGCGCAG
ATGAACCCCG AATGGGCTTT CCCTGAAATA TCGGACCGGA AGGTTTGCTT TAATGCCGTT
GACATGGGAC ATCCTTTGAT AAATGAGGGT AAACGTGTGT GCAACAGTAT CAATATGGAC
AATAAAATCT GTATTGTTAC AGGGTCGAAT ATGTCCGGGA AAACCACGCT TTTGAGGACT
GTGGGGGTAA ATCTTCTGCT TGCTTATGCG GGAACCGCCG TTTGTGCCAA AAAAATGACA
TGCTCCGTAA TGGATATATG CACTTCAATG AGGGTTGTAG ATGATTTGAA CGAGGGAATA
TCCACTTTTT ATGCGGAGCT TTTAAGAATA AAGATGATAA TAGACCACTC AAGAATGAAA
AAGCCGATGA TATTCCTTAT TGACGAAGTG TTCCGGGGAA CCAATTCCCT TGACAGGGTC
ACAGGTGCAC GAAATGTTTT GCTGAATCTT GACAAAGACT GGGTTATCGG AATGATTTCA
ACCCATGATT TTGAACTCTG CAATTTGGAA AAAGGACGCG AAGGAAGGAT TGTCAACTAC
CACTTTGCAG AAACCTACAC CAACAACGAA ATAAAATTTG ATTATATTTT AAGGCGGGGA
CAGTGTAAAA AAAGCAATGC AAGATATTTG ATGAAAATGG TGGGAATAGA GCTTTTGGAT
GAGTGA
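
The GC content reported in the Gene Information section could be recomputed from a listing like the one above. This is a minimal sketch in plain Python, assuming the sequence is pasted as a string with the space-separated 10-base blocks left in place; `gc_content` is a hypothetical helper name, and how the database rounds its percentage is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First two blocks of the gene sequence, spaces kept as in the listing.
    print(gc_content("ATGAATACAA AGCAAATGTA"))
```

Passing the full 1806-base listing instead of the short fragment would give the gene-wide value to compare against the reported figure.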
 
Protein sequence
MNTKQMYLKR MRQYSDNAEK LERRSGMYST IRLVTFAAGT FFTVLAFMYL SNVHGFISMG 
VFFVLFLFLV AKHQKVIDET LKYRTLAEIN KRCVARMEGT WTEFEDKGEE YADPNHMYSK
DLDVFGHGSL FQWINTTNTF FGRERLRRLL EFPEKEAGQI RKRQNAVKEL SKKIDFCQSL
QCEGMMAQNA SKNPDKLLDF CEDGTKLFRN KLAEQLFYIL PELTIVFLVI CWLDSSVSLY
IPFFLLGVQA VINIVLHGRV SSILGPVSRY KNEIKVFYKM IELIEKKEFE DEYLIELKSR
LFDKKKPASE QIKDLEKIVE ATDIGKGYIV EILLNFFLFW NIHCVFALER WKAKAGKAIR
IWLETIGDFE ALASLALVAQ MNPEWAFPEI SDRKVCFNAV DMGHPLINEG KRVCNSINMD
NKICIVTGSN MSGKTTLLRT VGVNLLLAYA GTAVCAKKMT CSVMDICTSM RVVDDLNEGI
STFYAELLRI KMIIDHSRMK KPMIFLIDEV FRGTNSLDRV TGARNVLLNL DKDWVIGMIS
THDFELCNLE KGREGRIVNY HFAETYTNNE IKFDYILRRG QCKKSNARYL MKMVGIELLD
E
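
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in their permitted start codons), and it stops at the first stop codon. `CODON_TABLE` and `translate` are illustrative names, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGAATACAA AGCAAATGTA TTTAAAGCGT"))
```

For the first 30 bases of the gene this yields MNTKQMYLKR, which matches the first block of the protein sequence.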