Gene Cthe_1014 details

Gene Information

Locus tag: Cthe_1014
Symbol:
ID: 4811308
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1212794
End bp: 1215175
Gene Length: 2382 bp
Protein Length: 793 aa
Translation table: 11
GC content: 42%
IMG OID: 640106432
Product: MutS2 family protein
Protein accession: YP_001037439
Protein GI: 125973529
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
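
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the gene is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1212794
end_bp = 1215175
gene_length_bp = 2382
protein_length_aa = 793

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute a residue.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                     # coordinates agree with gene length
print(gene_length_bp % 3 == 0)                       # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop agree with protein length
```

Under these assumptions all three checks agree with the listed values: 1215175 - 1212794 + 1 = 2382 bp, and 2382 / 3 - 1 = 793 aa.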


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0625051
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGAAA AAACACTTAA AATATTGGAG TTTAATAAAA TAATTGACAA ACTGGTAAGC 
CTTGCAACAT CTTCTTTGGG AAAAGAACTG GCGGAAAAGC TGGTGCCAGA TACTGATCTT
AACAGGGTTG AAAGGGCACA AAAGGAAACC AGTGACGCCG TCGCTTTTAT TGCAAGAAGG
GGAACCCCGC CCATGGGGGG AATTCATGAT ATACGGGACA GTTTAAAAAG AGTTGAAATC
GGAGCGATTC TAAACCCTGG AGAGCTTTTA AAAACTGCCG ACGTTTTAAG GGCTGTAAGA
AATCTTAAAA GCTATGCGAG CAATGACAGA ATCAAGACCG ATGAAGACAA TATTGTAAGT
GAGCTTATAG GATGCCTTGA ATCCAATAAG CGGATTGAAG ACAGGATTTA CATGTCAATA
CTAAGTGAGG ATGAAATAGC TGACAATGCA AGCCCGACCC TTGCCAACAT AAGAAGGCAG
ATACGAAATG CCCAGGAATC AATAAAAGAT AAGCTCAATG ATATCATAAG GTCGTCAAGA
TATCAGAAAT ATATACAGGA GCCCATAGTT ACTTTAAGAG GAGACAGATA TGTAATACCT
GTCAAGCAGG AGTACAGAAC CGAAATACCC GGACTTATAC ATGATTCATC CGCCAGCGGG
GCGACCATTT TTATTGAGCC TATGGCGGTT GTGGAGGCCA ACAACCACAT ACGGGAGCTT
AAAATTAAAG AGCAGGCTGA AATTGAGAAA ATACTGGGGG AATTGACCGG GGAGATAAGA
GGAATTGTTG ATTCCTTAAA GTCGAATGTT TCAATTTTAG GCCGTTTGGA TTTCATATTT
GCCAAGGCAA GGCTCAGCCT TGATTATAAC TGTGTTTGCC CTGTACTTAA CGATGAACAT
AAAATATTAA TAAAAAAGGG AAGACATCCT CTTTTAGACA AAAAAACCGT TGTTCCCATC
GATTTTTGGA TCGGGGAAGA CTTCAACACC CTTGTAGTGA CCGGACCCAA TACCGGAGGT
AAAACGGTTA CTTTGAAAAC TGTGGGCCTG TTTACCCTTA TGACGCAGGC AGGGCTTCAT
ATTCCGGCAA ATGAGGGAAC CAAAATGAGT ATTTTCAAAA AAGTCTATGC CGACATAGGG
GATGAGCAGA GTATCGAACA GAGCCTTAGT ACTTTTTCTT CGCATATGAA GAATATAGTT
GGAATATTAA AGGATGTGGA TGAAGATTCC CTTGTTCTGT TTGATGAGCT TGGAGCGGGA
ACAGACCCTA CCGAGGGTGC CGCCCTTGCA ATGTCAATAC TTGAGTATTT AAGAAACAAG
GGCAGTACAA CGGTTGCCAC CACCCATTAC AGCCAGCTGA AAGCGTATGC CGTTACCACA
AAATTTGTGG AAAATGCCTG CTGCGAGTTT AATGTGGAGA CACTAAGGCC CACTTACAGG
CTATTGATTG GAGTTCCCGG AAAAAGCAAC GCCTTTGCAA TATCAAAAAG GCTGGGGCTT
TTTGATGACA TTATTGAGAA GGCCAAAGAA TTTTTAACCC AGGACGATAT AAAGTTTGAA
GACATGCTTA TGTCGATTGA GAAAAACTTA AATCAGTCCG AAAATGAAAA AATGAAAGCT
GAAAGCTATC GACTCGAAGC CGAAAAGCTA AAAAAAGAGC TGGAGGAGCA AAAAAGAAAG
CTTGCTGAAA ATCGGGAAAG ATTAATACAG GAAGCAAGAG CTGAAGCGAG AAAAATTCTT
CTTGAAGCAA GAAAAGAGGC GGAAGAGATT ATTTCTAAAA TGAGGAGGCT TGAACAGGAA
GTCCATAACG CGCAGAGGCA AAAGGAGGCG GAAGAGCTTA GGCTCAAGCT TAAAAGAAAG
GTTGATTCCA TTGAGGAAAC ACTGGAATTG CCCCTTGCTC CGAAAAACGC TTTGGTAAAA
CCCCCGGAGA ATTTAAAGCC CGGTGACAGT GTTCTAATCG TCAATTTGGA CCAGAAAGGA
ACGGTTATCA CTCCTCCGGA CAAGGACGGA GAAGTGGTGG TTCAGGCCGG AATTATGAAA
ATAAACGTTC ATATATCAAA TTTAAAACTG GTGGACGAAC AAAAAATTGT GTTAAACAAT
TCCGGAATTG GCAAAATAGG TATGTCAAAA GCAAAAAGCA TATCAACTGA AATTGATGTA
AGGGGATACA ACTTGGAAGA GGCCATTGAA AGTGTCGACA AGTATTTGGA TGATGCTTAT
CTTTCCGGGC TTACGGAGGT ATCTATTATT CACGGCAAGG GAACCGGAGT ACTCAGAAGT
GGCATACAGA AATTTTTAAA ATCAGATTCC AGGGTTAAAT CTTTCAGGCT TGGAAAGTAC
GGAGAAGGTG AATCGGGAGT TACAATAGTC GAACTTAGGT GA
 
Protein sequence
MNEKTLKILE FNKIIDKLVS LATSSLGKEL AEKLVPDTDL NRVERAQKET SDAVAFIARR 
GTPPMGGIHD IRDSLKRVEI GAILNPGELL KTADVLRAVR NLKSYASNDR IKTDEDNIVS
ELIGCLESNK RIEDRIYMSI LSEDEIADNA SPTLANIRRQ IRNAQESIKD KLNDIIRSSR
YQKYIQEPIV TLRGDRYVIP VKQEYRTEIP GLIHDSSASG ATIFIEPMAV VEANNHIREL
KIKEQAEIEK ILGELTGEIR GIVDSLKSNV SILGRLDFIF AKARLSLDYN CVCPVLNDEH
KILIKKGRHP LLDKKTVVPI DFWIGEDFNT LVVTGPNTGG KTVTLKTVGL FTLMTQAGLH
IPANEGTKMS IFKKVYADIG DEQSIEQSLS TFSSHMKNIV GILKDVDEDS LVLFDELGAG
TDPTEGAALA MSILEYLRNK GSTTVATTHY SQLKAYAVTT KFVENACCEF NVETLRPTYR
LLIGVPGKSN AFAISKRLGL FDDIIEKAKE FLTQDDIKFE DMLMSIEKNL NQSENEKMKA
ESYRLEAEKL KKELEEQKRK LAENRERLIQ EARAEARKIL LEARKEAEEI ISKMRRLEQE
VHNAQRQKEA EELRLKLKRK VDSIEETLEL PLAPKNALVK PPENLKPGDS VLIVNLDQKG
TVITPPDKDG EVVVQAGIMK INVHISNLKL VDEQKIVLNN SGIGKIGMSK AKSISTEIDV
RGYNLEEAIE SVDKYLDDAY LSGLTEVSII HGKGTGVLRS GIQKFLKSDS RVKSFRLGKY
GEGESGVTIV ELR
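
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own pipeline. It assumes the standard codon assignments, which translation table 11 shares for all sense and stop codons (table 11 differs in the start codons it permits). The names `CODON_TABLE` and `translate` are illustrative helpers. Only the first and last lines of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (TTT, TTC, TTA, TTG, TCT, ...).
# "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace and any trailing partial codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGAACGAAA AAACACTTAA AATATTGGAG TTTAATAAAA TAATTGACAA ACTGGTAAGC"
last_line = "GGAGAAGGTG AATCGGGAGT TACAATAGTC GAACTTAGGT GA"

print(translate(first_line))  # MNEKTLKILEFNKIIDKLVS
print(translate(last_line))   # GEGESGVTIVELR*
```

The first output matches the first 20 residues of the protein sequence, and the second matches its final 13 residues followed by the TGA stop codon.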