Gene Ccel_1433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1433 
Symbol 
ID7310204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1739138 
End bp1741084 
Gene Length1947 bp 
Protein Length648 aa 
Translation table11 
GC content40% 
IMG OID643608357 
ProductDNA mismatch repair protein MutS domain protein 
Protein accessionYP_002505767 
Protein GI220928858 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00229473 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAT ATGCCGTAAC ACTTGAATTT AACAAAATAA TTGAAATGCT TATGGAAAAT 
ACCGTTTCTC AAAGAGCTAA AGAGAATCTT TCGATGCTTC AACCTTTTCT GAAGGAGGGA
GAATGCAGAA GAAGAATGAA GGAAACTACC AACGCAAAAA GAATTCTTGA AAGTCTGGGG
ACTCCTCCTC TCTCTTCAAT GAACGAGCTA GACAGAATTC TTGATCTTTG TTCAAAGGGT
GCAATGCTTG TTCCTGAACA ATTAGAAGAG GTTTCACAGT TTCTTCTTGC CTGCAGACGT
ATGAAATCAT ATCTCAAAAG GACCGAGAGC CTTGAGAATG ATATTGCTTA TTATGGAAGC
TCAATAGATG CACTTAATGA CCTCTGTGAA GAAATTGAAC ACTCAATCAG AAATGGACGG
GTTGATGATT CTGCTTCACC AAAGCTGAAA GATATACGAA GAAGGAAGGA TAATTCCCAC
CAACAAGTCC GTACCAAGCT TGAAAGTATA TTAAGGAGTA AAAAAGAATG GTTTGCGGAC
AGTTTTGTTT CAATGAGAAA TGGTCATTTC GTACTGCCCG TAAGAAAAGA ATATAAGAAC
ATGGTAGTCG GCTCGCTGAT AGAAACCTCC GCCACAGGAG GAACTTATTT TATTGAGCCT
TCCGCTGTAC GCAGACTTCA AGAGGAAATA TCCGCCCTTA CAGTGGAAGA AGAAAATGAA
GAAAGAAAAA TCTTATATAC TCTGACCTCA TTAGTGGATG AACATTCAGC TGTATTTAAA
ACTAACGTAG ATATAATGGA AACTCTTGAT GCAGTTTTTG CTAAGGCTAA GCTTTCAGTT
CAGATGAAGG CCTGTGAAGC TGATATTCAT ATGGACCGCA GGATTGTTAT CAAAGCCGGA
CGCCACCCTC TTCTTAATCA ATCGGAATGT GTTCCTCTTG ACTTTGAAAT CGGTAGTGGT
ATAAGAGGAG TTGTTATAAC TGGGCCAAAT ACAGGAGGAA AAACAGTGGC CCTTAAAACT
GTAGGGTTGC TTTCAATCAT GGCACAAAGC GGACTTCATG TTCCCGCCGC CATGGGAAGT
GAATTTTCAA TGCATAATAT GATTTTATGT GATATCGGAG ACGGACAGAG TATTACCGAG
AACTTGTCAA CTTTTTCGGC ACACATAACA ACCATAAATG ATATTCTAAG GCAATCCACG
GGAGAAAGCC TTGTCCTGCT TGACGAAGTA GGCTCTGGAA CCGACCCGGC AGAAGGTATG
GGTATAGCTA CCGCAATTCT GGAAGATTTG AAAAACAAGG GCTGCCTGTT TGTAGCAACT
ACTCATTATC CTGAAATAAA GGACTATGCC AAAAAAACAC CCGGTCTTGT AAATGCCAGA
ATGGCTTTTG ACAAGGAAAG TTTGAAACCT TTGTATAAAC TTGAAATTGG TGAGGCCGGG
GAAAGCTGTG CCCTGTATAT TGCAAAAAGG CTTGGCTTTC CCCCACATCT TCTCAGGCTT
GCTCATGATG CCGCCTATAA AGAGTACCCT ACTAAGGTTA ATTCAGAGAA TAATGAATCC
TTGTTAAGCC CTATTGGAGC GGACAGCCAG TCCGGAATTC CACCTGACTT AAATGCACCT
GTTCTTATAA AGGAAAACCC AAAGAAAAAT GTACCACAAA ACAGAAGCAG CAGATTTAAC
ATTGGAGATA GTGTTATGGT CTTTCCTCAG AAAGAAATAG GGCTGGTTTA TCAAAAAGCC
AACGAACATG GTGAAATAGG TGTTCAGATT AAAGGACAAA AAAAACTTCT TTCCCACAAA
AGAATAAAAC TACACGTACC AGCCAGCGAG TTATACCCAG CTGACTACGA CTTTTCAATA
ATATTTGATA CTGTGGACAA TCGCAAAGCA AGACACAAGA TGGGGAAAAG ACATAATCCT
GAGTTGTTTA TAGAGTATGA AAATTAG
 
Protein sequence
MNKYAVTLEF NKIIEMLMEN TVSQRAKENL SMLQPFLKEG ECRRRMKETT NAKRILESLG 
TPPLSSMNEL DRILDLCSKG AMLVPEQLEE VSQFLLACRR MKSYLKRTES LENDIAYYGS
SIDALNDLCE EIEHSIRNGR VDDSASPKLK DIRRRKDNSH QQVRTKLESI LRSKKEWFAD
SFVSMRNGHF VLPVRKEYKN MVVGSLIETS ATGGTYFIEP SAVRRLQEEI SALTVEEENE
ERKILYTLTS LVDEHSAVFK TNVDIMETLD AVFAKAKLSV QMKACEADIH MDRRIVIKAG
RHPLLNQSEC VPLDFEIGSG IRGVVITGPN TGGKTVALKT VGLLSIMAQS GLHVPAAMGS
EFSMHNMILC DIGDGQSITE NLSTFSAHIT TINDILRQST GESLVLLDEV GSGTDPAEGM
GIATAILEDL KNKGCLFVAT THYPEIKDYA KKTPGLVNAR MAFDKESLKP LYKLEIGEAG
ESCALYIAKR LGFPPHLLRL AHDAAYKEYP TKVNSENNES LLSPIGADSQ SGIPPDLNAP
VLIKENPKKN VPQNRSSRFN IGDSVMVFPQ KEIGLVYQKA NEHGEIGVQI KGQKKLLSHK
RIKLHVPASE LYPADYDFSI IFDTVDNRKA RHKMGKRHNP ELFIEYEN