Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_2381 |
Symbol | |
ID | 5742451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 2933768 |
End bp | 2936566 |
Gene Length | 2799 bp |
Protein Length | 932 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641293471 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001559481 |
Protein GI | 160880513 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACAAC TAACTCCAAT GATGCAGCAA TATGTGGAGA CAAAAGAACA ATATAAGGAT TGTATTCTTT TTTATCGTTT GGGTGACTTC TATGAAATGT TCTTTGAAGA TGCCTTAGTA GCCTCTAAGG AATTAGAGAT AACCTTAACC GGGAAGAATT GCGGGCAAGA GGAAAGAGCT CCTATGTGTG GGATACCTTA CCATGCCGCA GAAGGGTATA TTTCTAAGCT AATTGGGAAG GGATATAAAG TTGCGATCTG TGAACAAGTA GAAGATCCTA AGTTAGCGAA AGGAATTGTA AAACGAGAGG TTATCCGTAT CGTTACGCCT GGAACGAATC TAAATACCCA GACATTAGAT GAAACGAGAA ATAATTATCT TATGGGAATT ATCTTTACCG ACGAACATTG CGGTATATCA ACGGTTGATA TTACAACAGG TGATTACTAC GTAACTGAGG TCGAGAACAA CCGTAAGATT TTAGATGAAA TATATAAATA TACACCTTCG GAAATTGTTT GTAATCCAGA ATTTTTTCAC TGTGGGCTAG ATGTTGAAGA TTTAAAAAAT AGATATCAGA TAGCAGTATC CACCTTTGAG GACTGGTATT ATGACAGCGA ACAAAGTGTT AAGACATTAA AGGAACATTT TAAAGTAGGC TCTTTAGACG GTCTAGGATT AAAAGATTAT TCTGTCGGGG TGAATGCAGC TGGTGCTATC TTAAAGTACC TTTATAACAC TCAGAAGAAT TCACTTAGTC ATTTGACACA TATAACGCCA TACGTTACAA GTCGCTATAT GGTGATAGAC AGTTCCAGTA GAAGAAATCT AGAATTGACG GAGACACTTC GTGAAAAGCA AAAACGAGGG TCTCTTCTTT GGGTATTAGA TAAAACAAAA ACAGCCATGG GAGCTAGAAT GCTCCGTAGT TTTGTAGAAC AGCCACTGAT CACAATGGAT GAGATTTCAG CTCGTTATGA TGCGATTTCA GAACTGAACG ACAATGTGAT AACGCGGGAA GAAATACGAG AATACTTAAA TTACATTTAT GATTTGGAAC GCTTGATGGG AAAAATCAGC TATAAGAGTG CAAATCCAAG AGATTTAATT GCCTTTGCTT CTTCACTATC TATGCTTCCA CATATCAAAT ACTTGTTATC AACCTGCGAA TCCGCATTGT TAAAACAAAT TCATGAGGAG ATGGATGCTC TTGATGACTT ACAAAACTTA ATTGATCGCT CTATAGCAGA AGAACCACCG ATTGGAATCA AAGAGGGTGG CATCATAAAA GAAGGTTTCC ATACAGAAGT TGATACCCTT CGAAAAGCGA AAACAGAAGG GAAAGTATGG CTTGCAGAAC TGGAAGCGAA AGAAAAAGAG CAGACAGGAA TTAAGAATCT AAAGGTAAAA TACAATCGTG TCTTTGGATA TTACCTAGAA GTGACGAATT CTTATGCAAA TCTGGTACCG GAAAACTGGA TAAGAAAGCA AACGTTATCA AATGCCGAAC GTTATACAAC ACCAGAACTT AAGGAATTAG AAGATAAGAT ATTAAATGCA GAGGATCGTT TATTCTCTCT TGAGTATGAT TTATTTGCCG AAATTAGAGA TCAAATCGCT GAAGAAGTAA AACGAATTCA AAAAACTGCA AAAGCGGTAG CGAACATTGA TGCGTTTGCT TCACTTGCCT ATGTTGCAGA AAGAAATCAA TTTATCCGTC CTGAGTTAAA TACCAACGGA ACGATTGACA TAAAAGAGGG AAGACATCCA GTTGTAGAAC AAATGATACC AAACGATATG TTTGTGTCAA ATGATACGTA TCTTGATAAT GCTGAGAAAA GAATCTCCAT TATCACAGGT CCTAACATGG CTGGTAAATC TACCTATATG AGACAAACAG CGTTAATTGT ATTAATGGCT CAAGTAGGAA GCTTTGTTCC TGCATCTTAT GCAAACATTG GTATTGTTGA TCGTATTTTT ACCAGGGTAG GTGCGTCTGA TGATTTAGCA AGCGGTCAGA GTACCTTTAT GGTGGAGATG ACGGAGGTGG CGAATATCCT TCGAAATGCT ACGAAAAACA GTTTATTAAT CTTAGATGAA ATTGGCCGTG GTACGAGTAC GTTTGACGGA CTAAGTATTG CATGGGCAGT TATTGAACAT ATCAGTAATA CATCAATGCT TGGTGCAAAG ACATTATTTG CGACGCATTA CCATGAGTTA ACAGAATTAG AAGGCAAGAT ATCCGGTGTT AATAATTACT GCATTGCGGT GAAAGAACAA GGAGAAGATA TTGTCTTTCT TCGAAAGATT ATAGGAGGCG GAGCGGATAA GAGTTATGGC ATTCAAGTTG CAAAACTTGC CGGTGTTCCA AACTCGGTAT TAGTAAGAGC AAGAGAAATT GTGGATCAGC TAAGTGAGAA TGACATTGCA GAAAAAGCAA GACATATTGT GTCTGCTGCG GAAATTTCCA ATCTTACACC AGAAACCGAA GGCGAAGTGA ATACCAATAA AATGTATACC ACTAAAGTGA ATGCAACTGA AGTGATTACA ACTGAAGTGA ATACAGCTAA AATGAATACC ACTGAAATGG TAAGTAATCA GGAGTCTGTA GAACAGCCAA GAAACTTTGG CCAGATGTCA TTTTTCATAA CAGAAGATAC AAAACAGAAA AAAGCGTCCT CAGAATTTTC TGAAAAGTTA GTGCAGGAAA TAAATCAGTT TGACCTTGCC AATATGACTC CGGTGGAAGC ATTGTTAAAA TTGGATAAAT TACAGAAAAA AATACGTTCT CACACTTAA
|
Protein sequence | MAQLTPMMQQ YVETKEQYKD CILFYRLGDF YEMFFEDALV ASKELEITLT GKNCGQEERA PMCGIPYHAA EGYISKLIGK GYKVAICEQV EDPKLAKGIV KREVIRIVTP GTNLNTQTLD ETRNNYLMGI IFTDEHCGIS TVDITTGDYY VTEVENNRKI LDEIYKYTPS EIVCNPEFFH CGLDVEDLKN RYQIAVSTFE DWYYDSEQSV KTLKEHFKVG SLDGLGLKDY SVGVNAAGAI LKYLYNTQKN SLSHLTHITP YVTSRYMVID SSSRRNLELT ETLREKQKRG SLLWVLDKTK TAMGARMLRS FVEQPLITMD EISARYDAIS ELNDNVITRE EIREYLNYIY DLERLMGKIS YKSANPRDLI AFASSLSMLP HIKYLLSTCE SALLKQIHEE MDALDDLQNL IDRSIAEEPP IGIKEGGIIK EGFHTEVDTL RKAKTEGKVW LAELEAKEKE QTGIKNLKVK YNRVFGYYLE VTNSYANLVP ENWIRKQTLS NAERYTTPEL KELEDKILNA EDRLFSLEYD LFAEIRDQIA EEVKRIQKTA KAVANIDAFA SLAYVAERNQ FIRPELNTNG TIDIKEGRHP VVEQMIPNDM FVSNDTYLDN AEKRISIITG PNMAGKSTYM RQTALIVLMA QVGSFVPASY ANIGIVDRIF TRVGASDDLA SGQSTFMVEM TEVANILRNA TKNSLLILDE IGRGTSTFDG LSIAWAVIEH ISNTSMLGAK TLFATHYHEL TELEGKISGV NNYCIAVKEQ GEDIVFLRKI IGGGADKSYG IQVAKLAGVP NSVLVRAREI VDQLSENDIA EKARHIVSAA EISNLTPETE GEVNTNKMYT TKVNATEVIT TEVNTAKMNT TEMVSNQESV EQPRNFGQMS FFITEDTKQK KASSEFSEKL VQEINQFDLA NMTPVEALLK LDKLQKKIRS HT
|
| |