Gene Information |
Locus tag | Cphy_1218 |
Symbol | |
ID | 5743317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 1536445 |
End bp | 1538409 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641292323 |
Product | DNA mismatch repair protein MutS domain-containing protein |
Protein accession | YP_001558335 |
Protein GI | 160879367 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
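The coordinate and length fields above are consistent with one another, which a short sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Cphy_1218.
length = gene_length(1536445, 1538409)
print(length)                  # 1965
print(protein_length(length))  # 654
```

Both results match the Gene Length and Protein Length fields of the record.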
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.215011 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAATACAA GACAGACGTT AGAATTTCAG AAAATATTAG AAATGTTATG CGAATATGCA GTATCAGAAG AAGCAAAAAA GAGTTTGCTT AAGATGGAAC CTAGTCTTAG TGAGACAGAG GTATGTAATC GAACCAAAGG GACAACAGAA GCTAGGATGA TTTATGATGT ACAAGGAAAT CCTCCGATGT CAGAGCGAAA AGATATTATG ATGATATTAT CGCTTGCCAA CAAAGGTGGA ATGTTATCAC CAGAACAACT AACTTTAGTA TCACAGTTTA TCGCTGCCAG CAGACGTTTA AAAAGTTATC TAACCAAGGC TCAATGCCTT AAGGTAGATT TAGCTTTCTA TGCGGATTCT TTCACATCAT TAGAGGATTT ACAAGGAATT ATTGATGGAG CAATCAGAAA TAATCAGATC GATAGCTCGG CATCCAAAGA GCTAAAAGAT ATTAGGCGAA AGATGGAATC CGTAAGTGGA GCAATGAAAT CAAAATTGGA GGCACTTCTA AGAAGTAAAA AAGAGTATTT TAGCGAAGGG TTTGTGTCAT TAAGAAATGG GCATTTTGTA CTTCCAGTAA AAAAAGAGTA TAAGCATCAG GTTTCAGGAA CCGTACATGA TGTTTCCTCT AGCGGTGCAA CGTACTTTAT TGAGCCGGTA ATTGCAGTTC GCTATAGTGA AGAACTATCA GCCTTAAAAT CAGCAGAAGC AAAAGAGGAA GCGGTGATTT TATATACGTT AACCTCTCTT GTGATAGAGA ATGAGTTCGA GCTAATGAGA AATTATGAAA CAATGGGAAT TCTCGACGAA ATATTCGCTA AAGCTAAACT GTCTGCATTT ATGAAGGCAG TTCCAGCAAG CCTCAATACA GATCGAAAGA TTAGGATAGT GAATGGCAGA CATCCACTTT TAAACAGAGA GAATTGCGTT CCTCTTAATT TTGAATTTGC AAATGGTATT CGAGGAGTAA TCATTACCGG GCCTAATACA GGCGGTAAAA CTGTAGCACT AAAAACAGTT GGATTATTAT CCATGATGGC TCAAAGCGGT CTTCATGTTC CATGTGATGA GGCGGTTTTA TGTATGAATG ATGCGATTCT TTGTGATATT GGAGATGGTC AAAGTATCAC AGAGAACCTT TCAACATTCT CAGCTCATAT TACGAACATC ATTGCGATAA TAAAGGAAGT TACGAAAGAT AGTTTGGTAC TTCTAGATGA GTTAGGCTCA GGAACAGACC CTGCAGAGGG GATGGGGATT GCAATTTCGA TACTGGAAGA ACTTAAAAAG AAGCAGTGTT TATTTATAGC TACCACTCAC TACCCACAAG TAAAAGACTA TGCAGCACAG TCAGAGGGAG TTGTGAATGC GAAGATGGCA TTTGATAGAG AAAGCTTAAA ACCACTCTAT CACTTAGAAG TTGGTGAGGC AGGTGAAAGT TGTGCTTTGT ACATTGCGAA AAGATTAGGA TTACCAAAGC ACATGCTTTT GATTGCTTAT CAGAATGCCT ATGATACTAA GGAAAATGGG AAAATTAAAC AAAATAATGA AAGTGAGCTT TTTTTCGAGA ATAGTCATAT AAACGAGGAA CAAGTAAACA TAGAAAATAC AGGGAATACA GAAAATACAG CGAGTAAACC CCATATAGAA AAGAAAATTG AGAGTAGGAA AAAGGAGCTT CCGAAAAAAG CAGCAAGTTT TCACCTTGGG GATTGTGTGA TTGTGTATCC AGAGAAGAAA ATAGGGATCG TGTATCAAGT GTGTAATGAA AAGGGAGAAA TAGGGATTCA AATTGCAAAA 
ACTAAAAAGC TTATTAATTA TAAACGTATA AAACTTCATG TCGCAGCAAC GCAGATGTAT CCGGAAGATT ATGATTTTTC AATTGTATTT GATACCGTAG CAAACCGAAA GGCCAGACAC AAGATGGAGA AAGGTCATCA GGAGGGAATG GAGATAAGAT ATTAA
Protein sequence | MNTRQTLEFQ KILEMLCEYA VSEEAKKSLL KMEPSLSETE VCNRTKGTTE ARMIYDVQGN PPMSERKDIM MILSLANKGG MLSPEQLTLV SQFIAASRRL KSYLTKAQCL KVDLAFYADS FTSLEDLQGI IDGAIRNNQI DSSASKELKD IRRKMESVSG AMKSKLEALL RSKKEYFSEG FVSLRNGHFV LPVKKEYKHQ VSGTVHDVSS SGATYFIEPV IAVRYSEELS ALKSAEAKEE AVILYTLTSL VIENEFELMR NYETMGILDE IFAKAKLSAF MKAVPASLNT DRKIRIVNGR HPLLNRENCV PLNFEFANGI RGVIITGPNT GGKTVALKTV GLLSMMAQSG LHVPCDEAVL CMNDAILCDI GDGQSITENL STFSAHITNI IAIIKEVTKD SLVLLDELGS GTDPAEGMGI AISILEELKK KQCLFIATTH YPQVKDYAAQ SEGVVNAKMA FDRESLKPLY HLEVGEAGES CALYIAKRLG LPKHMLLIAY QNAYDTKENG KIKQNNESEL FFENSHINEE QVNIENTGNT ENTASKPHIE KKIESRKKEL PKKAASFHLG DCVIVYPEKK IGIVYQVCNE KGEIGIQIAK TKKLINYKRI KLHVAATQMY PEDYDFSIVF DTVANRKARH KMEKGHQEGM EIRY