Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CHU_2236 |
Symbol | mutS |
ID | 4183770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | - |
Start bp | 2604405 |
End bp | 2606177 |
Gene Length | 1773 bp |
Protein Length | 590 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638072237 |
Product | DNA mismatch repair protein |
Protein accession | YP_678841 |
Protein GI | 110638632 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGCAT ATATTTCATT AAAAAATACC TACAGCCATC AGTTAATCAG GCTTAAGCAA ACACACACCT GGATCAGTAT AGTGCGGATT GCACTTGTTT TTATTGCCCT GATTTGCTTT TATTTTTTTA TAACCAGTTT TGAATCTGGA GCCTTTATTA TTGGTCTTAT TTCAATCATC GCTTTTGTTT TTGTTCTGGT TTGGCATAGG AAAAAATCAG CTGAGATCCT GTTTAAAGAA ACGCTTGTAA CCATTATTTC TCAGGAGATT GCATATCTGG AAAATAAAGA ATTGCCTTTT GAAAACGGAG CTGACTATAA TGAAACGAAT CATCCCTATA CATATGATCT CGATATTTTC GGGTACAGAA GTTTGTTTCA GCATCTGAAC AGAACAGCAA CATACCTTGG TAAGACAAGG CTGGCAAATT CGTTAAAGCA TATATTACCC AATGAAGCTA TTGAAAAGAA TCAGCAAGCC ATAAAAGAGC TGTCAGAAAA ACTAACCTGG CGGCAGGAGA TACTGGCACG GGCTAAAATG GCAAACGATA CGAAAGAAAT TTATACTAGC ATTCTTACCT GGTCTACGAA AAAAGCAGGA GAAGTGCCCG TATACATGCG TATTATTTCT TTTGTGTTTC CAGTGGCATT GTTTATACTA TTTGGGATGG CAGCAATTTC TGATTCAGGC ATATATATGA AGGCCGCTGA AATTTTATTT GTAGTGAATC TGATCATTAT CGGGCTACAT CTTAAAACCA TAAAGGCAGA ATTGTTTCAT GCCGATAAAA TTGAAGTAAT CATTCAGCAG TACAGTCTTA TTCTTGAAAA GATAGAAGGA GAAACATTTT CTGCTGCACG TTTAATAGAG CTGAAGGATC AATTGCTGCA TGCGGATATT TCTGCAAGTG CCCATTTAAA TACACTCTCA AAATTATTTG CCCATGTAGA AACAATTAAT AATGCGGTGG GTTCGCTGTT TATGAATGGA CTGTTTATGT ATCACATACA CAGCTTACAG GCACTATTGA AATGGAAAGA AAGGCATGCT TTTAGAATTG CCGAATGGAT CTCGGTAATT GGTGAGATAG AAATGCTGAA CAGCTATGCA AACCTGTCCT ATAATAATCC GGATTTTATT TTCCCGGCTT TGCGTACAGA TTACAGCATT CAACTGACAG GTGCGGGCCA TCCGCTGATT GATAAAAAGA AACGTATCTG TAACGATGTC GTTTTTAATA CAGGGAATTT TATTATTCTT ACCGGTTCAA ATATGTCCGG GAAAAGTACG TTTCTGCGTA CGCTTGGCGT AAATATGGTA TTAGCGGGAG CAGGCGCTCC CGTGTGCGCT TCTGCTGCAC AGATTCATCC GCTGCCTGTC ATTGTATCCA TGCGTTTATC CGATTCATTG TCTGATAGCG AATCGTATTT CTTTGCGGAA GTAAAACGGC TGAAACAATT GATGCAGATG CTGGATGAGC AGATGTGCTT TGTCTTGCTG GATGAAATTT TGCGTGGCAC CAATTCGGAT GATAAGCGTA TTGGTACTAT TGAAGTGATA AAAAAGATTG TGGCAAAGAA CGCAATCGGT ATTGTTGCCA CGCACGATCT GGAAGTGTGT AACACAACAC AGGAATATCC GGAAAAACTC TCTAACAAAT GCTTTGAGGT ACAAATAATC AACGATGAAC TGGTCTTTGA TTACAAGCTT CGTGAAGGCA TCTGTAAAAA TAAGAGCGCC ACTTTTTTGA TGAAAAAGAT GGGTGTAATA TAA
|
Protein sequence | MQAYISLKNT YSHQLIRLKQ THTWISIVRI ALVFIALICF YFFITSFESG AFIIGLISII AFVFVLVWHR KKSAEILFKE TLVTIISQEI AYLENKELPF ENGADYNETN HPYTYDLDIF GYRSLFQHLN RTATYLGKTR LANSLKHILP NEAIEKNQQA IKELSEKLTW RQEILARAKM ANDTKEIYTS ILTWSTKKAG EVPVYMRIIS FVFPVALFIL FGMAAISDSG IYMKAAEILF VVNLIIIGLH LKTIKAELFH ADKIEVIIQQ YSLILEKIEG ETFSAARLIE LKDQLLHADI SASAHLNTLS KLFAHVETIN NAVGSLFMNG LFMYHIHSLQ ALLKWKERHA FRIAEWISVI GEIEMLNSYA NLSYNNPDFI FPALRTDYSI QLTGAGHPLI DKKKRICNDV VFNTGNFIIL TGSNMSGKST FLRTLGVNMV LAGAGAPVCA SAAQIHPLPV IVSMRLSDSL SDSESYFFAE VKRLKQLMQM LDEQMCFVLL DEILRGTNSD DKRIGTIEVI KKIVAKNAIG IVATHDLEVC NTTQEYPEKL SNKCFEVQII NDELVFDYKL REGICKNKSA TFLMKKMGVI
|
| |