Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpin_3054 |
Symbol | |
ID | 8359219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | + |
Start bp | 3770578 |
End bp | 3771891 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644965232 |
Product | DNA mismatch repair protein MutS domain protein |
Protein accession | YP_003122728 |
Protein GI | 256422075 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.613085 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTTA TTGCAGACAA ACAAATGCTA GACGATTTGT CCTTGTTAGG TAAATTTAAT CCTGGTTCTG TTTTCAGTCT TTTTAACCAG GTAAAAACCA GGGGCGCCGA GAAATTACTG GATGCCATGT TCCTGCATCC CTTATCGGAT GTGGAAGCGA TCAACAACAG AAGTGCTGTT TTCCGCTATT TCAATCAACA TCCTGTCAGC TTCCCTTTTG ATGAAAAGCA GCTGGAACGT ATGGAATCCT ATATGGATGA AGGCGGTGAC GGGAGTTATA TCATGGCATT ATGGGAATTA AGCCGGAAGA AAGTAGCATC AGTACTGGTA AAAGATGATA CCTATGACCT GACCTTAACA GGTATAGAAA GCAGCATCTC CGTATTAAAA GCCTGTGAAA AGCTACTCAA ACAGCTGGAG CAGGAAGGAC GCGATAACGA TCAGCCATGG GTAAAATGGT CAGAGATCGT ACGCACGATC ACTACAGATG ACCGGTTGAA AGATTTCTCT AAGCCTGCCA GGTCTTTAAT GGATAATGTA CGGCTGCATC ATATGCTGAC AGGCGTATTT CGTAGTCAGC TGAAAACGCT CCTTGAACTG ATCTATGAAA CAGACCTGTA CCTGGCAGTA GCAGGCGTAG CGAAGGCAAA AGGCTTCTCC TATGCACAGG CATTACCGAA GGATCGGAAT ATACTGGAAG CTAAAGGACT GAGACATCCG GGGCTTGATA AGGGCGTCGC TAACTCATTG TCCTTTAATG CCGGTACGAA TGTCCTGTTT CTTACCGGTG CAAACATGGC GGGTAAGTCA ACATTGATGA AGTCGACAGG TATTCTCATT TATCTTGCGC ATATGGGATT CCCGGTAGCA GCCACTGAAG TGAAATTTTC TATATTGGAT GGTATTTATT CTTCCGTGAA TGTGCCCGAT GACCTGAATA AAGGATATAG TCACTTCTAT GCGGAAGTAC TGCGCGTGAA AAAGGTAGCA GAAGAAGTAG CTACAGATAA ATCTTTGTTT GTCATCTTTG ATGAGCTGTT CAAAGGTACA AACGTAAAAG ATGCTTATGA CGCTACCCTG GCGGTAACTG AAGCGTTCAC TGATTTTACA AATTGTTTCT TTATCATTTC CACGCATATA TTTGAAGTCG GACATGCATT GAATAATGGT GGATCACAGA TCGCGTTTGA ATTCCTTCCA ACCATCATGA ACAATAACGT GCCGCAGTAT ACATATCAGC TGCAGAAAGG TATTACTACT GACAGACAGG GCATGATCAT TATTGAGAAT GAAGGAATAC TGGATATGTT ATAG
|
Protein sequence | MSFIADKQML DDLSLLGKFN PGSVFSLFNQ VKTRGAEKLL DAMFLHPLSD VEAINNRSAV FRYFNQHPVS FPFDEKQLER MESYMDEGGD GSYIMALWEL SRKKVASVLV KDDTYDLTLT GIESSISVLK ACEKLLKQLE QEGRDNDQPW VKWSEIVRTI TTDDRLKDFS KPARSLMDNV RLHHMLTGVF RSQLKTLLEL IYETDLYLAV AGVAKAKGFS YAQALPKDRN ILEAKGLRHP GLDKGVANSL SFNAGTNVLF LTGANMAGKS TLMKSTGILI YLAHMGFPVA ATEVKFSILD GIYSSVNVPD DLNKGYSHFY AEVLRVKKVA EEVATDKSLF VIFDELFKGT NVKDAYDATL AVTEAFTDFT NCFFIISTHI FEVGHALNNG GSQIAFEFLP TIMNNNVPQY TYQLQKGITT DRQGMIIIEN EGILDML
|
| |