Gene CHU_2236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_2236 
SymbolmutS 
ID4183770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp2604405 
End bp2606177 
Gene Length1773 bp 
Protein Length590 aa 
Translation table11 
GC content39% 
IMG OID638072237 
ProductDNA mismatch repair protein 
Protein accessionYP_678841 
Protein GI110638632 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGCAT ATATTTCATT AAAAAATACC TACAGCCATC AGTTAATCAG GCTTAAGCAA 
ACACACACCT GGATCAGTAT AGTGCGGATT GCACTTGTTT TTATTGCCCT GATTTGCTTT
TATTTTTTTA TAACCAGTTT TGAATCTGGA GCCTTTATTA TTGGTCTTAT TTCAATCATC
GCTTTTGTTT TTGTTCTGGT TTGGCATAGG AAAAAATCAG CTGAGATCCT GTTTAAAGAA
ACGCTTGTAA CCATTATTTC TCAGGAGATT GCATATCTGG AAAATAAAGA ATTGCCTTTT
GAAAACGGAG CTGACTATAA TGAAACGAAT CATCCCTATA CATATGATCT CGATATTTTC
GGGTACAGAA GTTTGTTTCA GCATCTGAAC AGAACAGCAA CATACCTTGG TAAGACAAGG
CTGGCAAATT CGTTAAAGCA TATATTACCC AATGAAGCTA TTGAAAAGAA TCAGCAAGCC
ATAAAAGAGC TGTCAGAAAA ACTAACCTGG CGGCAGGAGA TACTGGCACG GGCTAAAATG
GCAAACGATA CGAAAGAAAT TTATACTAGC ATTCTTACCT GGTCTACGAA AAAAGCAGGA
GAAGTGCCCG TATACATGCG TATTATTTCT TTTGTGTTTC CAGTGGCATT GTTTATACTA
TTTGGGATGG CAGCAATTTC TGATTCAGGC ATATATATGA AGGCCGCTGA AATTTTATTT
GTAGTGAATC TGATCATTAT CGGGCTACAT CTTAAAACCA TAAAGGCAGA ATTGTTTCAT
GCCGATAAAA TTGAAGTAAT CATTCAGCAG TACAGTCTTA TTCTTGAAAA GATAGAAGGA
GAAACATTTT CTGCTGCACG TTTAATAGAG CTGAAGGATC AATTGCTGCA TGCGGATATT
TCTGCAAGTG CCCATTTAAA TACACTCTCA AAATTATTTG CCCATGTAGA AACAATTAAT
AATGCGGTGG GTTCGCTGTT TATGAATGGA CTGTTTATGT ATCACATACA CAGCTTACAG
GCACTATTGA AATGGAAAGA AAGGCATGCT TTTAGAATTG CCGAATGGAT CTCGGTAATT
GGTGAGATAG AAATGCTGAA CAGCTATGCA AACCTGTCCT ATAATAATCC GGATTTTATT
TTCCCGGCTT TGCGTACAGA TTACAGCATT CAACTGACAG GTGCGGGCCA TCCGCTGATT
GATAAAAAGA AACGTATCTG TAACGATGTC GTTTTTAATA CAGGGAATTT TATTATTCTT
ACCGGTTCAA ATATGTCCGG GAAAAGTACG TTTCTGCGTA CGCTTGGCGT AAATATGGTA
TTAGCGGGAG CAGGCGCTCC CGTGTGCGCT TCTGCTGCAC AGATTCATCC GCTGCCTGTC
ATTGTATCCA TGCGTTTATC CGATTCATTG TCTGATAGCG AATCGTATTT CTTTGCGGAA
GTAAAACGGC TGAAACAATT GATGCAGATG CTGGATGAGC AGATGTGCTT TGTCTTGCTG
GATGAAATTT TGCGTGGCAC CAATTCGGAT GATAAGCGTA TTGGTACTAT TGAAGTGATA
AAAAAGATTG TGGCAAAGAA CGCAATCGGT ATTGTTGCCA CGCACGATCT GGAAGTGTGT
AACACAACAC AGGAATATCC GGAAAAACTC TCTAACAAAT GCTTTGAGGT ACAAATAATC
AACGATGAAC TGGTCTTTGA TTACAAGCTT CGTGAAGGCA TCTGTAAAAA TAAGAGCGCC
ACTTTTTTGA TGAAAAAGAT GGGTGTAATA TAA
 
Protein sequence
MQAYISLKNT YSHQLIRLKQ THTWISIVRI ALVFIALICF YFFITSFESG AFIIGLISII 
AFVFVLVWHR KKSAEILFKE TLVTIISQEI AYLENKELPF ENGADYNETN HPYTYDLDIF
GYRSLFQHLN RTATYLGKTR LANSLKHILP NEAIEKNQQA IKELSEKLTW RQEILARAKM
ANDTKEIYTS ILTWSTKKAG EVPVYMRIIS FVFPVALFIL FGMAAISDSG IYMKAAEILF
VVNLIIIGLH LKTIKAELFH ADKIEVIIQQ YSLILEKIEG ETFSAARLIE LKDQLLHADI
SASAHLNTLS KLFAHVETIN NAVGSLFMNG LFMYHIHSLQ ALLKWKERHA FRIAEWISVI
GEIEMLNSYA NLSYNNPDFI FPALRTDYSI QLTGAGHPLI DKKKRICNDV VFNTGNFIIL
TGSNMSGKST FLRTLGVNMV LAGAGAPVCA SAAQIHPLPV IVSMRLSDSL SDSESYFFAE
VKRLKQLMQM LDEQMCFVLL DEILRGTNSD DKRIGTIEVI KKIVAKNAIG IVATHDLEVC
NTTQEYPEKL SNKCFEVQII NDELVFDYKL REGICKNKSA TFLMKKMGVI