Gene Cphy_2381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2381 
Symbol 
ID5742451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp2933768 
End bp2936566 
Gene Length2799 bp 
Protein Length932 aa 
Translation table11 
GC content38% 
IMG OID641293471 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_001559481 
Protein GI160880513 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACAAC TAACTCCAAT GATGCAGCAA TATGTGGAGA CAAAAGAACA ATATAAGGAT 
TGTATTCTTT TTTATCGTTT GGGTGACTTC TATGAAATGT TCTTTGAAGA TGCCTTAGTA
GCCTCTAAGG AATTAGAGAT AACCTTAACC GGGAAGAATT GCGGGCAAGA GGAAAGAGCT
CCTATGTGTG GGATACCTTA CCATGCCGCA GAAGGGTATA TTTCTAAGCT AATTGGGAAG
GGATATAAAG TTGCGATCTG TGAACAAGTA GAAGATCCTA AGTTAGCGAA AGGAATTGTA
AAACGAGAGG TTATCCGTAT CGTTACGCCT GGAACGAATC TAAATACCCA GACATTAGAT
GAAACGAGAA ATAATTATCT TATGGGAATT ATCTTTACCG ACGAACATTG CGGTATATCA
ACGGTTGATA TTACAACAGG TGATTACTAC GTAACTGAGG TCGAGAACAA CCGTAAGATT
TTAGATGAAA TATATAAATA TACACCTTCG GAAATTGTTT GTAATCCAGA ATTTTTTCAC
TGTGGGCTAG ATGTTGAAGA TTTAAAAAAT AGATATCAGA TAGCAGTATC CACCTTTGAG
GACTGGTATT ATGACAGCGA ACAAAGTGTT AAGACATTAA AGGAACATTT TAAAGTAGGC
TCTTTAGACG GTCTAGGATT AAAAGATTAT TCTGTCGGGG TGAATGCAGC TGGTGCTATC
TTAAAGTACC TTTATAACAC TCAGAAGAAT TCACTTAGTC ATTTGACACA TATAACGCCA
TACGTTACAA GTCGCTATAT GGTGATAGAC AGTTCCAGTA GAAGAAATCT AGAATTGACG
GAGACACTTC GTGAAAAGCA AAAACGAGGG TCTCTTCTTT GGGTATTAGA TAAAACAAAA
ACAGCCATGG GAGCTAGAAT GCTCCGTAGT TTTGTAGAAC AGCCACTGAT CACAATGGAT
GAGATTTCAG CTCGTTATGA TGCGATTTCA GAACTGAACG ACAATGTGAT AACGCGGGAA
GAAATACGAG AATACTTAAA TTACATTTAT GATTTGGAAC GCTTGATGGG AAAAATCAGC
TATAAGAGTG CAAATCCAAG AGATTTAATT GCCTTTGCTT CTTCACTATC TATGCTTCCA
CATATCAAAT ACTTGTTATC AACCTGCGAA TCCGCATTGT TAAAACAAAT TCATGAGGAG
ATGGATGCTC TTGATGACTT ACAAAACTTA ATTGATCGCT CTATAGCAGA AGAACCACCG
ATTGGAATCA AAGAGGGTGG CATCATAAAA GAAGGTTTCC ATACAGAAGT TGATACCCTT
CGAAAAGCGA AAACAGAAGG GAAAGTATGG CTTGCAGAAC TGGAAGCGAA AGAAAAAGAG
CAGACAGGAA TTAAGAATCT AAAGGTAAAA TACAATCGTG TCTTTGGATA TTACCTAGAA
GTGACGAATT CTTATGCAAA TCTGGTACCG GAAAACTGGA TAAGAAAGCA AACGTTATCA
AATGCCGAAC GTTATACAAC ACCAGAACTT AAGGAATTAG AAGATAAGAT ATTAAATGCA
GAGGATCGTT TATTCTCTCT TGAGTATGAT TTATTTGCCG AAATTAGAGA TCAAATCGCT
GAAGAAGTAA AACGAATTCA AAAAACTGCA AAAGCGGTAG CGAACATTGA TGCGTTTGCT
TCACTTGCCT ATGTTGCAGA AAGAAATCAA TTTATCCGTC CTGAGTTAAA TACCAACGGA
ACGATTGACA TAAAAGAGGG AAGACATCCA GTTGTAGAAC AAATGATACC AAACGATATG
TTTGTGTCAA ATGATACGTA TCTTGATAAT GCTGAGAAAA GAATCTCCAT TATCACAGGT
CCTAACATGG CTGGTAAATC TACCTATATG AGACAAACAG CGTTAATTGT ATTAATGGCT
CAAGTAGGAA GCTTTGTTCC TGCATCTTAT GCAAACATTG GTATTGTTGA TCGTATTTTT
ACCAGGGTAG GTGCGTCTGA TGATTTAGCA AGCGGTCAGA GTACCTTTAT GGTGGAGATG
ACGGAGGTGG CGAATATCCT TCGAAATGCT ACGAAAAACA GTTTATTAAT CTTAGATGAA
ATTGGCCGTG GTACGAGTAC GTTTGACGGA CTAAGTATTG CATGGGCAGT TATTGAACAT
ATCAGTAATA CATCAATGCT TGGTGCAAAG ACATTATTTG CGACGCATTA CCATGAGTTA
ACAGAATTAG AAGGCAAGAT ATCCGGTGTT AATAATTACT GCATTGCGGT GAAAGAACAA
GGAGAAGATA TTGTCTTTCT TCGAAAGATT ATAGGAGGCG GAGCGGATAA GAGTTATGGC
ATTCAAGTTG CAAAACTTGC CGGTGTTCCA AACTCGGTAT TAGTAAGAGC AAGAGAAATT
GTGGATCAGC TAAGTGAGAA TGACATTGCA GAAAAAGCAA GACATATTGT GTCTGCTGCG
GAAATTTCCA ATCTTACACC AGAAACCGAA GGCGAAGTGA ATACCAATAA AATGTATACC
ACTAAAGTGA ATGCAACTGA AGTGATTACA ACTGAAGTGA ATACAGCTAA AATGAATACC
ACTGAAATGG TAAGTAATCA GGAGTCTGTA GAACAGCCAA GAAACTTTGG CCAGATGTCA
TTTTTCATAA CAGAAGATAC AAAACAGAAA AAAGCGTCCT CAGAATTTTC TGAAAAGTTA
GTGCAGGAAA TAAATCAGTT TGACCTTGCC AATATGACTC CGGTGGAAGC ATTGTTAAAA
TTGGATAAAT TACAGAAAAA AATACGTTCT CACACTTAA
 
Protein sequence
MAQLTPMMQQ YVETKEQYKD CILFYRLGDF YEMFFEDALV ASKELEITLT GKNCGQEERA 
PMCGIPYHAA EGYISKLIGK GYKVAICEQV EDPKLAKGIV KREVIRIVTP GTNLNTQTLD
ETRNNYLMGI IFTDEHCGIS TVDITTGDYY VTEVENNRKI LDEIYKYTPS EIVCNPEFFH
CGLDVEDLKN RYQIAVSTFE DWYYDSEQSV KTLKEHFKVG SLDGLGLKDY SVGVNAAGAI
LKYLYNTQKN SLSHLTHITP YVTSRYMVID SSSRRNLELT ETLREKQKRG SLLWVLDKTK
TAMGARMLRS FVEQPLITMD EISARYDAIS ELNDNVITRE EIREYLNYIY DLERLMGKIS
YKSANPRDLI AFASSLSMLP HIKYLLSTCE SALLKQIHEE MDALDDLQNL IDRSIAEEPP
IGIKEGGIIK EGFHTEVDTL RKAKTEGKVW LAELEAKEKE QTGIKNLKVK YNRVFGYYLE
VTNSYANLVP ENWIRKQTLS NAERYTTPEL KELEDKILNA EDRLFSLEYD LFAEIRDQIA
EEVKRIQKTA KAVANIDAFA SLAYVAERNQ FIRPELNTNG TIDIKEGRHP VVEQMIPNDM
FVSNDTYLDN AEKRISIITG PNMAGKSTYM RQTALIVLMA QVGSFVPASY ANIGIVDRIF
TRVGASDDLA SGQSTFMVEM TEVANILRNA TKNSLLILDE IGRGTSTFDG LSIAWAVIEH
ISNTSMLGAK TLFATHYHEL TELEGKISGV NNYCIAVKEQ GEDIVFLRKI IGGGADKSYG
IQVAKLAGVP NSVLVRAREI VDQLSENDIA EKARHIVSAA EISNLTPETE GEVNTNKMYT
TKVNATEVIT TEVNTAKMNT TEMVSNQESV EQPRNFGQMS FFITEDTKQK KASSEFSEKL
VQEINQFDLA NMTPVEALLK LDKLQKKIRS HT