Gene Cphy_1218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_1218 
Symbol 
ID5743317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp1536445 
End bp1538409 
Gene Length1965 bp 
Protein Length654 aa 
Translation table11 
GC content37% 
IMG OID641292323 
ProductDNA mismatch repair protein MutS domain-containing protein 
Protein accessionYP_001558335 
Protein GI160879367 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.215011 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACAA GACAGACGTT AGAATTTCAG AAAATATTAG AAATGTTATG CGAATATGCA 
GTATCAGAAG AAGCAAAAAA GAGTTTGCTT AAGATGGAAC CTAGTCTTAG TGAGACAGAG
GTATGTAATC GAACCAAAGG GACAACAGAA GCTAGGATGA TTTATGATGT ACAAGGAAAT
CCTCCGATGT CAGAGCGAAA AGATATTATG ATGATATTAT CGCTTGCCAA CAAAGGTGGA
ATGTTATCAC CAGAACAACT AACTTTAGTA TCACAGTTTA TCGCTGCCAG CAGACGTTTA
AAAAGTTATC TAACCAAGGC TCAATGCCTT AAGGTAGATT TAGCTTTCTA TGCGGATTCT
TTCACATCAT TAGAGGATTT ACAAGGAATT ATTGATGGAG CAATCAGAAA TAATCAGATC
GATAGCTCGG CATCCAAAGA GCTAAAAGAT ATTAGGCGAA AGATGGAATC CGTAAGTGGA
GCAATGAAAT CAAAATTGGA GGCACTTCTA AGAAGTAAAA AAGAGTATTT TAGCGAAGGG
TTTGTGTCAT TAAGAAATGG GCATTTTGTA CTTCCAGTAA AAAAAGAGTA TAAGCATCAG
GTTTCAGGAA CCGTACATGA TGTTTCCTCT AGCGGTGCAA CGTACTTTAT TGAGCCGGTA
ATTGCAGTTC GCTATAGTGA AGAACTATCA GCCTTAAAAT CAGCAGAAGC AAAAGAGGAA
GCGGTGATTT TATATACGTT AACCTCTCTT GTGATAGAGA ATGAGTTCGA GCTAATGAGA
AATTATGAAA CAATGGGAAT TCTCGACGAA ATATTCGCTA AAGCTAAACT GTCTGCATTT
ATGAAGGCAG TTCCAGCAAG CCTCAATACA GATCGAAAGA TTAGGATAGT GAATGGCAGA
CATCCACTTT TAAACAGAGA GAATTGCGTT CCTCTTAATT TTGAATTTGC AAATGGTATT
CGAGGAGTAA TCATTACCGG GCCTAATACA GGCGGTAAAA CTGTAGCACT AAAAACAGTT
GGATTATTAT CCATGATGGC TCAAAGCGGT CTTCATGTTC CATGTGATGA GGCGGTTTTA
TGTATGAATG ATGCGATTCT TTGTGATATT GGAGATGGTC AAAGTATCAC AGAGAACCTT
TCAACATTCT CAGCTCATAT TACGAACATC ATTGCGATAA TAAAGGAAGT TACGAAAGAT
AGTTTGGTAC TTCTAGATGA GTTAGGCTCA GGAACAGACC CTGCAGAGGG GATGGGGATT
GCAATTTCGA TACTGGAAGA ACTTAAAAAG AAGCAGTGTT TATTTATAGC TACCACTCAC
TACCCACAAG TAAAAGACTA TGCAGCACAG TCAGAGGGAG TTGTGAATGC GAAGATGGCA
TTTGATAGAG AAAGCTTAAA ACCACTCTAT CACTTAGAAG TTGGTGAGGC AGGTGAAAGT
TGTGCTTTGT ACATTGCGAA AAGATTAGGA TTACCAAAGC ACATGCTTTT GATTGCTTAT
CAGAATGCCT ATGATACTAA GGAAAATGGG AAAATTAAAC AAAATAATGA AAGTGAGCTT
TTTTTCGAGA ATAGTCATAT AAACGAGGAA CAAGTAAACA TAGAAAATAC AGGGAATACA
GAAAATACAG CGAGTAAACC CCATATAGAA AAGAAAATTG AGAGTAGGAA AAAGGAGCTT
CCGAAAAAAG CAGCAAGTTT TCACCTTGGG GATTGTGTGA TTGTGTATCC AGAGAAGAAA
ATAGGGATCG TGTATCAAGT GTGTAATGAA AAGGGAGAAA TAGGGATTCA AATTGCAAAA
ACTAAAAAGC TTATTAATTA TAAACGTATA AAACTTCATG TCGCAGCAAC GCAGATGTAT
CCGGAAGATT ATGATTTTTC AATTGTATTT GATACCGTAG CAAACCGAAA GGCCAGACAC
AAGATGGAGA AAGGTCATCA GGAGGGAATG GAGATAAGAT ATTAA
 
Protein sequence
MNTRQTLEFQ KILEMLCEYA VSEEAKKSLL KMEPSLSETE VCNRTKGTTE ARMIYDVQGN 
PPMSERKDIM MILSLANKGG MLSPEQLTLV SQFIAASRRL KSYLTKAQCL KVDLAFYADS
FTSLEDLQGI IDGAIRNNQI DSSASKELKD IRRKMESVSG AMKSKLEALL RSKKEYFSEG
FVSLRNGHFV LPVKKEYKHQ VSGTVHDVSS SGATYFIEPV IAVRYSEELS ALKSAEAKEE
AVILYTLTSL VIENEFELMR NYETMGILDE IFAKAKLSAF MKAVPASLNT DRKIRIVNGR
HPLLNRENCV PLNFEFANGI RGVIITGPNT GGKTVALKTV GLLSMMAQSG LHVPCDEAVL
CMNDAILCDI GDGQSITENL STFSAHITNI IAIIKEVTKD SLVLLDELGS GTDPAEGMGI
AISILEELKK KQCLFIATTH YPQVKDYAAQ SEGVVNAKMA FDRESLKPLY HLEVGEAGES
CALYIAKRLG LPKHMLLIAY QNAYDTKENG KIKQNNESEL FFENSHINEE QVNIENTGNT
ENTASKPHIE KKIESRKKEL PKKAASFHLG DCVIVYPEKK IGIVYQVCNE KGEIGIQIAK
TKKLINYKRI KLHVAATQMY PEDYDFSIVF DTVANRKARH KMEKGHQEGM EIRY