Gene Cphy_0153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_0153 
Symbol 
ID5744307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp197521 
End bp199911 
Gene Length2391 bp 
Protein Length796 aa 
Translation table11 
GC content38% 
IMG OID641291245 
ProductMutS2 family protein 
Protein accessionYP_001557281 
Protein GI160878313 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.323958 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAAA AAGCGTTACG TACATTAGAA TATCATAAAA TTATAGAAAA ACTCAGTGCT 
CTTGCTGGTT CTTCCCTTGG ACGGGAAAAA TGTCATCAGC TATTGCCACT TGTTAAGTTA
GAAGATATTG TTCAGATGCA GCAAGAGACA ACGGATGCAC TAACAAGACT CTATGCAAAG
GGGACACTCT CCTTCTCCGG AATACCAGAT ATAAGAGATA CTCTGATGAG ATTAGAGATA
GGTGCATCTT TGGGAGCTGG AGAGTTATTA AAAATCAGCT CTGTTTTAAC TGCAACCTTA
AGAGCTAAAA ATTATGGCTA TAATCAAAAA AATAACGAGG AAACAGAGGA AGCTGCACAG
GATACCTTAA CAGAACGTTT CCATTTGTTA GAACCGTTAT CTCCAATCAA TAATGAAATC
AGACGTTGTA TTATCTCTGA AGAAGAAATT GCAGATGATG CTAGCCCAGG ATTAAAGAGT
GTAAGAAGAC AGATTAAAAT AACAAATGAT AAGATTCACG AATCGCTTGG AAGTATTTTA
AATTCTGCTT CTACCAAGGG AATGCTTCAG GATGCTATTA TTACGATGAG AAACGGTAGA
TATTGTCTTC CGATTAAGCA GGAATATAAG AATACCTTCC AAGGTATGAT GCATGACCAG
TCTTCTACTG GATCAACTGC TTTCATTGAA CCAATGGCGA TTGTTAAGTT AAATAATGAA
CTTGCAGAAC TTGCGGTAAG AGAGCAAGAG GAGATTGAGA AGATTCTTGC TGAACTTAGT
AATTTGGTAG CCACGGAGAA ATATAATCTT AAGTATAATC AGACTACGCT TGCAGAACTA
GATTTTATCT TCGCACGTGC GGGGCTGTCA AAGAATATGA AAGCAAGTCA GCCACATTTT
AATAATCGTC ACTATATTAA TATCAAGAAG GGCCGTCACC CACTGATTGA CCCAAAGAAG
GTCGTTCCGA TTGATATCTA TTTTGGCGAT AAATTTGATC TATTAGTAAT CACAGGGCCA
AACACGGGTG GTAAAACCGT ATCATTAAAA ACAGTTGGCT TGTTTACTTT AATGGGACAA
GCGGGTTTAC ACATTCCGGC ATTCGATGGT TCGGAATTAT CAATTTTTGA GGAAGTATAT
GCAGATATTG GTGATGAGCA GAGTATTGAA CAAAGTTTAA GTACCTTCTC TTCTCATATG
ACCAATACCG TGTCTATTTT AGAACATGCC AACGAGAATT CCCTAGTCCT ATTTGATGAG
CTTGGTGCCG GTACAGATCC AACGGAAGGT GCTGCACTTG CGATGGCAAT TCTTAGTTAC
CTTCATCAAA GAAAGATTCG TACCATGGCA ACTACACATT ATAGCGAGTT AAAAATATTC
GCTCTCTCCA CAGATGGTGT CTCCAATGCA TGTTGCGAAT TTAGCGTGGA AACCCTTCAA
CCTACGTATC GTTTATTAAT CGGTATTCCT GGTAAGAGTA ATGCATTTGC AATATCTTCA
AAGCTTGGTT TATCGAATTA TATCATTGAA AAGGCAAGAG AATTTATCGG TACGAAGGAC
GAGAGTTTTG AAGACGTCAT TAGTAATCTA GAGGCTAGCC GTATTGCGAT GGAGAAAGAC
AAAGCAGAGG CGGAGCAATA CAAAAAAGAA GTAGAAGAAT TAAAACGAAA GTTAGCAGAG
AAGAATTCAA AGATTGATGA TGCAAAGGAT CGAATTCTTA GAGAAGCAAA TGAAAAAGCT
AGGACTATTC TTCAGGAAGC CAAGGACTAT GCGGATGAAA CTATTCGTAA GTATAATAAA
TGGGGTGCGG GTGGCGCCAA CAATAAAGAG ATGGAGAATG AACGTGCTGC TTTACGTGAA
AAGCTTGGAG ATACGGATTC TAGTCTAGTT TCTAAAGCGA AGAAGAACCG CAAGCAGCAT
AAACCTTCTG ACTTTAAAGT TGGTGACTCT GTCCATGTTA TTAGTTTAAA TTTAAAGGGT
TCCGTAAGTA CTCTTCCAAA TGCGAAGGGA GACTTATATG TACAAATGGG AATATTACGA
TCCCTTGTAA ATATTTCAGA CCTAGAGTTA ATCGATGAAG AAACTATTGT TGCGAAGGCT
TTGACAAAGA CCCAAAGCGG AAAGATTCGT ATGAGTAAAT CTATGTCAAT AAGTCCAGAG
TTAAATATTA TTGGAAAACG TGTGGATGAA GCACTTCCAC TCGTAGATAA ATACTTAGAT
GATGCTTATC TTGCTCACCT GCCACAAGTT ACGATTATTC ATGGTAGAGG TACAGGTGCG
TTAAAAGAAG CTGTTCATGC ACATCTAAAA AGAACCAATT ATGTAAAGGG CTATCGTGTA
GGTGGATTTG GCGAAGGTGA CCATGGGGTT ACGATTGTTG AATTTAAGTA A
 
Protein sequence
MNEKALRTLE YHKIIEKLSA LAGSSLGREK CHQLLPLVKL EDIVQMQQET TDALTRLYAK 
GTLSFSGIPD IRDTLMRLEI GASLGAGELL KISSVLTATL RAKNYGYNQK NNEETEEAAQ
DTLTERFHLL EPLSPINNEI RRCIISEEEI ADDASPGLKS VRRQIKITND KIHESLGSIL
NSASTKGMLQ DAIITMRNGR YCLPIKQEYK NTFQGMMHDQ SSTGSTAFIE PMAIVKLNNE
LAELAVREQE EIEKILAELS NLVATEKYNL KYNQTTLAEL DFIFARAGLS KNMKASQPHF
NNRHYINIKK GRHPLIDPKK VVPIDIYFGD KFDLLVITGP NTGGKTVSLK TVGLFTLMGQ
AGLHIPAFDG SELSIFEEVY ADIGDEQSIE QSLSTFSSHM TNTVSILEHA NENSLVLFDE
LGAGTDPTEG AALAMAILSY LHQRKIRTMA TTHYSELKIF ALSTDGVSNA CCEFSVETLQ
PTYRLLIGIP GKSNAFAISS KLGLSNYIIE KAREFIGTKD ESFEDVISNL EASRIAMEKD
KAEAEQYKKE VEELKRKLAE KNSKIDDAKD RILREANEKA RTILQEAKDY ADETIRKYNK
WGAGGANNKE MENERAALRE KLGDTDSSLV SKAKKNRKQH KPSDFKVGDS VHVISLNLKG
SVSTLPNAKG DLYVQMGILR SLVNISDLEL IDEETIVAKA LTKTQSGKIR MSKSMSISPE
LNIIGKRVDE ALPLVDKYLD DAYLAHLPQV TIIHGRGTGA LKEAVHAHLK RTNYVKGYRV
GGFGEGDHGV TIVEFK