Gene CPR_1847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1847 
Symbol 
ID4205204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2041721 
End bp2044081 
Gene Length2361 bp 
Protein Length786 aa 
Translation table11 
GC content32% 
IMG OID642566397 
Productrecombination and DNA strand exchange inhibitor protein 
Protein accessionYP_699161 
Protein GI110803296 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGATA GAGTTTTAAG AGTTTTAGAA TTTAATAAAA TTAAGGAATT GGTTAAGGGA 
TATGCCATTA CAAAATCAGC TAAGGAGATG GTTTTAGATC TTAAACCATA TGATTCTGTT
TATGATGTAA AAGAACATTT AGAAGAAACA AAAGAAGCTT TAGATATTTT AATGAGAAAA
GGAAATCCTC CTTTTGAAGG GCTTTATGAT GTTAAAGAAG CTATAACAAG AGCTGAAAAA
GGTGGAGTTT TAAGCATAGA AGGCTTACTT AGAATAGGAA ACATGTTATC TGTAACAAGA
AAACTTTCTG ACTTTTTAGC TAGAAAAGAA GAGGAAGAAG AACATAGAAT CCTAGAAGGA
ATGAGAGAAG GGCTTATAGT ACTTAGAGGT GTAGAAAGTG CTATATCAAA GGCCATAGTA
AGTGAAGATG AAATTGCAGA TTCTGCGAGT GATAAACTTT ATAGTATAAG AAGAAGCTTA
AAAGAAAAGA ATTCATCTAT AAGAGATAAA GTTAATTCCA TAGTAAGAAG TAATGCTCAA
TATCTTCAAG ATTCCTTATA CACTGTTAGA GGAGATAGAT ATGTTATTCC TGTAAAAGCT
GAATATAAAT CTCAAGTTCC AGGACTTGTT CATGATCAAA GTTCAACAGG GGCTACACTT
TTTATAGAAC CAACGGCTTT AGTAAACTTA AATAACGAAA TAAAAGAGCT TATGCTAAAG
GAAAGAGCTG AGATTGAGAG AATATTAGCA GAATTATCAG CCTTAGTATA TAAAAATATT
GATGTTATAA AAGTTAACTT TAATATAATA GTTGAATTAG ATTTTATATT TGCAAAGGCT
AAATACGGTA GTGATTTAGG TGGAACAATG CCTATAGTAA ATGAAGAAGG CGTAATAGAT
TTAATGGATG CTAGGCATCC ACTAATTCCT AAAGATAAAG TTGTTTCTTC TGATATTTAT
TTAGGAAGAG AATTTTCAAC ATTACTTATA ACAGGGCCAA ATACTGGTGG TAAAACTGTA
ACCTTAAAGA CTACAGGGCT TATTGAACTT ATGGGCTTAA GTGGACTTTT AATACCAGCA
AGTGAAAATT CAAGCATAAG CTTCTTTGAA GAGATATTTG CAGATATAGG AGATGAGCAA
AGTATAGAGC AAAGCTTATC AACTTTTTCT TCTCATATGA CTAATATAGT TAAAATAATG
GAGAAAGCAA ATAATAAAAG TTTTGTACTT TTTGACGAAC TTGGAGCTGG AACAGACCCT
ACAGAGGGAG CTGCACTTGC CATTTCAATA TTAGAAAACT TAAGAGCAAG AGGATGTAGA
ATAATGTCTA CAACTCACTA TAGTGAATTA AAGGGATATG CATTAAAAAC TGAAAATGTT
GAGAATGCTT CTGTTGAGTT TAATGTTGAA ACCTTAAGAC CAACTTATAG ACTTTTAATA
GGAGTTCCAG GGAAATCAAA TGCTTTTGAA ATTTCAAGAA GATTAGGTCT TAAAGATAAT
ATCATAGAAG AAGCTAAAAA GGTTATTTCT ACTGAATCAC TTCAATTTGA AGATTTAATA
CAATCACTTC AAGAAAAGAG TATAAAAGCA GAAAATGATG CTAGAGAAGC TGCTATATTA
AGAAATGATG CAGAAAAATA TAAGAATAGA TATAAAGAGA AATTTGAAAG AATTGAAAGC
GTAAGAGATA ATGTTTATGC AGATGCTAGA AGAGAAGCAA AGCAAATTTT AGATTCAGCT
AAGGAAGAGG CTGATACTAT TCTTAAAAAT ATGAGAGACC TAGAAAGAAT GGGGATTTCT
AGTGATGCAA GAAGAAAGCT TGAAGCTGAA AGAGGAAAGC TTAGAGATAA AATAAGTGAT
GCAGAAGCTA GACTTCAAAA GAAAAAAGAA GAGCAAAAGG GAGAAGAACT TAAAAAGATT
GAGGTTGGAA TGGAAGCCTT ATTACCTTCA ATAAACCAAA AAGTCATAGT TCTTTCTAAG
CCAGATAATA AAGGTGAAGT TCAAGTTCAA GCTGGAATTA TGAAAATAAA TGTTAAGGCT
AAGGATTTAA GAGTAGCCAA GGAAACTAAA GAAGAAAAGA AAATTAAAAA GAGAGAAGCA
AGATTAAACT TAAGACAGGT GGATCCATCA ATAGACTTAA GAGGTATGGA TTCAGAAGAA
GCTTGTTACA CTGCAGATAA GTATTTAGAT GATGCTTATG TTGCAGGAAG AGGAGAAGTA
ACTTTAGTTC ATGGAAAAGG TACTGGTGTT TTAAGAAAAG CTATAAACGA TATGCTAAAA
AAACATCCTC ATGTAAAATC TCACAGACTA GGTGAGTATG GAGAGGGTGG AACAGGAGTT
ACTGTTGTAA TATTAAAATA A
 
Protein sequence
MNDRVLRVLE FNKIKELVKG YAITKSAKEM VLDLKPYDSV YDVKEHLEET KEALDILMRK 
GNPPFEGLYD VKEAITRAEK GGVLSIEGLL RIGNMLSVTR KLSDFLARKE EEEEHRILEG
MREGLIVLRG VESAISKAIV SEDEIADSAS DKLYSIRRSL KEKNSSIRDK VNSIVRSNAQ
YLQDSLYTVR GDRYVIPVKA EYKSQVPGLV HDQSSTGATL FIEPTALVNL NNEIKELMLK
ERAEIERILA ELSALVYKNI DVIKVNFNII VELDFIFAKA KYGSDLGGTM PIVNEEGVID
LMDARHPLIP KDKVVSSDIY LGREFSTLLI TGPNTGGKTV TLKTTGLIEL MGLSGLLIPA
SENSSISFFE EIFADIGDEQ SIEQSLSTFS SHMTNIVKIM EKANNKSFVL FDELGAGTDP
TEGAALAISI LENLRARGCR IMSTTHYSEL KGYALKTENV ENASVEFNVE TLRPTYRLLI
GVPGKSNAFE ISRRLGLKDN IIEEAKKVIS TESLQFEDLI QSLQEKSIKA ENDAREAAIL
RNDAEKYKNR YKEKFERIES VRDNVYADAR REAKQILDSA KEEADTILKN MRDLERMGIS
SDARRKLEAE RGKLRDKISD AEARLQKKKE EQKGEELKKI EVGMEALLPS INQKVIVLSK
PDNKGEVQVQ AGIMKINVKA KDLRVAKETK EEKKIKKREA RLNLRQVDPS IDLRGMDSEE
ACYTADKYLD DAYVAGRGEV TLVHGKGTGV LRKAINDMLK KHPHVKSHRL GEYGEGGTGV
TVVILK