Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1847 |
Symbol | |
ID | 4205204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 2041721 |
End bp | 2044081 |
Gene Length | 2361 bp |
Protein Length | 786 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 642566397 |
Product | recombination and DNA strand exchange inhibitor protein |
Protein accession | YP_699161 |
Protein GI | 110803296 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGATA GAGTTTTAAG AGTTTTAGAA TTTAATAAAA TTAAGGAATT GGTTAAGGGA TATGCCATTA CAAAATCAGC TAAGGAGATG GTTTTAGATC TTAAACCATA TGATTCTGTT TATGATGTAA AAGAACATTT AGAAGAAACA AAAGAAGCTT TAGATATTTT AATGAGAAAA GGAAATCCTC CTTTTGAAGG GCTTTATGAT GTTAAAGAAG CTATAACAAG AGCTGAAAAA GGTGGAGTTT TAAGCATAGA AGGCTTACTT AGAATAGGAA ACATGTTATC TGTAACAAGA AAACTTTCTG ACTTTTTAGC TAGAAAAGAA GAGGAAGAAG AACATAGAAT CCTAGAAGGA ATGAGAGAAG GGCTTATAGT ACTTAGAGGT GTAGAAAGTG CTATATCAAA GGCCATAGTA AGTGAAGATG AAATTGCAGA TTCTGCGAGT GATAAACTTT ATAGTATAAG AAGAAGCTTA AAAGAAAAGA ATTCATCTAT AAGAGATAAA GTTAATTCCA TAGTAAGAAG TAATGCTCAA TATCTTCAAG ATTCCTTATA CACTGTTAGA GGAGATAGAT ATGTTATTCC TGTAAAAGCT GAATATAAAT CTCAAGTTCC AGGACTTGTT CATGATCAAA GTTCAACAGG GGCTACACTT TTTATAGAAC CAACGGCTTT AGTAAACTTA AATAACGAAA TAAAAGAGCT TATGCTAAAG GAAAGAGCTG AGATTGAGAG AATATTAGCA GAATTATCAG CCTTAGTATA TAAAAATATT GATGTTATAA AAGTTAACTT TAATATAATA GTTGAATTAG ATTTTATATT TGCAAAGGCT AAATACGGTA GTGATTTAGG TGGAACAATG CCTATAGTAA ATGAAGAAGG CGTAATAGAT TTAATGGATG CTAGGCATCC ACTAATTCCT AAAGATAAAG TTGTTTCTTC TGATATTTAT TTAGGAAGAG AATTTTCAAC ATTACTTATA ACAGGGCCAA ATACTGGTGG TAAAACTGTA ACCTTAAAGA CTACAGGGCT TATTGAACTT ATGGGCTTAA GTGGACTTTT AATACCAGCA AGTGAAAATT CAAGCATAAG CTTCTTTGAA GAGATATTTG CAGATATAGG AGATGAGCAA AGTATAGAGC AAAGCTTATC AACTTTTTCT TCTCATATGA CTAATATAGT TAAAATAATG GAGAAAGCAA ATAATAAAAG TTTTGTACTT TTTGACGAAC TTGGAGCTGG AACAGACCCT ACAGAGGGAG CTGCACTTGC CATTTCAATA TTAGAAAACT TAAGAGCAAG AGGATGTAGA ATAATGTCTA CAACTCACTA TAGTGAATTA AAGGGATATG CATTAAAAAC TGAAAATGTT GAGAATGCTT CTGTTGAGTT TAATGTTGAA ACCTTAAGAC CAACTTATAG ACTTTTAATA GGAGTTCCAG GGAAATCAAA TGCTTTTGAA ATTTCAAGAA GATTAGGTCT TAAAGATAAT ATCATAGAAG AAGCTAAAAA GGTTATTTCT ACTGAATCAC TTCAATTTGA AGATTTAATA CAATCACTTC AAGAAAAGAG TATAAAAGCA GAAAATGATG CTAGAGAAGC TGCTATATTA AGAAATGATG CAGAAAAATA TAAGAATAGA TATAAAGAGA AATTTGAAAG AATTGAAAGC GTAAGAGATA ATGTTTATGC AGATGCTAGA AGAGAAGCAA AGCAAATTTT AGATTCAGCT AAGGAAGAGG CTGATACTAT TCTTAAAAAT ATGAGAGACC TAGAAAGAAT GGGGATTTCT AGTGATGCAA GAAGAAAGCT TGAAGCTGAA AGAGGAAAGC TTAGAGATAA AATAAGTGAT GCAGAAGCTA GACTTCAAAA GAAAAAAGAA GAGCAAAAGG GAGAAGAACT TAAAAAGATT GAGGTTGGAA TGGAAGCCTT ATTACCTTCA ATAAACCAAA AAGTCATAGT TCTTTCTAAG CCAGATAATA AAGGTGAAGT TCAAGTTCAA GCTGGAATTA TGAAAATAAA TGTTAAGGCT AAGGATTTAA GAGTAGCCAA GGAAACTAAA GAAGAAAAGA AAATTAAAAA GAGAGAAGCA AGATTAAACT TAAGACAGGT GGATCCATCA ATAGACTTAA GAGGTATGGA TTCAGAAGAA GCTTGTTACA CTGCAGATAA GTATTTAGAT GATGCTTATG TTGCAGGAAG AGGAGAAGTA ACTTTAGTTC ATGGAAAAGG TACTGGTGTT TTAAGAAAAG CTATAAACGA TATGCTAAAA AAACATCCTC ATGTAAAATC TCACAGACTA GGTGAGTATG GAGAGGGTGG AACAGGAGTT ACTGTTGTAA TATTAAAATA A
|
Protein sequence | MNDRVLRVLE FNKIKELVKG YAITKSAKEM VLDLKPYDSV YDVKEHLEET KEALDILMRK GNPPFEGLYD VKEAITRAEK GGVLSIEGLL RIGNMLSVTR KLSDFLARKE EEEEHRILEG MREGLIVLRG VESAISKAIV SEDEIADSAS DKLYSIRRSL KEKNSSIRDK VNSIVRSNAQ YLQDSLYTVR GDRYVIPVKA EYKSQVPGLV HDQSSTGATL FIEPTALVNL NNEIKELMLK ERAEIERILA ELSALVYKNI DVIKVNFNII VELDFIFAKA KYGSDLGGTM PIVNEEGVID LMDARHPLIP KDKVVSSDIY LGREFSTLLI TGPNTGGKTV TLKTTGLIEL MGLSGLLIPA SENSSISFFE EIFADIGDEQ SIEQSLSTFS SHMTNIVKIM EKANNKSFVL FDELGAGTDP TEGAALAISI LENLRARGCR IMSTTHYSEL KGYALKTENV ENASVEFNVE TLRPTYRLLI GVPGKSNAFE ISRRLGLKDN IIEEAKKVIS TESLQFEDLI QSLQEKSIKA ENDAREAAIL RNDAEKYKNR YKEKFERIES VRDNVYADAR REAKQILDSA KEEADTILKN MRDLERMGIS SDARRKLEAE RGKLRDKISD AEARLQKKKE EQKGEELKKI EVGMEALLPS INQKVIVLSK PDNKGEVQVQ AGIMKINVKA KDLRVAKETK EEKKIKKREA RLNLRQVDPS IDLRGMDSEE ACYTADKYLD DAYVAGRGEV TLVHGKGTGV LRKAINDMLK KHPHVKSHRL GEYGEGGTGV TVVILK
|
| |