Gene Information |
Locus tag | RoseRS_3706 |
Symbol | |
ID | 5210685 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 4637360 |
End bp | 4640617 |
Gene Length | 3258 bp |
Protein Length | 1085 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640597299 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001278010 |
Protein GI | 148657805 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
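
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon is a stop codon excluded from the protein length; both assumptions match the values in this record but are not stated by it. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 4637360
end_bp = 4640617

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "3258 1085"
```

Both results agree with the Gene Length (3258 bp) and Protein Length (1085 aa) fields.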
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0191943 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCCTG CCGAGCGCCG CGCCTTCGAG CGCCAGTTGC AGCAAGAGTT CCCCGGTCTC GAGTTGCACG CCTGGTATCG CCAGTATCGC AGCCTCAAAG CTGCGCACCC CGACGCTATT CTGCTCTACC GTCTGGGCGA TTTTTACGAG ACGTTCGACG ATGATGCCAA ACTGGTCGCC GATCTGCTCG AAGTGACGCT GACTTACAAA GAATTCGCCA GCCAGAAGGG GCGCGACCAG AAGCAGCGCT GCCCGATGGC AGGCATACCG TACCACGCGG TCGAAGGGTA TGTGGCGCGG CTCGTCGGCG CCGGGTATCG CGTCGCTATC GCCGAGCAGA TCACTGAGAC CCCTTCCAGT CGCACCGATA CGCGCCCACG TTCGATCTTC GCCGCTGGCA TTGAGCAAAC GTCGCTCACC GGCAGCAACC GGATGGTCGA GCGCAAGGTG GTACGGGTGA TAACGCCCGG CACGATCATC GAAAGCGGCA TGATCCCTGC CGAGCGCAAC AACTACCTTG CCGCTCTGAT CGCCGATCAC GGGCGGATCG GTCTGGCATA CGCCGACCTG AGCACCGGTG AGTTCGCTGC CGTCGAGTTC AGCGGCGAAC GTGCGGCGCA ACAGGCGCAG GGAGAACTGA CGCGCCTCAA TGCCGCCGAA ATCCTGGTTC CCGACCGCGC CGACCTGCGC CTGCCCGGTC TTGAGCCGTC CAGCGCTCGC CTCGAGCAGG ACCTGGAGTT CCTCACCCGC GAAGAGCGCG AACTGCTCCT TCCGGGTGAA CGGGTGGCGC GGCGTGTTGA GCGTGAAAAC AACGCGCGCT GGGCGCACGG GCGCGTGACC GCCTGGCCCG AACGACGCTG GGATCTGCGT AATGCGCACG ATACGCTGCT CCACCAGTTC GGCGTCCGTT CGCTCGCCGG TTTCGGGCTG GAAGACCGTC CGCTGGCGAT CCGCGCCGCT GGCGCTATCG TGCAGTACGC GCGCGAAACG CAGCAGGGAG TGGTCGCAAA TCTGCGCTCG ATCCGCGCTT ACACCCCCGG CGATGCGATG TTCCTCGATC CGCAGACTCA GCGGAACCTC GAATTGCTGG AAGGCGCCAG CGGCACGACG CGCGGCTCGT TGATCGGTGT GCTCGATCAG ACGCGCACAC CAATGGGGGC GCGCCTGCTG CGCCGCTGGG TTTCGCAGCC GCTGTGCGAT CTGACGCGGC TGCATGCCCG CCACGACGCA GTGGAACGCT TTGTCACCGA TGCCATCCTG CGCGCATCGG TGCGCGAAAC CCTGCGACGA GTCGGCGACA TGGAACGGGT CGTCAACCGG ATCATTCAGG GGGTTGGGGT GGCAACGCCG CGCGATATGG CGCGGTTGCG CGATGCGTTG CGTGCGCTCC CTGAACTTGT CGCCGCGCTC GGCGATTGGA CGCCGCCGCC GGGCGAGATC GATCTCACTG GTGTGCGCAC CCTCCCGCCG TCCGCAGCGA CCGATCCCGT GTCGCTGACG GAAAACGGCA ATGAGCGCAT CGAACCGAAC CAGACGTCGG CGGTCAGTCT GCGCGCACAG CGCGAGGCGC GTCGGCGGGT ATCGGCGCGC TACGCCGATG AAGACCTGTT TGGGGAGGAA GAGCAAAACG CACCGCCCGT CGGGTCGTCG AACCATGCCG TCGGAACACA ACCATCGGCA GATGATGAAG CCTCTCCCGT CACAGCGTTC GTGGACGACC AGGCGACCGA AACGCTGGCG CTTGATCCGT GCGCCGATAT GCTGGCATTC CTGGAGACCG CCATCGATGA CGAGCCGCCT 
GCGCTGCTTG GCGCGTCCAA CTATCTGCGC GCAGGGGAGA ACGGCGAGCC GCCGCGTCGC GTTATCCGCC CCGGTTTTGA ACCGGAGATC GATCAGGTGG TGGCGGCAAG CCGTGACGCA CAGCGCTGGA TCAGCGAACT TGAACCGAAA GAGCGGGAGC GCACCGGCAT CAAGTCGCTG CGCGTCGATT ATAACCGCGT CTTTGGCTAT TACATCGAAG TGCCCAAAAC CTACGCCGAT CAGGTGCCGA AACACTACAT CCGCAAACAG ACGCTGACAA CCGGCGAACG CTATTTCACC GATGAACTCA AGCGCTATGA GGAGATCGTT GAGCAGGCGC AGCAGCGTTT GATCGATCTC GAACGACGCG CTTTTGCGCG AATCTGCGAT GTGGTGACCG GCGCCGGAGC GCGGTTGTTG CGCACGGCGC GCATGATCGC AACCATCGAT GTGTTTGCAG CGCTGGCGGA AGCGGCAGTA CGCGGGCGTT ACGTGCGCCC CGAACTGTAC GACGATACCC GCCTGCGGAT CGTCGGCGGG CGGCATCCGG TCGTCGAGCA GACCCTCGAT GAGACGTTCG TTCCAAACGA CATCGAGATG GACACGGAGA CGCGCCAGAT CTGCCTGATC ACCGGTCCCA ATATGAGCGG CAAGAGTACG GTGTTGCGCC AGGTGGCGCT CATCGCCCTG ATGGCGCAGA TTGGGTCATT CGTACCCGCC GATGCCGCCG AGATCGGGCT GGTTGATCGC ATTTTCACGC GCATCGGCGC TCAGGACGAT ATTGCCACCG GTCGTAGCAC GTTTATGGTC GAAATGACCG AGACTGCGGC ATTACTGGCG CAGAGCACCC GTCGCAGCCT GATCATCCTG GACGAGGTCG GTCGCGGCAC CAGCACCTAC GACGGGATGG CAATTGCGCA GGCGGTCATC GAGTACATTC ACAACGAGCC GCGTCTTGGG TGTCGCACCC TCTTCGCAAC CCATTACCAC GAACTGACCG ACCTGGAACG CACCCTGCCA CGTCTCAAGA ACTACCACAT GGCAGCAACC GAGCAGGATG GGCGGGTAGT GTTTCTCCAC GAACTGCGTC CCGGCGGCGC CGACCGCTCG TATGGCATTC ACGTTGCGGA ACTGGCGGGC ATCCCTCAAT CGGTCATTCG TCGGGCGAGC GAGTTACTGG CGGAACTGGA ACGCCGCGCG CCGCGCAGTG CGCCGCCGAC GGTTCCTGCA CGCGGTGACG ACCGCCGATC TGCGGGACGT GCGTCATCGT CCGGTGCTGG CGCGGCGCGC GGCGAGCAGG GTCGCACGCT TCCCGACGGG CAACTCTCAC TCTTCGACCT GGCACCCGGA CCGGTGATCG AGATGCTGCG CCGGATCGAT ATCAATCAGT TGACGCCGCT GGAGGCGCTC AACAAATTGC ATGAACTGCA AAAACTGGCG CGCGCTGGCG GCGGGTGA
|
Protein sequence | MTPAERRAFE RQLQQEFPGL ELHAWYRQYR SLKAAHPDAI LLYRLGDFYE TFDDDAKLVA DLLEVTLTYK EFASQKGRDQ KQRCPMAGIP YHAVEGYVAR LVGAGYRVAI AEQITETPSS RTDTRPRSIF AAGIEQTSLT GSNRMVERKV VRVITPGTII ESGMIPAERN NYLAALIADH GRIGLAYADL STGEFAAVEF SGERAAQQAQ GELTRLNAAE ILVPDRADLR LPGLEPSSAR LEQDLEFLTR EERELLLPGE RVARRVEREN NARWAHGRVT AWPERRWDLR NAHDTLLHQF GVRSLAGFGL EDRPLAIRAA GAIVQYARET QQGVVANLRS IRAYTPGDAM FLDPQTQRNL ELLEGASGTT RGSLIGVLDQ TRTPMGARLL RRWVSQPLCD LTRLHARHDA VERFVTDAIL RASVRETLRR VGDMERVVNR IIQGVGVATP RDMARLRDAL RALPELVAAL GDWTPPPGEI DLTGVRTLPP SAATDPVSLT ENGNERIEPN QTSAVSLRAQ REARRRVSAR YADEDLFGEE EQNAPPVGSS NHAVGTQPSA DDEASPVTAF VDDQATETLA LDPCADMLAF LETAIDDEPP ALLGASNYLR AGENGEPPRR VIRPGFEPEI DQVVAASRDA QRWISELEPK ERERTGIKSL RVDYNRVFGY YIEVPKTYAD QVPKHYIRKQ TLTTGERYFT DELKRYEEIV EQAQQRLIDL ERRAFARICD VVTGAGARLL RTARMIATID VFAALAEAAV RGRYVRPELY DDTRLRIVGG RHPVVEQTLD ETFVPNDIEM DTETRQICLI TGPNMSGKST VLRQVALIAL MAQIGSFVPA DAAEIGLVDR IFTRIGAQDD IATGRSTFMV EMTETAALLA QSTRRSLIIL DEVGRGTSTY DGMAIAQAVI EYIHNEPRLG CRTLFATHYH ELTDLERTLP RLKNYHMAAT EQDGRVVFLH ELRPGGADRS YGIHVAELAG IPQSVIRRAS ELLAELERRA PRSAPPTVPA RGDDRRSAGR ASSSGAGAAR GEQGRTLPDG QLSLFDLAPG PVIEMLRRID INQLTPLEAL NKLHELQKLA RAGGG