Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3531 |
Symbol | |
ID | 5541030 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 4601515 |
End bp | 4604781 |
Gene Length | 3267 bp |
Protein Length | 1088 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640895649 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001433599 |
Protein GI | 156743470 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCCAG CCGAGCGCCG TGCTTTCGAG CGACAGTTGC AACAGGAGTT CCCCGGTCTC GAACTGCACG CCTGGTATCG CCAGTATCGC AGCCTCAAAG CGGCGCACCC CGATGCCATC CTGCTCTATC GCCTGGGCGA TTTCTACGAA ACATTCGACG ACGACGCCAA ACTGGTCGCC GATCTGCTCG AAGTGACGCT CACCTACAAA GAGTTCGCCA GCCAGAAGGG GCGCGACCAG AAACAGCGCT GCCCGATGGC CGGCATCCCG TACCATGCCG TCGAAGGATA TGTCGCGCGG CTCGTCGGCG CTGGCTACCG CGTCGCCATC GCCGAACAGA TGACCGAAAC CCCCTCAAGC CGCACCGATA CGCGCCCACG CTCGATCTTC GCCGCTGGCA TCGAGCAGAC GGCGCTCATC GGCGGACACA AAATGGTCGA ACGCAAGGTC GTGCGCATCA TCACCCCTGG CACGATTATC GAAAGCGGGA TGCTGCCCGC CGAACGCAAC AACTATCTGG CTGCGCTGAT TGCCGACCAT GGGCGCATCG GGCTGGCATA TGCCGACTTG AGCACCGGTG AGTTTGCCGC CATCGAGTTC AGCGGCGAAC GCGCCGCACA GCAGGCGCAG GGCGAACTGG CGCGCCTCAA TCCAGCAGAA ATCCTGGTTC CCGACCGCGC CGACCTCCGG TTGCCCGGTC TTGAGCCATC CAGCGCCCGC CTTGAACAGG ACCTGGAGTT CCTCACCCGC GAGGAGCGGG AACGGGTTCT CCCCGGCGAA CGCATCGCCC GGCGCGTCGA ACGCGAAAAC CATGCGCGCT GGGCGCACGG TCATGTCACT GCCTGGTCCG AACAACGCTG GGACTTGCGT AATGCGCGCG ATACGCTGCT CCACCAGTTT GGCGTCCACT CGCTCGCCGG CTTCGGGCTG GCGGATCGTC CGCTGGCGAT CCGTGCCGCT GGCGCGATTG TGCAGTATGC GCGCGAAACG CAGCAGGGAA CGGTCGCCAA CCTCCGCGCA ATCCGCGTTT ACACCCCCGG CGATGCCATG GTCCTCGATC CGCAGACGCA GCGAAACCTG GAATTGCTGG AAGGGAACAG CGGCACAACA CGCGGTTCGC TTATCGGCGT GCTCGACCAG ACGCGCACGC CAATGGGGGC GCGCCTGCTG CGCCGCTGGA TTTCACAGCC GCTCTGTGAT CTGGCACGGT TGCGCGCGCG TCACGATGCG GTGGACCACT TCGTCAATGA TGCCATCCTG CGTGCGTCGG TGCGCGAAAC GCTACGGCGC GTCGGCGATA TGGAGCGAGT GGTCAACCGG ATTATCCAGG GAAGCGGGGT AGCCACACCG CGCGACATGG CGCGGCTGCG CGATGCGTTG CGCGCCCTGC CCGACTTGGT CGCTGCGCTG GAAGACTGGA CGCCCCCGCA GGAGGATGTC GATCTGAGCG GGATGAGCGC CCTCCAGGAG TCTGCGGCAT TGGCGGCTGC GCCGCTTGAT GGCATCACAC CGCCGGACGA TGACCACACT GAGCAGGAAC CGACGACCAT CAGCCTGCGC GCGCAGCGCG AAGCGCGCCG GCGGGTATCG GCGCGCCTCA CCGGAGACGA TCTGTTCGAT GAGGAAGAGG AGCAGGAGAA CGCTGGTCAA CCTGCTCCCC TGCCAACGAC TGAAACGGTC CGTGCATCCG GCGAATCCGC CAGACCATCT TTTGAAATGC CGTCGCTTCA TGGGCATGGC GAGAGTCCGA CACTCGATGC GTGCGCCGAC ATCCTGGCGT TTCTGGAAAC CGCCATCGAC GATGATCCAC CCGCATTGCT TGGCGCGTCC AACTACCTCC GCGCGGGGGA TAATGGCGAA CTGCCACGCC GCGTGATCCG CCCTGGCTTC GAGCCGGAGA TCGATCGGGT AGTGGCGGCA AGTCGTGATG CGCAACGCTG GATCAGTGAA CTCGAACCAA AAGAACGTGA GCGCACCGGC ATCAAATCAC TGCGCGTCGA TTACAACCGT GTCTTTGGCT ATTATATCGA GGTGCCCAAA ACCTACGCCG ATCAGGTGCC GAAACACTAC ATCCGCAAGC AGACGCTGAC GACCGGCGAG CGCTACTTTA CCGACGAACT CAAACGCTAC GAAGAGATCG TCGAACAGGC ACAACAACGC CTGATCGACC TTGAACGTCG CGCCTTCGCC CGCATTTGCG AAACTTTGGC GGGCGCAGGA GTGCGCCTGT TGCGCACAGC GCGAACGATT GCGACGATCG ATGTCTTCGC GGCGCTGGCG GAAGCGGCGG TGCGCGGTCG CTACGTGCGC CCCGAACTGT ACGACGATAC CCGCCTGCGC ATCATCGGCG GACGTCATCC GGTGGTCGAG CAGACTCTCG ATGAAACCTT CATTCCGAAC GACATCGAGA TGGATACCGA GACGCGCCAG ATCTGCCTGA TCACCGGTCC AAACATGAGC GGCAAGAGCA CCGTCTTGCG TCAGGTGGCG CTGATTGCAC TCATGGCGCA GATCGGCTCA TTCGTGCCTG CCGATGCCGC CGAGATCGGG GTGGTGGATC GTATCTTTAC CCGTATTGGC GCACAGGACG ATATCGCCAC CGGGCGCAGC ACGTTTATGG TCGAAATGAC CGAAACGGCG GCGCTGCTGG CGCAGAGCAC GCACCGCAGC CTGATCATTC TGGACGAAGT CGGGCGCGGC ACCAGCACCT ACGACGGCAT GGCGATTGCG CAGGCGGTTA TCGAGTACAT TCACAACGAA CCGCGTCTTG GCTGCCGCAC CCTCTTTGCG ACCCACTACC ACGAACTGAC CGATCTGGAG CGTACCCTGC CGCGCCTGAA AAACTACCAC ATGGCGGCGA CCGAGCAAGA TGGGCGGGTA GTGTTCCTGC ACGAACTGCG ACCCGGCGGC GCCGACCGGT CATATGGCAT TCATGTGGCA GAACTGGCGG GCATTCCACA ACCGGTTATT CGCCGCGCCA CCGAGTTGCT GGCGGAACTC GAGCGCCGCG CGCCGCGCAG TACGCCCCAA CCGGCGCCTG AGCGCACAGA GGAACGCCCA GCGGCAGGGC GCCCCACGGC GCGCAGCCAC AGCGCGGCGC GCGGCGATCC ACCGCGCGCG CCGGATGGTC AACTGTCGCT CTTCGATCTG ACGCCGGGAC CGGTGATCGA GATGCTGCGG CGGCTCGACA TCAATCAGTT GACGCCGCTG GAGGCGCTGA ACAAACTGTA TGAACTGCAA AAACTGGCGC GCATTGGCGG TGGGTAG
|
Protein sequence | MTPAERRAFE RQLQQEFPGL ELHAWYRQYR SLKAAHPDAI LLYRLGDFYE TFDDDAKLVA DLLEVTLTYK EFASQKGRDQ KQRCPMAGIP YHAVEGYVAR LVGAGYRVAI AEQMTETPSS RTDTRPRSIF AAGIEQTALI GGHKMVERKV VRIITPGTII ESGMLPAERN NYLAALIADH GRIGLAYADL STGEFAAIEF SGERAAQQAQ GELARLNPAE ILVPDRADLR LPGLEPSSAR LEQDLEFLTR EERERVLPGE RIARRVEREN HARWAHGHVT AWSEQRWDLR NARDTLLHQF GVHSLAGFGL ADRPLAIRAA GAIVQYARET QQGTVANLRA IRVYTPGDAM VLDPQTQRNL ELLEGNSGTT RGSLIGVLDQ TRTPMGARLL RRWISQPLCD LARLRARHDA VDHFVNDAIL RASVRETLRR VGDMERVVNR IIQGSGVATP RDMARLRDAL RALPDLVAAL EDWTPPQEDV DLSGMSALQE SAALAAAPLD GITPPDDDHT EQEPTTISLR AQREARRRVS ARLTGDDLFD EEEEQENAGQ PAPLPTTETV RASGESARPS FEMPSLHGHG ESPTLDACAD ILAFLETAID DDPPALLGAS NYLRAGDNGE LPRRVIRPGF EPEIDRVVAA SRDAQRWISE LEPKERERTG IKSLRVDYNR VFGYYIEVPK TYADQVPKHY IRKQTLTTGE RYFTDELKRY EEIVEQAQQR LIDLERRAFA RICETLAGAG VRLLRTARTI ATIDVFAALA EAAVRGRYVR PELYDDTRLR IIGGRHPVVE QTLDETFIPN DIEMDTETRQ ICLITGPNMS GKSTVLRQVA LIALMAQIGS FVPADAAEIG VVDRIFTRIG AQDDIATGRS TFMVEMTETA ALLAQSTHRS LIILDEVGRG TSTYDGMAIA QAVIEYIHNE PRLGCRTLFA THYHELTDLE RTLPRLKNYH MAATEQDGRV VFLHELRPGG ADRSYGIHVA ELAGIPQPVI RRATELLAEL ERRAPRSTPQ PAPERTEERP AAGRPTARSH SAARGDPPRA PDGQLSLFDL TPGPVIEMLR RLDINQLTPL EALNKLYELQ KLARIGGG
|
| |