Gene Rcas_3531 details

Gene Information

Locus tag: Rcas_3531
Symbol:
ID: 5541030
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4601515
End bp: 4604781
Gene Length: 3267 bp
Protein Length: 1088 aa
Translation table: 11
GC content: 64%
IMG OID: 640895649
Product: DNA mismatch repair protein MutS
Protein accession: YP_001433599
Protein GI: 156743470
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
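
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4601515
end_bp = 4604781
gene_length_bp = 3267
protein_length_aa = 1088

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is excluded from the protein length.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print("span matches gene length:", span == gene_length_bp)
print("gene length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 4604781 - 4601515 + 1 = 3267, and 3267 / 3 - 1 = 1088.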


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCCAG CCGAGCGCCG TGCTTTCGAG CGACAGTTGC AACAGGAGTT CCCCGGTCTC 
GAACTGCACG CCTGGTATCG CCAGTATCGC AGCCTCAAAG CGGCGCACCC CGATGCCATC
CTGCTCTATC GCCTGGGCGA TTTCTACGAA ACATTCGACG ACGACGCCAA ACTGGTCGCC
GATCTGCTCG AAGTGACGCT CACCTACAAA GAGTTCGCCA GCCAGAAGGG GCGCGACCAG
AAACAGCGCT GCCCGATGGC CGGCATCCCG TACCATGCCG TCGAAGGATA TGTCGCGCGG
CTCGTCGGCG CTGGCTACCG CGTCGCCATC GCCGAACAGA TGACCGAAAC CCCCTCAAGC
CGCACCGATA CGCGCCCACG CTCGATCTTC GCCGCTGGCA TCGAGCAGAC GGCGCTCATC
GGCGGACACA AAATGGTCGA ACGCAAGGTC GTGCGCATCA TCACCCCTGG CACGATTATC
GAAAGCGGGA TGCTGCCCGC CGAACGCAAC AACTATCTGG CTGCGCTGAT TGCCGACCAT
GGGCGCATCG GGCTGGCATA TGCCGACTTG AGCACCGGTG AGTTTGCCGC CATCGAGTTC
AGCGGCGAAC GCGCCGCACA GCAGGCGCAG GGCGAACTGG CGCGCCTCAA TCCAGCAGAA
ATCCTGGTTC CCGACCGCGC CGACCTCCGG TTGCCCGGTC TTGAGCCATC CAGCGCCCGC
CTTGAACAGG ACCTGGAGTT CCTCACCCGC GAGGAGCGGG AACGGGTTCT CCCCGGCGAA
CGCATCGCCC GGCGCGTCGA ACGCGAAAAC CATGCGCGCT GGGCGCACGG TCATGTCACT
GCCTGGTCCG AACAACGCTG GGACTTGCGT AATGCGCGCG ATACGCTGCT CCACCAGTTT
GGCGTCCACT CGCTCGCCGG CTTCGGGCTG GCGGATCGTC CGCTGGCGAT CCGTGCCGCT
GGCGCGATTG TGCAGTATGC GCGCGAAACG CAGCAGGGAA CGGTCGCCAA CCTCCGCGCA
ATCCGCGTTT ACACCCCCGG CGATGCCATG GTCCTCGATC CGCAGACGCA GCGAAACCTG
GAATTGCTGG AAGGGAACAG CGGCACAACA CGCGGTTCGC TTATCGGCGT GCTCGACCAG
ACGCGCACGC CAATGGGGGC GCGCCTGCTG CGCCGCTGGA TTTCACAGCC GCTCTGTGAT
CTGGCACGGT TGCGCGCGCG TCACGATGCG GTGGACCACT TCGTCAATGA TGCCATCCTG
CGTGCGTCGG TGCGCGAAAC GCTACGGCGC GTCGGCGATA TGGAGCGAGT GGTCAACCGG
ATTATCCAGG GAAGCGGGGT AGCCACACCG CGCGACATGG CGCGGCTGCG CGATGCGTTG
CGCGCCCTGC CCGACTTGGT CGCTGCGCTG GAAGACTGGA CGCCCCCGCA GGAGGATGTC
GATCTGAGCG GGATGAGCGC CCTCCAGGAG TCTGCGGCAT TGGCGGCTGC GCCGCTTGAT
GGCATCACAC CGCCGGACGA TGACCACACT GAGCAGGAAC CGACGACCAT CAGCCTGCGC
GCGCAGCGCG AAGCGCGCCG GCGGGTATCG GCGCGCCTCA CCGGAGACGA TCTGTTCGAT
GAGGAAGAGG AGCAGGAGAA CGCTGGTCAA CCTGCTCCCC TGCCAACGAC TGAAACGGTC
CGTGCATCCG GCGAATCCGC CAGACCATCT TTTGAAATGC CGTCGCTTCA TGGGCATGGC
GAGAGTCCGA CACTCGATGC GTGCGCCGAC ATCCTGGCGT TTCTGGAAAC CGCCATCGAC
GATGATCCAC CCGCATTGCT TGGCGCGTCC AACTACCTCC GCGCGGGGGA TAATGGCGAA
CTGCCACGCC GCGTGATCCG CCCTGGCTTC GAGCCGGAGA TCGATCGGGT AGTGGCGGCA
AGTCGTGATG CGCAACGCTG GATCAGTGAA CTCGAACCAA AAGAACGTGA GCGCACCGGC
ATCAAATCAC TGCGCGTCGA TTACAACCGT GTCTTTGGCT ATTATATCGA GGTGCCCAAA
ACCTACGCCG ATCAGGTGCC GAAACACTAC ATCCGCAAGC AGACGCTGAC GACCGGCGAG
CGCTACTTTA CCGACGAACT CAAACGCTAC GAAGAGATCG TCGAACAGGC ACAACAACGC
CTGATCGACC TTGAACGTCG CGCCTTCGCC CGCATTTGCG AAACTTTGGC GGGCGCAGGA
GTGCGCCTGT TGCGCACAGC GCGAACGATT GCGACGATCG ATGTCTTCGC GGCGCTGGCG
GAAGCGGCGG TGCGCGGTCG CTACGTGCGC CCCGAACTGT ACGACGATAC CCGCCTGCGC
ATCATCGGCG GACGTCATCC GGTGGTCGAG CAGACTCTCG ATGAAACCTT CATTCCGAAC
GACATCGAGA TGGATACCGA GACGCGCCAG ATCTGCCTGA TCACCGGTCC AAACATGAGC
GGCAAGAGCA CCGTCTTGCG TCAGGTGGCG CTGATTGCAC TCATGGCGCA GATCGGCTCA
TTCGTGCCTG CCGATGCCGC CGAGATCGGG GTGGTGGATC GTATCTTTAC CCGTATTGGC
GCACAGGACG ATATCGCCAC CGGGCGCAGC ACGTTTATGG TCGAAATGAC CGAAACGGCG
GCGCTGCTGG CGCAGAGCAC GCACCGCAGC CTGATCATTC TGGACGAAGT CGGGCGCGGC
ACCAGCACCT ACGACGGCAT GGCGATTGCG CAGGCGGTTA TCGAGTACAT TCACAACGAA
CCGCGTCTTG GCTGCCGCAC CCTCTTTGCG ACCCACTACC ACGAACTGAC CGATCTGGAG
CGTACCCTGC CGCGCCTGAA AAACTACCAC ATGGCGGCGA CCGAGCAAGA TGGGCGGGTA
GTGTTCCTGC ACGAACTGCG ACCCGGCGGC GCCGACCGGT CATATGGCAT TCATGTGGCA
GAACTGGCGG GCATTCCACA ACCGGTTATT CGCCGCGCCA CCGAGTTGCT GGCGGAACTC
GAGCGCCGCG CGCCGCGCAG TACGCCCCAA CCGGCGCCTG AGCGCACAGA GGAACGCCCA
GCGGCAGGGC GCCCCACGGC GCGCAGCCAC AGCGCGGCGC GCGGCGATCC ACCGCGCGCG
CCGGATGGTC AACTGTCGCT CTTCGATCTG ACGCCGGGAC CGGTGATCGA GATGCTGCGG
CGGCTCGACA TCAATCAGTT GACGCCGCTG GAGGCGCTGA ACAAACTGTA TGAACTGCAA
AAACTGGCGC GCATTGGCGG TGGGTAG
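
The gene sequence above is printed in blocks of 10 bases, so it has to be stripped of spaces and line breaks before its length or GC fraction can be computed. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical and do not come from the source database, and only the first and last lines of the sequence are used here as a short demonstration.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases, or 0.0 for an empty sequence."""
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence above (demonstration only).
first_line = "ATGACCCCAG CCGAGCGCCG TGCTTTCGAG CGACAGTTGC AACAGGAGTT CCCCGGTCTC"
last_line = "AAACTGGCGC GCATTGGCGG TGGGTAG"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print("bases in first line:", len(head))
print("starts with ATG:", head.startswith("ATG"))
print("ends with TAG stop codon:", tail.endswith("TAG"))
print("GC fraction of first line:", round(gc_fraction(head), 2))
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` gives a value that can be compared with the GC content field reported in the Gene Information section.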
 
Protein sequence
MTPAERRAFE RQLQQEFPGL ELHAWYRQYR SLKAAHPDAI LLYRLGDFYE TFDDDAKLVA 
DLLEVTLTYK EFASQKGRDQ KQRCPMAGIP YHAVEGYVAR LVGAGYRVAI AEQMTETPSS
RTDTRPRSIF AAGIEQTALI GGHKMVERKV VRIITPGTII ESGMLPAERN NYLAALIADH
GRIGLAYADL STGEFAAIEF SGERAAQQAQ GELARLNPAE ILVPDRADLR LPGLEPSSAR
LEQDLEFLTR EERERVLPGE RIARRVEREN HARWAHGHVT AWSEQRWDLR NARDTLLHQF
GVHSLAGFGL ADRPLAIRAA GAIVQYARET QQGTVANLRA IRVYTPGDAM VLDPQTQRNL
ELLEGNSGTT RGSLIGVLDQ TRTPMGARLL RRWISQPLCD LARLRARHDA VDHFVNDAIL
RASVRETLRR VGDMERVVNR IIQGSGVATP RDMARLRDAL RALPDLVAAL EDWTPPQEDV
DLSGMSALQE SAALAAAPLD GITPPDDDHT EQEPTTISLR AQREARRRVS ARLTGDDLFD
EEEEQENAGQ PAPLPTTETV RASGESARPS FEMPSLHGHG ESPTLDACAD ILAFLETAID
DDPPALLGAS NYLRAGDNGE LPRRVIRPGF EPEIDRVVAA SRDAQRWISE LEPKERERTG
IKSLRVDYNR VFGYYIEVPK TYADQVPKHY IRKQTLTTGE RYFTDELKRY EEIVEQAQQR
LIDLERRAFA RICETLAGAG VRLLRTARTI ATIDVFAALA EAAVRGRYVR PELYDDTRLR
IIGGRHPVVE QTLDETFIPN DIEMDTETRQ ICLITGPNMS GKSTVLRQVA LIALMAQIGS
FVPADAAEIG VVDRIFTRIG AQDDIATGRS TFMVEMTETA ALLAQSTHRS LIILDEVGRG
TSTYDGMAIA QAVIEYIHNE PRLGCRTLFA THYHELTDLE RTLPRLKNYH MAATEQDGRV
VFLHELRPGG ADRSYGIHVA ELAGIPQPVI RRATELLAEL ERRAPRSTPQ PAPERTEERP
AAGRPTARSH SAARGDPPRA PDGQLSLFDL TPGPVIEMLR RLDINQLTPL EALNKLYELQ
KLARIGGG