Gene RoseRS_3706 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3706 
Symbol 
ID5210685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4637360 
End bp4640617 
Gene Length3258 bp 
Protein Length1085 aa 
Translation table11 
GC content64% 
IMG OID640597299 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_001278010 
Protein GI148657805 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0191943 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCTG CCGAGCGCCG CGCCTTCGAG CGCCAGTTGC AGCAAGAGTT CCCCGGTCTC 
GAGTTGCACG CCTGGTATCG CCAGTATCGC AGCCTCAAAG CTGCGCACCC CGACGCTATT
CTGCTCTACC GTCTGGGCGA TTTTTACGAG ACGTTCGACG ATGATGCCAA ACTGGTCGCC
GATCTGCTCG AAGTGACGCT GACTTACAAA GAATTCGCCA GCCAGAAGGG GCGCGACCAG
AAGCAGCGCT GCCCGATGGC AGGCATACCG TACCACGCGG TCGAAGGGTA TGTGGCGCGG
CTCGTCGGCG CCGGGTATCG CGTCGCTATC GCCGAGCAGA TCACTGAGAC CCCTTCCAGT
CGCACCGATA CGCGCCCACG TTCGATCTTC GCCGCTGGCA TTGAGCAAAC GTCGCTCACC
GGCAGCAACC GGATGGTCGA GCGCAAGGTG GTACGGGTGA TAACGCCCGG CACGATCATC
GAAAGCGGCA TGATCCCTGC CGAGCGCAAC AACTACCTTG CCGCTCTGAT CGCCGATCAC
GGGCGGATCG GTCTGGCATA CGCCGACCTG AGCACCGGTG AGTTCGCTGC CGTCGAGTTC
AGCGGCGAAC GTGCGGCGCA ACAGGCGCAG GGAGAACTGA CGCGCCTCAA TGCCGCCGAA
ATCCTGGTTC CCGACCGCGC CGACCTGCGC CTGCCCGGTC TTGAGCCGTC CAGCGCTCGC
CTCGAGCAGG ACCTGGAGTT CCTCACCCGC GAAGAGCGCG AACTGCTCCT TCCGGGTGAA
CGGGTGGCGC GGCGTGTTGA GCGTGAAAAC AACGCGCGCT GGGCGCACGG GCGCGTGACC
GCCTGGCCCG AACGACGCTG GGATCTGCGT AATGCGCACG ATACGCTGCT CCACCAGTTC
GGCGTCCGTT CGCTCGCCGG TTTCGGGCTG GAAGACCGTC CGCTGGCGAT CCGCGCCGCT
GGCGCTATCG TGCAGTACGC GCGCGAAACG CAGCAGGGAG TGGTCGCAAA TCTGCGCTCG
ATCCGCGCTT ACACCCCCGG CGATGCGATG TTCCTCGATC CGCAGACTCA GCGGAACCTC
GAATTGCTGG AAGGCGCCAG CGGCACGACG CGCGGCTCGT TGATCGGTGT GCTCGATCAG
ACGCGCACAC CAATGGGGGC GCGCCTGCTG CGCCGCTGGG TTTCGCAGCC GCTGTGCGAT
CTGACGCGGC TGCATGCCCG CCACGACGCA GTGGAACGCT TTGTCACCGA TGCCATCCTG
CGCGCATCGG TGCGCGAAAC CCTGCGACGA GTCGGCGACA TGGAACGGGT CGTCAACCGG
ATCATTCAGG GGGTTGGGGT GGCAACGCCG CGCGATATGG CGCGGTTGCG CGATGCGTTG
CGTGCGCTCC CTGAACTTGT CGCCGCGCTC GGCGATTGGA CGCCGCCGCC GGGCGAGATC
GATCTCACTG GTGTGCGCAC CCTCCCGCCG TCCGCAGCGA CCGATCCCGT GTCGCTGACG
GAAAACGGCA ATGAGCGCAT CGAACCGAAC CAGACGTCGG CGGTCAGTCT GCGCGCACAG
CGCGAGGCGC GTCGGCGGGT ATCGGCGCGC TACGCCGATG AAGACCTGTT TGGGGAGGAA
GAGCAAAACG CACCGCCCGT CGGGTCGTCG AACCATGCCG TCGGAACACA ACCATCGGCA
GATGATGAAG CCTCTCCCGT CACAGCGTTC GTGGACGACC AGGCGACCGA AACGCTGGCG
CTTGATCCGT GCGCCGATAT GCTGGCATTC CTGGAGACCG CCATCGATGA CGAGCCGCCT
GCGCTGCTTG GCGCGTCCAA CTATCTGCGC GCAGGGGAGA ACGGCGAGCC GCCGCGTCGC
GTTATCCGCC CCGGTTTTGA ACCGGAGATC GATCAGGTGG TGGCGGCAAG CCGTGACGCA
CAGCGCTGGA TCAGCGAACT TGAACCGAAA GAGCGGGAGC GCACCGGCAT CAAGTCGCTG
CGCGTCGATT ATAACCGCGT CTTTGGCTAT TACATCGAAG TGCCCAAAAC CTACGCCGAT
CAGGTGCCGA AACACTACAT CCGCAAACAG ACGCTGACAA CCGGCGAACG CTATTTCACC
GATGAACTCA AGCGCTATGA GGAGATCGTT GAGCAGGCGC AGCAGCGTTT GATCGATCTC
GAACGACGCG CTTTTGCGCG AATCTGCGAT GTGGTGACCG GCGCCGGAGC GCGGTTGTTG
CGCACGGCGC GCATGATCGC AACCATCGAT GTGTTTGCAG CGCTGGCGGA AGCGGCAGTA
CGCGGGCGTT ACGTGCGCCC CGAACTGTAC GACGATACCC GCCTGCGGAT CGTCGGCGGG
CGGCATCCGG TCGTCGAGCA GACCCTCGAT GAGACGTTCG TTCCAAACGA CATCGAGATG
GACACGGAGA CGCGCCAGAT CTGCCTGATC ACCGGTCCCA ATATGAGCGG CAAGAGTACG
GTGTTGCGCC AGGTGGCGCT CATCGCCCTG ATGGCGCAGA TTGGGTCATT CGTACCCGCC
GATGCCGCCG AGATCGGGCT GGTTGATCGC ATTTTCACGC GCATCGGCGC TCAGGACGAT
ATTGCCACCG GTCGTAGCAC GTTTATGGTC GAAATGACCG AGACTGCGGC ATTACTGGCG
CAGAGCACCC GTCGCAGCCT GATCATCCTG GACGAGGTCG GTCGCGGCAC CAGCACCTAC
GACGGGATGG CAATTGCGCA GGCGGTCATC GAGTACATTC ACAACGAGCC GCGTCTTGGG
TGTCGCACCC TCTTCGCAAC CCATTACCAC GAACTGACCG ACCTGGAACG CACCCTGCCA
CGTCTCAAGA ACTACCACAT GGCAGCAACC GAGCAGGATG GGCGGGTAGT GTTTCTCCAC
GAACTGCGTC CCGGCGGCGC CGACCGCTCG TATGGCATTC ACGTTGCGGA ACTGGCGGGC
ATCCCTCAAT CGGTCATTCG TCGGGCGAGC GAGTTACTGG CGGAACTGGA ACGCCGCGCG
CCGCGCAGTG CGCCGCCGAC GGTTCCTGCA CGCGGTGACG ACCGCCGATC TGCGGGACGT
GCGTCATCGT CCGGTGCTGG CGCGGCGCGC GGCGAGCAGG GTCGCACGCT TCCCGACGGG
CAACTCTCAC TCTTCGACCT GGCACCCGGA CCGGTGATCG AGATGCTGCG CCGGATCGAT
ATCAATCAGT TGACGCCGCT GGAGGCGCTC AACAAATTGC ATGAACTGCA AAAACTGGCG
CGCGCTGGCG GCGGGTGA
 
Protein sequence
MTPAERRAFE RQLQQEFPGL ELHAWYRQYR SLKAAHPDAI LLYRLGDFYE TFDDDAKLVA 
DLLEVTLTYK EFASQKGRDQ KQRCPMAGIP YHAVEGYVAR LVGAGYRVAI AEQITETPSS
RTDTRPRSIF AAGIEQTSLT GSNRMVERKV VRVITPGTII ESGMIPAERN NYLAALIADH
GRIGLAYADL STGEFAAVEF SGERAAQQAQ GELTRLNAAE ILVPDRADLR LPGLEPSSAR
LEQDLEFLTR EERELLLPGE RVARRVEREN NARWAHGRVT AWPERRWDLR NAHDTLLHQF
GVRSLAGFGL EDRPLAIRAA GAIVQYARET QQGVVANLRS IRAYTPGDAM FLDPQTQRNL
ELLEGASGTT RGSLIGVLDQ TRTPMGARLL RRWVSQPLCD LTRLHARHDA VERFVTDAIL
RASVRETLRR VGDMERVVNR IIQGVGVATP RDMARLRDAL RALPELVAAL GDWTPPPGEI
DLTGVRTLPP SAATDPVSLT ENGNERIEPN QTSAVSLRAQ REARRRVSAR YADEDLFGEE
EQNAPPVGSS NHAVGTQPSA DDEASPVTAF VDDQATETLA LDPCADMLAF LETAIDDEPP
ALLGASNYLR AGENGEPPRR VIRPGFEPEI DQVVAASRDA QRWISELEPK ERERTGIKSL
RVDYNRVFGY YIEVPKTYAD QVPKHYIRKQ TLTTGERYFT DELKRYEEIV EQAQQRLIDL
ERRAFARICD VVTGAGARLL RTARMIATID VFAALAEAAV RGRYVRPELY DDTRLRIVGG
RHPVVEQTLD ETFVPNDIEM DTETRQICLI TGPNMSGKST VLRQVALIAL MAQIGSFVPA
DAAEIGLVDR IFTRIGAQDD IATGRSTFMV EMTETAALLA QSTRRSLIIL DEVGRGTSTY
DGMAIAQAVI EYIHNEPRLG CRTLFATHYH ELTDLERTLP RLKNYHMAAT EQDGRVVFLH
ELRPGGADRS YGIHVAELAG IPQSVIRRAS ELLAELERRA PRSAPPTVPA RGDDRRSAGR
ASSSGAGAAR GEQGRTLPDG QLSLFDLAPG PVIEMLRRID INQLTPLEAL NKLHELQKLA
RAGGG