Gene Rsph17025_2655 details

Gene Information

Locus tag: Rsph17025_2655
Symbol:
ID: 5084443
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 2696862
End bp: 2699492
Gene Length: 2631 bp
Protein Length: 876 aa
Translation table: 11
GC content: 71%
IMG OID: 640484218
Product: DNA mismatch repair protein MutS
Protein accession: YP_001168847
Protein GI: 146278688
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
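
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(2696862, 2699492)
print(length_bp)                  # 2631
print(protein_length(length_bp))  # 876
```

Both results agree with the Gene Length and Protein Length fields of the record.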


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.607224
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGACG ACACCGTCAC GCCGATGATG GCGCAATATC TGGAGATCAA GGCGCAGCAC 
CCCGGCGCGA TCCTGTTCTA CCGGATGGGC GACTTCTACG AGATGTTCTT CGAGGATGCG
GCGCTGGCGG CCGAGGCGCT CGACATCGCG CTGACCAAGC GCGGCAAGCA CAAGGGCGAG
GATATCGCCA TGTGCGGCGT GCCGATCCAT GCCGCCGAGG GCTACCTGCT GACGCTGATC
CGCAAGGGGT TCCGCGTCGC CATCGCCGAG CAGATGGAGG ACCCGGCCGA AGCGAAGAAG
CGCGGCTCCA AGTCCGTTGT CCGGCGCGAG GTGGTGCGGC TCGTCACCCC CGGCACGCTG
ACCGAGGACA GCCTGCTGGA GGCGCGGCGG CACAACTTCC TCTGCGCCTT CGCCGAGATC
CGCGACGAGG CGGCACTCGC CTGGGCCGAC ATCTCGACCG GCGAGTTCAG CGTCACGCCC
TGCCCGCTGC CCCGCCTGCT GCCCGAGCTT GCCCGCCTCG CGCCGCGCGA ACTGCTGGTG
GCCGACGAAC GCCCGCTCGA CTGGATCGAG GAGGTGGGAT GCGCCCTGAC CCCTCTCGCC
CGCGCGAGCT TTGACAGCGC CTCGGCCGAA AAGCGGCTCT GCACGCTCTT CGGGGTCGGC
ACGCTGGACA GCTTCGGCAA CTTCACCCGC CCCGAGCTGT CGGCCATGGG CGCGCTGGTC
GATTACCTCG ACCTCACGCA GCGCGGAAAG CTGCCGCTCC TGCGCCCGCC CGTGCGCGAG
GTCGCGGGCG GCACGGTGCA GATCGACGCC GCCACCCGGC GCAACCTCGA GATCACGCAA
GCCCTCACCG GCGGGCGCGA AGGTTCGCTG CTCTCGGCGG TGGACCGCAC CGTCACCGCC
CCCGGCGCCC GCCTGCTCGA GCGGCGGCTC TCCAGCCCCT CGCGCGACCT TGGCCTGATC
CACGACCGGC TCGCGGCTGT GAGCTGGCTG ACGGACGAGC CGCGGCTGCG CGAGGATCTG
CGGGCGAGCC TGCGCCGCGT GCCGGACATG GACCGCGCCC TCTCGCGGCT CGCGCTCGAC
CGTGCCGGGC CACGGGACAT GGCGGCGATC CGCGCCGGCC TCACGCAGGC CGAGGCCATC
GCGGGTCGTA TGCCGGCCGA CGCGCCTTCC CTGCTCGCGG AGACACTCGA GGCGCTCCGC
GGCCACGAGA ACCTCGTGGA TCTCCTCGAT CAGGCGCTGG TGGCCGAGCC GCCGCTGCTG
GTGCGCGATG GCGGCTTCAT CGCCCCGGGC TTCGATGACG ACCTCGACGA GACACGGCGC
CTGCGCGACG AGGGCCGCGG CGTGATCGCG TCGATGCAGG CCGGCTTCAT CGAGACGACC
GGCATCCAGA GCCTGAAGAT CAAGCACAAC AACGTGCTGG GCTATTTCAT CGAAGTCACC
TCGACCCACG CCGAAAAGAT GCTCTCACCC CCCCTGTCCG AGAGCTTCAT CCACCGCCAG
ACGACCGCGG GGCAGGTGCG CTTCACCACC GTCGCCCTCT CGGAACTCGA AACGCGCATC
CTGAACGCCG GGAACCGCGC GCTCGAACTC GAGAAGATGC ATTTTGCGGC GCTGCGGACG
GCGATCCTCG ATCAGGCGGG CGCGATCGGC CGCGCCGCCC GGGCGCTGGC CGAGGTGGAC
CTGATCGCGG CCTTCGCCGA CCTCGCCGTG GCCGAGGACT GGACCGAGCC GCAGGTGGAC
GACAGCCGCG CCTTCGCCAT CGAGGCCGGC CGGCATCCGG TCGTCGAGCG TGCCCTCCGC
CGGACCGGCA CGCCCTTCGT GGCGAACGAC TGCGACCTGT CCAAGGCCGA GACGCCGGCC
GTCTGGCTCA TCACCGGGCC GAACATGGCC GGTAAATCCA CCTTCCTGCG CCAGAACGCA
CTGATCGCGC TGCTCGCCCA GGCGGGCAGC TTCGTCCCCG CCCGCCGGGC CCATATCGGC
CTCGTCAGCC AGATCTTCAG CCGCGTCGGC GCCTCGGACG ATCTGGCCCG CGGCCGCTCG
ACCTTCATGG TCGAAATGGT CGAAACCGCC GCCATCCTGA ACCAGGCCGA TGACCGCGCG
CTTGTGATCC TCGACGAGAT CGGCCGCGGT ACAGCCACCT GGGACGGGCT CTCGATCGCC
TGGGCCACGC TCGAGCATCT GCACGACACG AACCGCTGCC GCGCGCTCTT CGCCACCCAC
TACCACGAGA TGACGGCGCT CGCCGGCAAG CTCACCGGCG TCGAGAACGC CACCGTGTCC
GTCAAGGAAT GGCAGGGCGA GGTGATCTTC CTGCACGAGG TGCGGCGCGG CGCGGCTGAT
CGGTCCTATG GTGTGCAGGT GGCGCGGCTC GCGGGCCTTC CCGCCTCGGT AATCGAGCGC
GCCCGTACCG TCCTCGACGC GCTCGAGTCC GGCGAACGCG AGAGCGGTCC ACGGCGGCAG
GCGCTGATCG ACGACCTGCC GCTCTTTCGC GCCGCCCCGC CGCCGCCCGC CCCCGCCGCT
CCTCCCAAAG CCTCGCAGGT GGAAGAGCGG CTGCGCGCGA TCCAGCCCGA CGACCTCAGC
CCGCGCGAGG CGCTCAAACT CCTCTACGAT CTCCGGGCCC TCCTGCCCTG A
 
Protein sequence
MSDDTVTPMM AQYLEIKAQH PGAILFYRMG DFYEMFFEDA ALAAEALDIA LTKRGKHKGE 
DIAMCGVPIH AAEGYLLTLI RKGFRVAIAE QMEDPAEAKK RGSKSVVRRE VVRLVTPGTL
TEDSLLEARR HNFLCAFAEI RDEAALAWAD ISTGEFSVTP CPLPRLLPEL ARLAPRELLV
ADERPLDWIE EVGCALTPLA RASFDSASAE KRLCTLFGVG TLDSFGNFTR PELSAMGALV
DYLDLTQRGK LPLLRPPVRE VAGGTVQIDA ATRRNLEITQ ALTGGREGSL LSAVDRTVTA
PGARLLERRL SSPSRDLGLI HDRLAAVSWL TDEPRLREDL RASLRRVPDM DRALSRLALD
RAGPRDMAAI RAGLTQAEAI AGRMPADAPS LLAETLEALR GHENLVDLLD QALVAEPPLL
VRDGGFIAPG FDDDLDETRR LRDEGRGVIA SMQAGFIETT GIQSLKIKHN NVLGYFIEVT
STHAEKMLSP PLSESFIHRQ TTAGQVRFTT VALSELETRI LNAGNRALEL EKMHFAALRT
AILDQAGAIG RAARALAEVD LIAAFADLAV AEDWTEPQVD DSRAFAIEAG RHPVVERALR
RTGTPFVAND CDLSKAETPA VWLITGPNMA GKSTFLRQNA LIALLAQAGS FVPARRAHIG
LVSQIFSRVG ASDDLARGRS TFMVEMVETA AILNQADDRA LVILDEIGRG TATWDGLSIA
WATLEHLHDT NRCRALFATH YHEMTALAGK LTGVENATVS VKEWQGEVIF LHEVRRGAAD
RSYGVQVARL AGLPASVIER ARTVLDALES GERESGPRRQ ALIDDLPLFR AAPPPPAPAA
PPKASQVEER LRAIQPDDLS PREALKLLYD LRALLP
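
The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 implies for an alternative initiation codon. The sketch below illustrates this on the first line of the gene sequence and also shows how a GC fraction can be computed. It is an illustration under stated assumptions, not the database's own method: the helper names `gc_content` and `translate_cds` are hypothetical, and `START_CODONS` lists only three common bacterial initiators, which is a simplifying subset of those defined for table 11.

```python
from itertools import product

# Codon-to-residue assignments in the compact NCBI layout (bases ordered TCAG).
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a simplified subset of the initiation codons of table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiation codon is read as methionine, whatever it encodes elsewhere.
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence in this record.
first_line = "GTGAGCGACG ACACCGTCAC GCCGATGATG GCGCAATATC TGGAGATCAA GGCGCAGCAC"
print(translate_cds(first_line))  # MSDDTVTPMMAQYLEIKAQH
```

The printed residues match the first two blocks of the protein sequence above. Passing the full gene sequence to `gc_content` would give a value comparable to the GC content field of the record.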