Gene Rsph17029_2879 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2879 
Symbol 
ID4897705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp3036888 
End bp3039530 
Gene Length2643 bp 
Protein Length880 aa 
Translation table11 
GC content71% 
IMG OID640113482 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_001044753 
Protein GI126463639 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAGGGG GCAGCGTGAG CGACGACACC GTCACGCCGA TGATGGCGCA ATATCTCGAG 
ATCAAGGCGC AGAACCCCGG CGCCATCCTG TTCTACCGGA TGGGCGACTT CTACGAGATG
TTCTTCGACG ACGCGGCTCT GGCCGCCGAG GCGCTCGACA TCGCGCTCAC CAAGCGCGGC
AAGCACCGCG GCGAGGATAT CGCCATGTGC GGCGTGCCGA TCCATGCGGC CGAGGGCTAT
CTTCTCACGC TGATCCGCAA GGGCTTCCGC GTGGCCATCG CCGAACAGAT GGAGGATCCG
GCCGAGGCGA AGAAGCGCGG CTCCAAGTCC GTGGTCCGGC GCGAGGTCGT GCGCCTCGTC
ACGCCCGGCA CGCTGACCGA GGACACGCTG CTCGAAGCGC GGCGGCACAA CTACCTCTGC
GCCTTCGCCG AGATCCGCGA CGAGGCGGCA CTGGCCTGGG CCGACATCTC GACGGGTGAG
CTCAGCGTCA CGGCCTGCCC GCTGCCGCGC CTCATGCCCG AACTGGCCCG CCTCGCCCCG
CGCGAGCTGC TGGTGGCCGA CGAGCGGGAG CTCGACTGGA TCGAGGAGGT GGGCTGCGCG
CTGACGCCGC TCTCGCGGGC GAGCTTCGAC AGCGCCTCGG CCGAGAAGCG GCTCTGCGCG
CTCTTCGGCG TGGGCACGCT CGAAAGCTTC GGCAATTTCA CCCGGGCCGA GCTTTCGGCC
ATGGGGGCGC TGGTCGACTA CCTCGACCTT ACCCAGCGCG GCAAGCTGCC GCTCCTGCGC
CCGCCCGTGC GCGAGACAGT AGGCGGCACG GTCCAGATCG ACGCCGCCAC CCGTCGCAAC
CTCGAGATCA CCCAGGCGCT TGCCGGAGGC CGCGACGGCT CGCTCCTCTC CGCCGTGGAT
CGCACCGTCA CCGCTCCGGG CGCGCGCCTG CTCGAACGGC GTCTCTCGAG CCCCACCCGC
GACCTCGGCC TGATCCACGA GAGGCTCGGC GCGGTGCGCT GGCTGACCGA AGAGCCGCGC
CTGCGCGAGG AAATGCGCGC GAGCCTCCGC CGCGTGCCCG ACATGGACCG CGCCCTCTCG
CGGCTGGCAC TCGACCGTGC CGGTCCCCGC GATATGGCCG CGATCCGGGC AGGCCTCGCC
CAGGCCCAGG AGATCGCGCA GCGGATGCCC GCCGAGGCTC CCGCCCTCGT CACCCGCGCA
CTCGAGGCGC TCGGCGGCCA CGAGGCGCTG GTGGATCTCC TCGATCAGGC TCTCGTGGCC
GAGCCGCCGC TTCTCGCCCG CGACGGCGGC TTCATCGCAC AGGGGTTCGA TGCGGATCTC
GACGAGACGC GCCGCCTGCG CGACGAGGGC CGCGGCGTCA TCGCCTCGAT GCAGGCGGGC
TTCATCGAGG TGACCGGCAT CCAGAGCCTG AAGATCAAGC ACAACAACGT GCTGGGCTAT
TTCATCGAGG TCACCTCGAC CCATGCCGAG AAGATGCTCT CGGCCCCGCT CTCCGAGCGA
TTCATCCACC GCCAGACCAC CGCCGGGCAG GTGCGCTTCA CGACGGTGGA GCTGTCGGAG
CTGGAAACGC GGATCCTGAA CGCGGGCAAC CGCGCGCTCG ACCTCGAGAA GATGCATTTC
GCAGCCCTGC GGACGGCGAT CCTCGATCTC GCCGGCCAGA TCGGTCGCGC CGCCCGGTCG
CTGGCCGAGC TCGACCTGAT CTCGGCCTTC GCCGACCTCG CGGCGATCGA GGACTGGACC
GAGCCCGAGA TCGACGACAG CCGCGCCTTC GCCATCGAGG CCGGGCGCCA TCCGGTCGTC
GAGCGGGCCC TGCGCCGCAC CGGCACGCCC TTCGTCGCCA ACCACTGCGA CCTCTCCACC
GGCGGGACGC CGGCGGTCTG GCTCATCACC GGGCCGAACA TGGCCGGCAA ATCCACCTTC
CTGCGCCAGA ACGCCCTGAT CGCGCTCCTC GCACAGGCGG GCAGCTTCGT TCCGGCCCGC
CGCGCCCATA TCGGCCTCGT GAGCCAGATC TTCAGCCGCG TCGGCGCCTC CGACGATCTG
GCGCGCGGCC GCTCGACCTT CATGGTCGAG ATGGTCGAAA CCGCGGCCAT CCTGAACCAG
GCCGATGACC GCGCTCTCGT GATCCTCGAC GAGATCGGCC GCGGCACCGC CACCTGGGAC
GGGCTCTCCA TCGCCTGGGC CACTCTCGAA CATCTGCACG ACCGGAACCG CTGCCGCGCC
CTTTTCGCCA CCCACTACCA CGAGATGACC GCGCTGGCGG GCAAGCTCAA GGGCGTGGAA
AACGCGACCG TCGCGGTGAA GGAATGGGAA GGCGACGTGA TCTTCCTGCA CGAGGTGCGG
CGCGGTGCGG CCGACCGCTC CTACGGCGTG CAGGTCGCAC GGCTTGCGGG CCTGCCCGCC
TCGGTGATCG AGCGCGCACG CACCGTGCTC GACGCGCTCG AGTCGGGCGA GCGCGAGAGC
GGCGGGCGAC GCCAGACGCT CATCGACGAC CTGCCGCTCT TCCGCGCGGC CCCGCCGCCG
CCCGCGCCCG CGGCGCCGAA GACCTCACCC GTGGAAGAGC GGCTGCGTGA GATCCAGCCC
GATGACCTCA GCCCGCGCGA GGCGCTGAAG CTCCTCTACG ATCTCCGCGC CCTGCTGACC
TGA
 
Protein sequence
MEGGSVSDDT VTPMMAQYLE IKAQNPGAIL FYRMGDFYEM FFDDAALAAE ALDIALTKRG 
KHRGEDIAMC GVPIHAAEGY LLTLIRKGFR VAIAEQMEDP AEAKKRGSKS VVRREVVRLV
TPGTLTEDTL LEARRHNYLC AFAEIRDEAA LAWADISTGE LSVTACPLPR LMPELARLAP
RELLVADERE LDWIEEVGCA LTPLSRASFD SASAEKRLCA LFGVGTLESF GNFTRAELSA
MGALVDYLDL TQRGKLPLLR PPVRETVGGT VQIDAATRRN LEITQALAGG RDGSLLSAVD
RTVTAPGARL LERRLSSPTR DLGLIHERLG AVRWLTEEPR LREEMRASLR RVPDMDRALS
RLALDRAGPR DMAAIRAGLA QAQEIAQRMP AEAPALVTRA LEALGGHEAL VDLLDQALVA
EPPLLARDGG FIAQGFDADL DETRRLRDEG RGVIASMQAG FIEVTGIQSL KIKHNNVLGY
FIEVTSTHAE KMLSAPLSER FIHRQTTAGQ VRFTTVELSE LETRILNAGN RALDLEKMHF
AALRTAILDL AGQIGRAARS LAELDLISAF ADLAAIEDWT EPEIDDSRAF AIEAGRHPVV
ERALRRTGTP FVANHCDLST GGTPAVWLIT GPNMAGKSTF LRQNALIALL AQAGSFVPAR
RAHIGLVSQI FSRVGASDDL ARGRSTFMVE MVETAAILNQ ADDRALVILD EIGRGTATWD
GLSIAWATLE HLHDRNRCRA LFATHYHEMT ALAGKLKGVE NATVAVKEWE GDVIFLHEVR
RGAADRSYGV QVARLAGLPA SVIERARTVL DALESGERES GGRRQTLIDD LPLFRAAPPP
PAPAAPKTSP VEERLREIQP DDLSPREALK LLYDLRALLT