Gene RPB_0528 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0528 
Symbol 
ID3909432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp589261 
End bp591996 
Gene Length2736 bp 
Protein Length911 aa 
Translation table11 
GC content70% 
IMG OID637882416 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_484150 
Protein GI86747654 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0799066 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCGGG TCATGACCAT CCACCCCGAC ATCGCTCCGC CGCCCGATCT GCCTGCGCCC 
GCCGAGCCGC CGGCGAAGGT GTCGCCGATG ATGGAGCAGT ACCATGAGAT AAAGGCCGCC
AATCCGGGCC TCCTGCTGTT CTACCGGATG GGCGATTTCT ACGAATTGTT CTTCGAGGAC
GCCGAGATCG CCTCGCGCGC GCTCGGCATC ACGCTGACCA AGCGCGGCAA GCATCAGGGC
ATGGACATCC CGATGTGCGG CGTGCCGGTC GAGCGTTCCG ACGACTATCT GCACCGGCTG
ATCGCGCTCG GCCATCGCGT CGCCGTCTGC GAACAGACCG AGGACCCGGC CGCGGCGCGC
GCCCGCAAGA GCGTGGTGCG GCGCGACGTG GTGCGGCTGA TCACGCCCGG CACGCTGACC
GAAGACACAC TGCTCGATGC CCGCGCCAAC AACTATCTGC TGGCGATCGC CCGCGCCCGC
GGCTCCAGCG GGACCGATCG CATTGGTCTC GCCTGGATCG ACATCTCGAC CGGGGAATTC
AGCGTCACCG AATGCGCCAC CGGCGAATTG TCGGCGACGC TGGCGCGGAT CAATCCGAAC
GAGGCGATCG TCTCGGACGC TTTGTACAGC GATGCCGAAT TGGGGCCGAG CCTGCGCGAA
CTCGCCGCCG TGACGCCGCT GACCCGCGAC GTGTTCGACA GTGCCACCGC CGAGCGGCGG
CTGTGCGATT ACTTCGCGGT CGCCACCATG GACGGCCTCG CGGCGCTGTC GCGGCTGGAA
GCCGCGGCGG CCGCCGCCTG CGTCACCTAT GTGGATCGCA CCCAGCTCGG CAAACGACCG
CCTTTGTCGC CGCCGTCGCG CGAGGCCACC GGCGCGACCA TGGCGATTGA TCCGGCGACG
CGCGCCAATC TCGAACTGAC GCGGACGCTG GCGGGCGAAC GCCGCGGCTC GCTGCTCGAT
GCGATCGACT GCACCGTCAC CGCCGCGGGC TCGCGGCTCC TCGCCCAACG CCTCGCCGCG
CCATTGACCG ATGCGGCCGC GATCGCGCGA CGGCTGGATG CGGTCGAAGT CTTCGTCGTC
GCGCCCGCGT TGCGCGAGCA GATCCGCAGC GCGCTGCGCG CCGCGCCCGA CATGGCCCGC
GCGCTGGCGC GTCTGTCGCT CGGCCGCGGC GGCCCGCGCG ATCTGGCGTC GCTGCGTGAC
GGCATCGTCG CCGCCGATCA GGGGCTGCAA CAATTGTCGC AACTCACCGC ACCGCCGCAG
GAGATCGCCG CCGCGATGGC GGCGCTGCGG CGACCGTCGC GCGATCTGTG CGACGAGCTG
GGCCGCGCAC TCGCCGACGA TCTGCCGCTG CAAAAGCGCG ACGGCGGTTT CGTCCGCGAC
GGCTACGAGG CGGCGCTGGA CGAAACCCGC AAGCTGCGCG ACGCCTCGCG CCTGGTGGTC
GCGGCGATGC AGGCGCGCTA CGCCGACGAC ACCGGCGTCA AGGGCCTGAA GATCCGGCAC
AACAACGTGC TCGGCTATTT CGTCGAGGTG ACCGCGCAGC ACGGCGACAG GCTGATGGCG
CCGCCGCTCA ACGCCACCTT CATCCACCGC CAGACGCTGG CCGGCCAGGT CCGTTTCACC
ACTGCCGAAC TCGGCGAGAT CGAGGCCAAG ATCGCCAATG CGGGCGACCG CGCGCTCGGG
CTCGAACTCG AAATCTTCGA TCGCCTCGCA GCATTGATCG AGACGGCGGG CGAGGATCTG
CGCGCCGCTG CCCATGCGTT CGCGCTGCTC GATGTCGCCA CCGCGCTGGC GAAGCTTGCT
TCCGACGACA ACTACGTGCG GCCGGAGGTC GATCAGTCGC TGTCGTTCGC GATCGAAGGC
GGCCGGCATC CGGTGGTCGA ACAGGCGCTG AAGCGAGCGG GCGAACCGTT CATCGCCAAT
GCCTGCGATC TCTCTCCCGG CCCGGCGCAA GCCTCGGGCC AGATCTGGCT GCTGACCGGC
CCCAACATGG CCGGCAAATC GACCTTCCTG CGCCAGAACG CGCTGATCGC GCTATTGGCG
CAGACCGGCA GCTATGTGCC GGCGGCGCGC GCGCGGATCG GCATCGTCGA CCGGCTGTTC
TCGCGGGTCG GCGCCGCCGA CGATCTGGCG CGCGGCCGCT CCACCTTCAT GGTCGAGATG
GTCGAGACCG CCGCGATCCT CAACCAGGCG ACCGAGCGGG CGCTGGTGAT CCTCGACGAG
ATCGGCCGCG GCACCGCGAC CTTCGACGGC CTGTCGATCG CCTGGGCGGC GATCGAGCAT
CTGCACGAGC AGAACCGCTG CCGCGCGCTG TTCGCGACGC ATTATCACGA GCTGACCGCG
CTGTCGGCCA AGCTGCCGCG GCTGTTCAAC GCCACCGTCC GGGTCAAGGA ATGGCGCGGC
GAGGTGGTGT TCCTGCACGA GGTGCTGCCG GGTTCGGCCG ATCGCTCCTA CGGCATCCAG
GTCGCCAAGC TCGCCGGGCT TCCCGCCTCC GTGGTGGCGC GGGCGAAATC GGTGCTGGCC
AAGCTTGAGG CCAACGACCG CGGCCAGCCC AAGGCGCTGA TCGACGACCT GCCGCTGTTT
GCGATCACAG CCCGCGCCCC GGCCGAGCCA TCGCCGCCGA GCGAGGCCGA GCAACTGATC
GCGGCCGTGC AGGCGCTGCA TCCGGACGAA CTGAGCCCGC GCGAAGCGCT CGACGCGCTG
TATGCGCTGA AGGCGAAGCT GCCGAAGACA ACCTGA
 
Protein sequence
MHRVMTIHPD IAPPPDLPAP AEPPAKVSPM MEQYHEIKAA NPGLLLFYRM GDFYELFFED 
AEIASRALGI TLTKRGKHQG MDIPMCGVPV ERSDDYLHRL IALGHRVAVC EQTEDPAAAR
ARKSVVRRDV VRLITPGTLT EDTLLDARAN NYLLAIARAR GSSGTDRIGL AWIDISTGEF
SVTECATGEL SATLARINPN EAIVSDALYS DAELGPSLRE LAAVTPLTRD VFDSATAERR
LCDYFAVATM DGLAALSRLE AAAAAACVTY VDRTQLGKRP PLSPPSREAT GATMAIDPAT
RANLELTRTL AGERRGSLLD AIDCTVTAAG SRLLAQRLAA PLTDAAAIAR RLDAVEVFVV
APALREQIRS ALRAAPDMAR ALARLSLGRG GPRDLASLRD GIVAADQGLQ QLSQLTAPPQ
EIAAAMAALR RPSRDLCDEL GRALADDLPL QKRDGGFVRD GYEAALDETR KLRDASRLVV
AAMQARYADD TGVKGLKIRH NNVLGYFVEV TAQHGDRLMA PPLNATFIHR QTLAGQVRFT
TAELGEIEAK IANAGDRALG LELEIFDRLA ALIETAGEDL RAAAHAFALL DVATALAKLA
SDDNYVRPEV DQSLSFAIEG GRHPVVEQAL KRAGEPFIAN ACDLSPGPAQ ASGQIWLLTG
PNMAGKSTFL RQNALIALLA QTGSYVPAAR ARIGIVDRLF SRVGAADDLA RGRSTFMVEM
VETAAILNQA TERALVILDE IGRGTATFDG LSIAWAAIEH LHEQNRCRAL FATHYHELTA
LSAKLPRLFN ATVRVKEWRG EVVFLHEVLP GSADRSYGIQ VAKLAGLPAS VVARAKSVLA
KLEANDRGQP KALIDDLPLF AITARAPAEP SPPSEAEQLI AAVQALHPDE LSPREALDAL
YALKAKLPKT T