Gene RPD_0321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0321 
Symbol 
ID4020780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp370238 
End bp373009 
Gene Length2772 bp 
Protein Length923 aa 
Translation table11 
GC content70% 
IMG OID637960499 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_567460 
Protein GI91974801 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.784021 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.870381 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCGGG TCATGACCAT CCGTCCCGAC ATTCCTCCGC AGCCCGATAT CGCCGCCCCG 
GCCGAACCGC CGGCGCGGGT ATCGCCGATG ATGGAGCAGT ATCACGAGAT CAAGGCCGCC
AATCCCGGCC TGCTGCTGTT CTACCGGATG GGCGATTTCT ACGAGCTGTT CTTCGAGGAC
GCCGAGATCG CTTCGCGCGC GCTCGGCATC ACGCTGACCA AGCGCGGCAA GCATCTCGGC
GCCGACATTC CGATGTGCGG CGTGCCGGTG GAGCGGTCCG ACGATTACCT GCACCGGCTG
ATCGCGCTGG GGCATCGCGT CGCAGTATGC GAGCAGACCG AAGACCCGGC GGCGGCGCGC
GCCCGCAAGA GCGTGGTGCG CCGCGACGTG GTGCGACTGA TCACGCCCGG CACGCTCACC
GAAGACACCC TGCTCGACGC CCGCGCCAAC AACTATCTGA TGGCGATCGC GCGCACGCGC
GGCTCGGCCG GCGTTGATCG CATCGGGCTC GCCTGGATCG ACATTTCGAC CGGCGAATTC
TGCGTCACCG AATGCGCGAC CGGCGAATTG TCGGCGACGC TGGCGCGGAT CAACCCGAAC
GAGGCGATCG TCTCGGACGC GCTGTACAGC GACGCCGAAC TCGGGCCGAG CCTGCGGGAA
CTCGCCGCGG TGACGCCCTT GACCCGCGAC GTGTTCGACT CGGCCACCGC CGAGCGGCGG
CTGTGCGATT ACTTCGCGGT GGCCACCATG GACGGCCTCG CCGCGCTGTC GCGGCTCGAG
GCCGCTGCGG CCGCCGCCTG CGTCACCTAT GTGGATCGCA CCCAGCTCGG CAAGCGGCCG
CCTTTGTCGC CGCCGGCGCG CGAGGCCGCA GGCGCCACCA TGGCGATCGA TCCCGCCACC
CGCGCCAATC TCGAACTGAC CCGCACGCTC GGCGGCGAAC GCCGCGGCTC GCTGCTCGAT
GCGATCGACT GCACCGTCAC CGCCGCCGGC TCGCGCCTTC TCGCCCAGCG GCTCGCGGCG
CCGCTGACCG ACGAGGCGGC GATCGCACGG CGGCTCGACG CGGTCGCGGC CTTCGTCGCG
GACAGCGCGC TGCGCGAACA GATCCGCAGC GCGCTCCGCG CCGCGCCCGA CATGGCGCGG
GCGCTGGCGC GGCTGTCGCT CGGCCGCGGC GGGCCGCGCG ATCTCGCGAG CCTGCGCGAC
GGCGTGAGCG CCGCCGACAA GGTGCTGGCG CAGCTTTCGC AGCTCGCCCA ACCGCCGCAC
GACATCGCCG CCGCGATGGC GGCGCTGCGG CGGCCGTCGC GCGATCTCTG CCAAGAACTC
GCCCGCGCGC TCGCCGACGA TCTGCCGCTG TTGAAACGCG ACGGCGGTTT CGTCCGCGAC
GGTTATGAGG CCGCGCTCGA CGAGACCCGC AAGCTGCGCG ACGCCTCGCG GCTCGTCGTG
GCGGCGATGC AGGCGCGCTA CGCCGACGAG ACCGGCGTCA AAGGGCTGAA GATCCGGCAC
AACAACGTGC TCGGCTATTT CGTCGAAGTC ACCGCGCAGC ACGGCGACCG TCTGATGGCG
CCGCCGCTCA ACGCCACCTT CATCCATCGC CAGACGCTGG CCGGTCAGGT CCGGTTCACC
ACCGCCGAAC TCGGCGAGAT CGAGGCCAAG ATCGCCAATG CGGGCGACCG CGCGCTGGGG
CTCGAACTGG ACATCTTCGA CCGCCTCGCG GCGATGATCG ACACGGCCGG CGACGATCTG
CGCGCCGCGG CCCATGCGTT TGCGTTGCTC GACGTCGCGA CCGCGCTGGC CAAGCTCGCC
GTCTCCGACA ACTACGTCCG GCCGGAAGTC GACGGGTCGC TCGCTTTCGC GATCGAAGGC
GGCCGGCATC CGGTGGTCGA GCAGGCGCTC AAGCGCGCCG GCGAGCCGTT CATCGCCAAT
GCCTGCGACC TGTCGCCGGT CCCCCCACCC TTCCCTCCCC CGCTTGCGGG GGAGGGAAGG
GTGGGGGCCG GGCAGATCTG GCTGCTGACC GGCCCAAACA TGGCCGGTAA ATCGACTTTC
TTGCGCCAGA ACGCGTTGAT CGCGCTGTTG GCGCAGACCG GCAGCTTCGT GCCGGCGAGC
CGCGCCAGAA TCGGCATCGT CGACCGGCTG TTCTCCCGGG TCGGCGCCGC CGACGATCTC
GCGCGCGGCC GCTCGACCTT CATGGTCGAG ATGGTCGAGA CCGCCACGAT CCTCAATCAG
GCGACCGAGC GGGCGCTGGT GATCCTCGAC GAGATCGGCC GCGGCACGGC GACCTTCGAC
GGCCTGTCGA TCGCCTGGGC GGCGATCGAG CATCTGCACG AGCAGAACCG TTGCCGCGCG
CTGTTCGCGA CGCATTACCA CGAGCTGACC GCGCTCTCCG CCAAACTGCC GCGGCTGTTC
AACGCCACGG TGCGGGTCAA GGAATGGCGC GGCGAGGTGG TGTTCCTGCA CGAGGTGCTG
CCGGGCTCGG CCGACCGCTC TTACGGCATT CAGGTCGCCA AGCTCGCGGG GTTGCCGCCG
TCGGTGGTGG CGCGGGCGAA GTCGGTGCTG GCCAAACTCG AAGCCAACGA CCGCGGTCAA
TCGGCGCGGA CGCTCGCCGA CGATCTGCCG CTGTTCGCCA TGACCGCGCG GGCGCCGGTC
GAGCCCCCGC CGCCGAGCGA GGCCGAGCAA CTGATCGAAG CGGTAAGGGC GCTACACCCC
GACGAACTCA GCCCACGCGA GGCGCTCGAC GCGTTGTATG CCCTGAAGGC GAAGTTGCCG
AAGGCAGATT GA
 
Protein sequence
MHRVMTIRPD IPPQPDIAAP AEPPARVSPM MEQYHEIKAA NPGLLLFYRM GDFYELFFED 
AEIASRALGI TLTKRGKHLG ADIPMCGVPV ERSDDYLHRL IALGHRVAVC EQTEDPAAAR
ARKSVVRRDV VRLITPGTLT EDTLLDARAN NYLMAIARTR GSAGVDRIGL AWIDISTGEF
CVTECATGEL SATLARINPN EAIVSDALYS DAELGPSLRE LAAVTPLTRD VFDSATAERR
LCDYFAVATM DGLAALSRLE AAAAAACVTY VDRTQLGKRP PLSPPAREAA GATMAIDPAT
RANLELTRTL GGERRGSLLD AIDCTVTAAG SRLLAQRLAA PLTDEAAIAR RLDAVAAFVA
DSALREQIRS ALRAAPDMAR ALARLSLGRG GPRDLASLRD GVSAADKVLA QLSQLAQPPH
DIAAAMAALR RPSRDLCQEL ARALADDLPL LKRDGGFVRD GYEAALDETR KLRDASRLVV
AAMQARYADE TGVKGLKIRH NNVLGYFVEV TAQHGDRLMA PPLNATFIHR QTLAGQVRFT
TAELGEIEAK IANAGDRALG LELDIFDRLA AMIDTAGDDL RAAAHAFALL DVATALAKLA
VSDNYVRPEV DGSLAFAIEG GRHPVVEQAL KRAGEPFIAN ACDLSPVPPP FPPPLAGEGR
VGAGQIWLLT GPNMAGKSTF LRQNALIALL AQTGSFVPAS RARIGIVDRL FSRVGAADDL
ARGRSTFMVE MVETATILNQ ATERALVILD EIGRGTATFD GLSIAWAAIE HLHEQNRCRA
LFATHYHELT ALSAKLPRLF NATVRVKEWR GEVVFLHEVL PGSADRSYGI QVAKLAGLPP
SVVARAKSVL AKLEANDRGQ SARTLADDLP LFAMTARAPV EPPPPSEAEQ LIEAVRALHP
DELSPREALD ALYALKAKLP KAD