Gene Information |
Locus tag | RPD_0321 |
Symbol | |
ID | 4020780 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 370238 |
End bp | 373009 |
Gene Length | 2772 bp |
Protein Length | 923 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637960499 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_567460 |
Protein GI | 91974801 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
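The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 370238
end_bp = 373009
gene_length_bp = 2772
protein_length_aa = 923


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both comparisons hold for this record: 373009 - 370238 + 1 = 2772 bp, and 2772 / 3 - 1 = 923 aa.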
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.784021 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.870381 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCGGG TCATGACCAT CCGTCCCGAC ATTCCTCCGC AGCCCGATAT CGCCGCCCCG GCCGAACCGC CGGCGCGGGT ATCGCCGATG ATGGAGCAGT ATCACGAGAT CAAGGCCGCC AATCCCGGCC TGCTGCTGTT CTACCGGATG GGCGATTTCT ACGAGCTGTT CTTCGAGGAC GCCGAGATCG CTTCGCGCGC GCTCGGCATC ACGCTGACCA AGCGCGGCAA GCATCTCGGC GCCGACATTC CGATGTGCGG CGTGCCGGTG GAGCGGTCCG ACGATTACCT GCACCGGCTG ATCGCGCTGG GGCATCGCGT CGCAGTATGC GAGCAGACCG AAGACCCGGC GGCGGCGCGC GCCCGCAAGA GCGTGGTGCG CCGCGACGTG GTGCGACTGA TCACGCCCGG CACGCTCACC GAAGACACCC TGCTCGACGC CCGCGCCAAC AACTATCTGA TGGCGATCGC GCGCACGCGC GGCTCGGCCG GCGTTGATCG CATCGGGCTC GCCTGGATCG ACATTTCGAC CGGCGAATTC TGCGTCACCG AATGCGCGAC CGGCGAATTG TCGGCGACGC TGGCGCGGAT CAACCCGAAC GAGGCGATCG TCTCGGACGC GCTGTACAGC GACGCCGAAC TCGGGCCGAG CCTGCGGGAA CTCGCCGCGG TGACGCCCTT GACCCGCGAC GTGTTCGACT CGGCCACCGC CGAGCGGCGG CTGTGCGATT ACTTCGCGGT GGCCACCATG GACGGCCTCG CCGCGCTGTC GCGGCTCGAG GCCGCTGCGG CCGCCGCCTG CGTCACCTAT GTGGATCGCA CCCAGCTCGG CAAGCGGCCG CCTTTGTCGC CGCCGGCGCG CGAGGCCGCA GGCGCCACCA TGGCGATCGA TCCCGCCACC CGCGCCAATC TCGAACTGAC CCGCACGCTC GGCGGCGAAC GCCGCGGCTC GCTGCTCGAT GCGATCGACT GCACCGTCAC CGCCGCCGGC TCGCGCCTTC TCGCCCAGCG GCTCGCGGCG CCGCTGACCG ACGAGGCGGC GATCGCACGG CGGCTCGACG CGGTCGCGGC CTTCGTCGCG GACAGCGCGC TGCGCGAACA GATCCGCAGC GCGCTCCGCG CCGCGCCCGA CATGGCGCGG GCGCTGGCGC GGCTGTCGCT CGGCCGCGGC GGGCCGCGCG ATCTCGCGAG CCTGCGCGAC GGCGTGAGCG CCGCCGACAA GGTGCTGGCG CAGCTTTCGC AGCTCGCCCA ACCGCCGCAC GACATCGCCG CCGCGATGGC GGCGCTGCGG CGGCCGTCGC GCGATCTCTG CCAAGAACTC GCCCGCGCGC TCGCCGACGA TCTGCCGCTG TTGAAACGCG ACGGCGGTTT CGTCCGCGAC GGTTATGAGG CCGCGCTCGA CGAGACCCGC AAGCTGCGCG ACGCCTCGCG GCTCGTCGTG GCGGCGATGC AGGCGCGCTA CGCCGACGAG ACCGGCGTCA AAGGGCTGAA GATCCGGCAC AACAACGTGC TCGGCTATTT CGTCGAAGTC ACCGCGCAGC ACGGCGACCG TCTGATGGCG CCGCCGCTCA ACGCCACCTT CATCCATCGC CAGACGCTGG CCGGTCAGGT CCGGTTCACC ACCGCCGAAC TCGGCGAGAT CGAGGCCAAG ATCGCCAATG CGGGCGACCG CGCGCTGGGG CTCGAACTGG ACATCTTCGA CCGCCTCGCG GCGATGATCG ACACGGCCGG CGACGATCTG CGCGCCGCGG CCCATGCGTT TGCGTTGCTC GACGTCGCGA CCGCGCTGGC CAAGCTCGCC 
GTCTCCGACA ACTACGTCCG GCCGGAAGTC GACGGGTCGC TCGCTTTCGC GATCGAAGGC GGCCGGCATC CGGTGGTCGA GCAGGCGCTC AAGCGCGCCG GCGAGCCGTT CATCGCCAAT GCCTGCGACC TGTCGCCGGT CCCCCCACCC TTCCCTCCCC CGCTTGCGGG GGAGGGAAGG GTGGGGGCCG GGCAGATCTG GCTGCTGACC GGCCCAAACA TGGCCGGTAA ATCGACTTTC TTGCGCCAGA ACGCGTTGAT CGCGCTGTTG GCGCAGACCG GCAGCTTCGT GCCGGCGAGC CGCGCCAGAA TCGGCATCGT CGACCGGCTG TTCTCCCGGG TCGGCGCCGC CGACGATCTC GCGCGCGGCC GCTCGACCTT CATGGTCGAG ATGGTCGAGA CCGCCACGAT CCTCAATCAG GCGACCGAGC GGGCGCTGGT GATCCTCGAC GAGATCGGCC GCGGCACGGC GACCTTCGAC GGCCTGTCGA TCGCCTGGGC GGCGATCGAG CATCTGCACG AGCAGAACCG TTGCCGCGCG CTGTTCGCGA CGCATTACCA CGAGCTGACC GCGCTCTCCG CCAAACTGCC GCGGCTGTTC AACGCCACGG TGCGGGTCAA GGAATGGCGC GGCGAGGTGG TGTTCCTGCA CGAGGTGCTG CCGGGCTCGG CCGACCGCTC TTACGGCATT CAGGTCGCCA AGCTCGCGGG GTTGCCGCCG TCGGTGGTGG CGCGGGCGAA GTCGGTGCTG GCCAAACTCG AAGCCAACGA CCGCGGTCAA TCGGCGCGGA CGCTCGCCGA CGATCTGCCG CTGTTCGCCA TGACCGCGCG GGCGCCGGTC GAGCCCCCGC CGCCGAGCGA GGCCGAGCAA CTGATCGAAG CGGTAAGGGC GCTACACCCC GACGAACTCA GCCCACGCGA GGCGCTCGAC GCGTTGTATG CCCTGAAGGC GAAGTTGCCG AAGGCAGATT GA
|
Protein sequence | MHRVMTIRPD IPPQPDIAAP AEPPARVSPM MEQYHEIKAA NPGLLLFYRM GDFYELFFED AEIASRALGI TLTKRGKHLG ADIPMCGVPV ERSDDYLHRL IALGHRVAVC EQTEDPAAAR ARKSVVRRDV VRLITPGTLT EDTLLDARAN NYLMAIARTR GSAGVDRIGL AWIDISTGEF CVTECATGEL SATLARINPN EAIVSDALYS DAELGPSLRE LAAVTPLTRD VFDSATAERR LCDYFAVATM DGLAALSRLE AAAAAACVTY VDRTQLGKRP PLSPPAREAA GATMAIDPAT RANLELTRTL GGERRGSLLD AIDCTVTAAG SRLLAQRLAA PLTDEAAIAR RLDAVAAFVA DSALREQIRS ALRAAPDMAR ALARLSLGRG GPRDLASLRD GVSAADKVLA QLSQLAQPPH DIAAAMAALR RPSRDLCQEL ARALADDLPL LKRDGGFVRD GYEAALDETR KLRDASRLVV AAMQARYADE TGVKGLKIRH NNVLGYFVEV TAQHGDRLMA PPLNATFIHR QTLAGQVRFT TAELGEIEAK IANAGDRALG LELDIFDRLA AMIDTAGDDL RAAAHAFALL DVATALAKLA VSDNYVRPEV DGSLAFAIEG GRHPVVEQAL KRAGEPFIAN ACDLSPVPPP FPPPLAGEGR VGAGQIWLLT GPNMAGKSTF LRQNALIALL AQTGSFVPAS RARIGIVDRL FSRVGAADDL ARGRSTFMVE MVETATILNQ ATERALVILD EIGRGTATFD GLSIAWAAIE HLHEQNRCRA LFATHYHELT ALSAKLPRLF NATVRVKEWR GEVVFLHEVL PGSADRSYGI QVAKLAGLPP SVVARAKSVL AKLEANDRGQ SARTLADDLP LFAMTARAPV EPPPPSEAEQ LIEAVRALHP DELSPREALD ALYALKAKLP KAD
|
| |
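The gene and protein sequences above are related through translation table 11. The following is a minimal sketch of how a few codons from the record could be translated and how a GC fraction could be computed. It is not the database's own method. The helper names `clean`, `translate`, and `gc_content` are hypothetical, and the codon table is built from the amino-acid assignments of NCBI table 11, which match the standard code (table 11 differs only in its permitted start codons).

```python
# Amino-acid assignments for NCBI translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 21 bases of the gene sequence shown above.
head = "ATGCATCGGG TCATGACCAT CCGTCCCGAC"
tail = "AAGTTGCCGAAGGCAGATTGA"
print(translate(head))  # MHRVMTIRPD
print(translate(tail))  # KLPKAD
```

The two fragments reproduce the start (`MHRVMTIRPD`) and end (`KLPKAD`) of the protein sequence listed in the record. Passing the full gene sequence to `gc_content` would give a value to compare against the reported GC content field.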