Gene Information |
Locus tag | RPB_0528 |
Symbol | |
ID | 3909432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 589261 |
End bp | 591996 |
Gene Length | 2736 bp |
Protein Length | 911 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637882416 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_484150 |
Protein GI | 86747654 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
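The coordinate and length fields above are related by simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends with a single stop codon; neither is stated in the record, and the function name `feature_lengths` is hypothetical, used only for illustration.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = feature_lengths(589261, 591996)
print(gene_len, protein_len)  # 2736 911
```

Under these assumptions the result agrees with the Gene Length (2736 bp) and Protein Length (911 aa) fields.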
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0799066 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCGGG TCATGACCAT CCACCCCGAC ATCGCTCCGC CGCCCGATCT GCCTGCGCCC GCCGAGCCGC CGGCGAAGGT GTCGCCGATG ATGGAGCAGT ACCATGAGAT AAAGGCCGCC AATCCGGGCC TCCTGCTGTT CTACCGGATG GGCGATTTCT ACGAATTGTT CTTCGAGGAC GCCGAGATCG CCTCGCGCGC GCTCGGCATC ACGCTGACCA AGCGCGGCAA GCATCAGGGC ATGGACATCC CGATGTGCGG CGTGCCGGTC GAGCGTTCCG ACGACTATCT GCACCGGCTG ATCGCGCTCG GCCATCGCGT CGCCGTCTGC GAACAGACCG AGGACCCGGC CGCGGCGCGC GCCCGCAAGA GCGTGGTGCG GCGCGACGTG GTGCGGCTGA TCACGCCCGG CACGCTGACC GAAGACACAC TGCTCGATGC CCGCGCCAAC AACTATCTGC TGGCGATCGC CCGCGCCCGC GGCTCCAGCG GGACCGATCG CATTGGTCTC GCCTGGATCG ACATCTCGAC CGGGGAATTC AGCGTCACCG AATGCGCCAC CGGCGAATTG TCGGCGACGC TGGCGCGGAT CAATCCGAAC GAGGCGATCG TCTCGGACGC TTTGTACAGC GATGCCGAAT TGGGGCCGAG CCTGCGCGAA CTCGCCGCCG TGACGCCGCT GACCCGCGAC GTGTTCGACA GTGCCACCGC CGAGCGGCGG CTGTGCGATT ACTTCGCGGT CGCCACCATG GACGGCCTCG CGGCGCTGTC GCGGCTGGAA GCCGCGGCGG CCGCCGCCTG CGTCACCTAT GTGGATCGCA CCCAGCTCGG CAAACGACCG CCTTTGTCGC CGCCGTCGCG CGAGGCCACC GGCGCGACCA TGGCGATTGA TCCGGCGACG CGCGCCAATC TCGAACTGAC GCGGACGCTG GCGGGCGAAC GCCGCGGCTC GCTGCTCGAT GCGATCGACT GCACCGTCAC CGCCGCGGGC TCGCGGCTCC TCGCCCAACG CCTCGCCGCG CCATTGACCG ATGCGGCCGC GATCGCGCGA CGGCTGGATG CGGTCGAAGT CTTCGTCGTC GCGCCCGCGT TGCGCGAGCA GATCCGCAGC GCGCTGCGCG CCGCGCCCGA CATGGCCCGC GCGCTGGCGC GTCTGTCGCT CGGCCGCGGC GGCCCGCGCG ATCTGGCGTC GCTGCGTGAC GGCATCGTCG CCGCCGATCA GGGGCTGCAA CAATTGTCGC AACTCACCGC ACCGCCGCAG GAGATCGCCG CCGCGATGGC GGCGCTGCGG CGACCGTCGC GCGATCTGTG CGACGAGCTG GGCCGCGCAC TCGCCGACGA TCTGCCGCTG CAAAAGCGCG ACGGCGGTTT CGTCCGCGAC GGCTACGAGG CGGCGCTGGA CGAAACCCGC AAGCTGCGCG ACGCCTCGCG CCTGGTGGTC GCGGCGATGC AGGCGCGCTA CGCCGACGAC ACCGGCGTCA AGGGCCTGAA GATCCGGCAC AACAACGTGC TCGGCTATTT CGTCGAGGTG ACCGCGCAGC ACGGCGACAG GCTGATGGCG CCGCCGCTCA ACGCCACCTT CATCCACCGC CAGACGCTGG CCGGCCAGGT CCGTTTCACC ACTGCCGAAC TCGGCGAGAT CGAGGCCAAG ATCGCCAATG CGGGCGACCG CGCGCTCGGG CTCGAACTCG AAATCTTCGA TCGCCTCGCA GCATTGATCG AGACGGCGGG CGAGGATCTG CGCGCCGCTG CCCATGCGTT CGCGCTGCTC GATGTCGCCA CCGCGCTGGC GAAGCTTGCT 
TCCGACGACA ACTACGTGCG GCCGGAGGTC GATCAGTCGC TGTCGTTCGC GATCGAAGGC GGCCGGCATC CGGTGGTCGA ACAGGCGCTG AAGCGAGCGG GCGAACCGTT CATCGCCAAT GCCTGCGATC TCTCTCCCGG CCCGGCGCAA GCCTCGGGCC AGATCTGGCT GCTGACCGGC CCCAACATGG CCGGCAAATC GACCTTCCTG CGCCAGAACG CGCTGATCGC GCTATTGGCG CAGACCGGCA GCTATGTGCC GGCGGCGCGC GCGCGGATCG GCATCGTCGA CCGGCTGTTC TCGCGGGTCG GCGCCGCCGA CGATCTGGCG CGCGGCCGCT CCACCTTCAT GGTCGAGATG GTCGAGACCG CCGCGATCCT CAACCAGGCG ACCGAGCGGG CGCTGGTGAT CCTCGACGAG ATCGGCCGCG GCACCGCGAC CTTCGACGGC CTGTCGATCG CCTGGGCGGC GATCGAGCAT CTGCACGAGC AGAACCGCTG CCGCGCGCTG TTCGCGACGC ATTATCACGA GCTGACCGCG CTGTCGGCCA AGCTGCCGCG GCTGTTCAAC GCCACCGTCC GGGTCAAGGA ATGGCGCGGC GAGGTGGTGT TCCTGCACGA GGTGCTGCCG GGTTCGGCCG ATCGCTCCTA CGGCATCCAG GTCGCCAAGC TCGCCGGGCT TCCCGCCTCC GTGGTGGCGC GGGCGAAATC GGTGCTGGCC AAGCTTGAGG CCAACGACCG CGGCCAGCCC AAGGCGCTGA TCGACGACCT GCCGCTGTTT GCGATCACAG CCCGCGCCCC GGCCGAGCCA TCGCCGCCGA GCGAGGCCGA GCAACTGATC GCGGCCGTGC AGGCGCTGCA TCCGGACGAA CTGAGCCCGC GCGAAGCGCT CGACGCGCTG TATGCGCTGA AGGCGAAGCT GCCGAAGACA ACCTGA
|
Protein sequence | MHRVMTIHPD IAPPPDLPAP AEPPAKVSPM MEQYHEIKAA NPGLLLFYRM GDFYELFFED AEIASRALGI TLTKRGKHQG MDIPMCGVPV ERSDDYLHRL IALGHRVAVC EQTEDPAAAR ARKSVVRRDV VRLITPGTLT EDTLLDARAN NYLLAIARAR GSSGTDRIGL AWIDISTGEF SVTECATGEL SATLARINPN EAIVSDALYS DAELGPSLRE LAAVTPLTRD VFDSATAERR LCDYFAVATM DGLAALSRLE AAAAAACVTY VDRTQLGKRP PLSPPSREAT GATMAIDPAT RANLELTRTL AGERRGSLLD AIDCTVTAAG SRLLAQRLAA PLTDAAAIAR RLDAVEVFVV APALREQIRS ALRAAPDMAR ALARLSLGRG GPRDLASLRD GIVAADQGLQ QLSQLTAPPQ EIAAAMAALR RPSRDLCDEL GRALADDLPL QKRDGGFVRD GYEAALDETR KLRDASRLVV AAMQARYADD TGVKGLKIRH NNVLGYFVEV TAQHGDRLMA PPLNATFIHR QTLAGQVRFT TAELGEIEAK IANAGDRALG LELEIFDRLA ALIETAGEDL RAAAHAFALL DVATALAKLA SDDNYVRPEV DQSLSFAIEG GRHPVVEQAL KRAGEPFIAN ACDLSPGPAQ ASGQIWLLTG PNMAGKSTFL RQNALIALLA QTGSYVPAAR ARIGIVDRLF SRVGAADDLA RGRSTFMVEM VETAAILNQA TERALVILDE IGRGTATFDG LSIAWAAIEH LHEQNRCRAL FATHYHELTA LSAKLPRLFN ATVRVKEWRG EVVFLHEVLP GSADRSYGIQ VAKLAGLPAS VVARAKSVLA KLEANDRGQP KALIDDLPLF AITARAPAEP SPPSEAEQLI AAVQALHPDE LSPREALDAL YALKAKLPKT T
|
| |
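The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how one might strip that grouping, compute GC content, and translate the gene sequence. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares; the sketch does not handle the alternative start codons of table 11, which is sufficient here only because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (shared by tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace used to group the sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence in this record.
start = clean_sequence("ATGCATCGGG TCATGACCAT")
print(translate(start))  # MHRVMT
```

Applied to the full gene sequence, the same functions could be used to check the record's GC content and Protein sequence fields.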