Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_0513 |
Symbol | |
ID | 6408162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 558419 |
End bp | 561142 |
Gene Length | 2724 bp |
Protein Length | 907 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 642710425 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001989548 |
Protein GI | 192288943 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCATCC GCCCCGACAT CGCTCTACCG CCCGACGCCG CTCCGCCTCC GGAGGCGCCC GCCAAAATGT CGCCGATGAT GGAGCAGTAC CACGAGATCA AAGCCGCCAA TCCTGGCCTG TTGCTGTTCT ACCGGATGGG CGATTTCTAC GAGCTGTTCT TCGAGGATGC CGAAATCGCG TCACGCGCGC TCGGTATTAC CCTGACCAAG CGCGGCAAGC ATCTCGGCGC CGACATTCCG ATGTGCGGTG TGCCGGTCGA GCGCTCCGAC GACTACCTGC ACCGGCTGAT CGCGCTGGGT CACCGCGTCG CTGTGTGCGA GCAGACCGAA GACCCGGCCG CGGCGCGCGC CCGCAAGAGC GTGGTGCGGC GCGACGTGGT GCGGCTGATC ACGCCCGGTA CGCTGACCGA AGATACCCTG CTCGACGCCC GCGCCAACAA CTACCTGCTG GCGATCGCGC GCGCCCGTGG CTCGGCCGGC GCCGATCGCA TCGGGCTCGC CTGGATCGAC ATCTCGACTG GCGAATTCTG CGTCACCGAG TGCACGACCG CAGAACTCGC CGCGACGCTG GCGCGGATCA ATCCGAACGA AGCCATCGTG CCGGACGCGC TGTACAGCGA CACAGAACTC GCCCCGACCT TGCGCGAGCT CGCCGCCGTC ACGCCGCTGA CGCGTGACGT GTTCGATTCC GCCACCGCCG AGCGGCGGCT GTGCGATTAC TTCGCTGTCG CCACCATGGA CGGCCTCGCC GCGCTGTCAC GGCTGGAAGC GACCGCCGCC GCGGCCTGCG TCACCTATGT CGACCGTACC CAGCTCGGCA AACGGCCGCC GCTGTCGCCG CCGTCACGCG AAGCCGCCGG CACCACGATG GCGATCGACC CGGCGACCCG CGCCAATCTC GAACTCACCC GCACGCTCGC CGGCGAACGC CGCGGCTCGC TGCTCGACGC GATCGACTGC ACGGTTACAG CCGCAGGATC TCGCCTCTTG GCGCAGCGGC TCGCCGCGCC GCTGACCGAT GCGGCGGCGA TCGCGCGGCG GCTCGACGCG GTCGAAGCCT TCACCGGGGA TGCGGGACTT CGCGAACAGA TCCGCAGCTC GCTGCGTGCG GCGCCCGACA TGGCGCGTGC ACTGGCGCGG CTGTCGCTCG GCCGCGGCGG CCCGCGGGAT CTCGCGAACT TGCGCGATGG CATCCGCGCT GCCGACGAGG TGATTGCGCA GCTCGGCCAG CTCGCAAGCC CGCCGCAGGA GATCGCGAGC GCGATGGCGG CGCTGCAGCG GCCGTCACGC GCATTGTGCG CCGAGCTCGG CCGCGCGCTC GCCGACGATC TGCCGCTTCT CAAGCGCGAC GGCGGCTTCG TGCGCGAAGG CTACGAGCCG GCGCTCGACG AGACCCGCAA GCTGCGCGAC GCCTCGCGGC TGGTGGTGGC GTCGATGCAG GCGCGCTACG CCGACGACAC CGGGATCAAG GCGCTGAAGA TCCGGCACAA CAACGTGCTC GGTTACTTCG TCGAGGTCTC GGCGCAGCAC GGCGACAAGT TGATGGCGCC GCCACTGAAC GCCACTTTCA TCCATCGCCA GACGCTGGCC GGGCAGGTGC GCTTCACCAC CGCCGAACTC GGCGAGATCG AGGCCAAGAT CGCCAATGCG GGCGACCGCG CACTCGGGCT GGAGCTGGAG ATCTTCGACC GCCTCGCCGC GATGATCGAT GCGGCCGGTG AAGACCTGCG CGCCGCCGCC CATGCGTTCG CGCTGCTCGA TGTCGCCACC GCGCTCGCCA AGCTCGCCAG CGACGACAAC TACGTGCGGC CCGAGGTCGA CGAGTCGCTG AGCTTTGCGA TCGAAGGCGG CAGGCATCCG GTGGTCGAGC AGGCGCTGAA GAAGGCTGGC GAGCCGTTCA TCGCCAATGC CTGCGACCTG TCGCCCGGCC CGGCGCAGAC CAACGGCCAG ATCTGGCTGC TGACCGGCCC GAACATGGCC GGTAAGTCGA CCTTCCTGCG CCAGAACGCG CTGATCGCCC TGCTCGCCCA GGTCGGCAGC TTCGTGCCGG CGATCCGGGC ACGGATCGGC ATCGTCGACC GGCTGTTCTC GCGCGTCGGC GCCGCCGACG ACCTCGCCCG CGGCCGTTCG ACCTTCATGG TCGAGATGGT CGAGACCGCC GCGATCCTGA ACCAGGCCTC CGAACGGGCG CTGGTGATCC TCGACGAGAT CGGCCGCGGC ACCGCGACGT TCGACGGCCT CTCGATCGCC TGGGCGGCGA TCGAGCACCT GCACGAACAG AACAGGTGTC GTTCGCTGTT CGCCACGCAC TACCATGAAC TGACCGCACT GTCGGCCAAG CTGCCGCGGC TGTTCAACGC CACCGTGCGG GTCAAGGAAT GGCGCGGCGA GGTGGTGTTT CTGCACGAGG TGCTGCCGGG CTCCGCCGAC CGCTCCTACG GCATTCAGGT CGCCAAGCTG GCGGGGCTCC CCCCCAGCGT GGTGAGCCGC GCGAAGGCCG TGCTGGCCAA GCTCGAAGCC AACGACCGCG GTCAGCCGAA GACGCTGATC GACGACCTGC CGCTGTTCGC CATCACGGCT CGCGCACCCG CCGAAGCCGC CCCACCGAGC GAGGCCGAGC AGCTGATCGA CGCGGTCAAG GCGCTGCATC CCGACGAGAT GACCCCGCGC GAGGCGCTGG ATGCGTTGTA CGCCCTGAAG GCGAAGCTAC CGAAGGCCGA CTGA
|
Protein sequence | MTIRPDIALP PDAAPPPEAP AKMSPMMEQY HEIKAANPGL LLFYRMGDFY ELFFEDAEIA SRALGITLTK RGKHLGADIP MCGVPVERSD DYLHRLIALG HRVAVCEQTE DPAAARARKS VVRRDVVRLI TPGTLTEDTL LDARANNYLL AIARARGSAG ADRIGLAWID ISTGEFCVTE CTTAELAATL ARINPNEAIV PDALYSDTEL APTLRELAAV TPLTRDVFDS ATAERRLCDY FAVATMDGLA ALSRLEATAA AACVTYVDRT QLGKRPPLSP PSREAAGTTM AIDPATRANL ELTRTLAGER RGSLLDAIDC TVTAAGSRLL AQRLAAPLTD AAAIARRLDA VEAFTGDAGL REQIRSSLRA APDMARALAR LSLGRGGPRD LANLRDGIRA ADEVIAQLGQ LASPPQEIAS AMAALQRPSR ALCAELGRAL ADDLPLLKRD GGFVREGYEP ALDETRKLRD ASRLVVASMQ ARYADDTGIK ALKIRHNNVL GYFVEVSAQH GDKLMAPPLN ATFIHRQTLA GQVRFTTAEL GEIEAKIANA GDRALGLELE IFDRLAAMID AAGEDLRAAA HAFALLDVAT ALAKLASDDN YVRPEVDESL SFAIEGGRHP VVEQALKKAG EPFIANACDL SPGPAQTNGQ IWLLTGPNMA GKSTFLRQNA LIALLAQVGS FVPAIRARIG IVDRLFSRVG AADDLARGRS TFMVEMVETA AILNQASERA LVILDEIGRG TATFDGLSIA WAAIEHLHEQ NRCRSLFATH YHELTALSAK LPRLFNATVR VKEWRGEVVF LHEVLPGSAD RSYGIQVAKL AGLPPSVVSR AKAVLAKLEA NDRGQPKTLI DDLPLFAITA RAPAEAAPPS EAEQLIDAVK ALHPDEMTPR EALDALYALK AKLPKAD
|
| |