Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A3541 |
Symbol | |
ID | 3836996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 4073757 |
End bp | 4076546 |
Gene Length | 2790 bp |
Protein Length | 929 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637827664 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_428622 |
Protein GI | 83594870 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTATTTT TCGCTGATCG AAGCCCCTTC GTCCGTCCTC TGTCAGGATA TGCGCCTGTG ACGCCCGCCG CTCCCCGTTC CACCGGTTCC GCCGCCGCCC CGCCGCCCTC CCCCGCCGTC CTTGATGACC GGAGCGGAAC GGAGGGCGAT GTCACGCCGA TGATGGCCCA GTATCTGGCC GTCAAGGCCG CCCACCCCGA TTGCTTGCTA TTCTATCGCA TGGGCGACTT CTACGAGATG TTTTTCGAGG ACGCGGTCAA GGCCGCCGAG ACGCTGGATA TCGCCCTGAC CAAGCGCGGC CGGCACGCCG GGGCGGATAT TCCCATGTGC GGCGTGCCCA TCCACTCCCA CGAGGGCTAT CTGTCGCGGC TGATCCGCGC CGGCATCAAG GTGGCGATCT GCGAGCAGAT GGAAGATCCC GCCGAGGCCC GGCGCCAGCG CGGCTATAAG GCGGTGGTGC GCCGCGACGT GATCCGCGTG GTGACCGCCG GCACCCTGAC CGAAGACGAA CTGCTTGATG CCCGCGCCCA TAATTATCTG GCCGCCGTCG TCCGTTTGCG CGACGCGGTC GGCATGGCCT GGGTCGATGT CTCGACCGGC GATCTGGTGG CCCAGCCGCT GGCCGAGGCC GATATCGGAC CGGCCCTGGC CCGCCTCGCC CCGGGCGAGG TGCTGATGCC CGAAAAGCTG GCGGGCGATC CGGCGCTGCG CGAGATCCTG GCGCCCTTGG CCGGGCGGAT CAGCCCGCTG CCGGCCAGCC GCTTCGATAG CGAAAACGCC CGCAAGCGGG TGGAGGGCCT GTTCGGGGTC AAGGCGCTCG ATGGCTTCGG TGGCTTCGGC CGGGCCGAGG TGGCGGCGAT CGGCGCCTTG ATCGATTACG TCGAACTGAC CCAGGTCGGC CGCCTGCCCC GGCTGTCGCC GCCGCGCCGG CTGTCGCTTG GCGCCATCCT TGAAATCGAC GGGGCGACCC GGCGCAATCT GGAACTGACC GAAACCCTGG GCGGCGGCCG CAAGGGCAGT CTGCTCGCCC GCATCGATTG CACGGTGACC GGGGCCGGGG CGCGGCTGCT GGCCGAGCGC TTGGCCGCGC CGCTGACCGA TCCCGCGCAG ATCGGCGCCC GCCTTGATGG CGTCGGCTTC CTGGTCAGCG CCGAGCGGGT GCGCGGCGAT CTGCGCGACA CCTTGCGCGG TTGTCCCGAT ATCGCGCGCG CCCTGTCGCG GCTGTCGTTG GGGCGCGGCG GTCCGCGCGA TCTCGCCGCC ATCGGCGAGG CGCTGTCGCG CATTCCGGCG CTGCGCGTGC TGGTGGTCGG CGCCGGCCTG GGCGAGCCGC CGACCGAACT GACCGCCGCC TTGATCGATC TGGGCAGCCA CGAGGGGCTG GTCGATCTGC TGGGCCGGGC CCTTGATGCC GACCTGCCGC TGCTGGCGCG CGATGGCGGT TTCATCCGCC CGGGCTATGA CGCCGGGCTT GATGAACTGC GGGCGCTGCG CGACGAGGGC CGGCGGCTGA TCGCCGGTCT GCAGGCGCGC TATGCCAGCG AAACCGCCAT CCCGGCGCTG AAGATCAAGC ATAACAACGT GCTGGGCTAT TTCATCGAGG TCGCCGCCGG TCGCGCCGAC AAGCTGATGG CCGCCGGCGG CCCCTTCCTC CACCGCCAGA CCCTGGCCTC GCAGGTGCGC TTCACCACGG TGGAATTGTC CGAACTGGAA GACAAGATCC GCGGCGCCGC CGATAAGGCC CTGGCCCTGG AACAGGCGCT GTTCGCCACG TTGTGCGCCG AGGTTCTGGG CTGCGCCGCC GACATCGCCC GCGCCGCCAA CGGGCTGGCC TGCCTGGATG TCGCCGCCGC CCTGGCCGAT CTGGCGGCGC GCGAGCGCTA TGCCCGGCCG GTGGTCGATA ACTCCACCGC CTTTCGCATC CACAAGGGCC GCCATCCGGT GGTCGAGGCG GCTTTGGCCG ATCAGGCCGG CCCGGCCTTC GTCGCCAATG ATTGCGACCT CGCCCCCGAC CAGCGGCTGT GGCTGCTGAC CGGCCCCAAT ATGGCCGGTA AATCCACCTT CCTGCGCCAG AACGCCCTGA TCGCCGTGCT GGCGCAGATG GGATCTTTCG TGCCCGCCGA ATCGGCCGAG ATCGGCGTGA TCGACCGGTT GTTCTCGCGG GTGGGGGCGG CCGACGATCT GGCGCGCGGG CGCTCGACCT TCATGGTCGA AATGGTGGAG ACCGCCGCCA TCCTCAATCA GGCCACCGAA CGCTCGCTGG TGATCCTTGA CGAGATCGGT CGCGGCACCG CCACCTATGA CGGGCTGTCG ATCGCCTGGG CCACGGTCGA GTCGCTCCAC GACGCCACCC GCTGCCGGGC GCTGTTTGCC ACCCATTACC ACGAACTGAC GGCGCTGGCC TCGCGCCTTG ACCGGCTGTC GTGCCACACC TTGCGCATCA AGGAGTGGAA GGATCAGGTG GTCTTCCTGC ACGAGGTCGG GCCCGGGGCG GCCGACCGCT CCTATGGCAT CCATGTCGCC AAGCTGGCCG GGCTGCCCGC CGCGGTGATC GCCCGGGCCG AACAAGTGCT GGCGATCTTG GAAAAGGGCG ATGCGTCGAG CGCGGCGACG CGGCTGGCCG ATGACCTGCC GTTGTTCGCC GCCGCCCGCC CGCGTGCCGG CCTTCCCACC CCGCCGCCCG GCCCCCACCC CCTGGCCGAG GCCCTCAACG CGATCAACCC CGACGAAATG ACCCCGCGCG AGGCCCTTGA CGCCCTTTAC CGGCTGAAGG CGGTGATGAA GCGGGAGTAG
|
Protein sequence | MLFFADRSPF VRPLSGYAPV TPAAPRSTGS AAAPPPSPAV LDDRSGTEGD VTPMMAQYLA VKAAHPDCLL FYRMGDFYEM FFEDAVKAAE TLDIALTKRG RHAGADIPMC GVPIHSHEGY LSRLIRAGIK VAICEQMEDP AEARRQRGYK AVVRRDVIRV VTAGTLTEDE LLDARAHNYL AAVVRLRDAV GMAWVDVSTG DLVAQPLAEA DIGPALARLA PGEVLMPEKL AGDPALREIL APLAGRISPL PASRFDSENA RKRVEGLFGV KALDGFGGFG RAEVAAIGAL IDYVELTQVG RLPRLSPPRR LSLGAILEID GATRRNLELT ETLGGGRKGS LLARIDCTVT GAGARLLAER LAAPLTDPAQ IGARLDGVGF LVSAERVRGD LRDTLRGCPD IARALSRLSL GRGGPRDLAA IGEALSRIPA LRVLVVGAGL GEPPTELTAA LIDLGSHEGL VDLLGRALDA DLPLLARDGG FIRPGYDAGL DELRALRDEG RRLIAGLQAR YASETAIPAL KIKHNNVLGY FIEVAAGRAD KLMAAGGPFL HRQTLASQVR FTTVELSELE DKIRGAADKA LALEQALFAT LCAEVLGCAA DIARAANGLA CLDVAAALAD LAARERYARP VVDNSTAFRI HKGRHPVVEA ALADQAGPAF VANDCDLAPD QRLWLLTGPN MAGKSTFLRQ NALIAVLAQM GSFVPAESAE IGVIDRLFSR VGAADDLARG RSTFMVEMVE TAAILNQATE RSLVILDEIG RGTATYDGLS IAWATVESLH DATRCRALFA THYHELTALA SRLDRLSCHT LRIKEWKDQV VFLHEVGPGA ADRSYGIHVA KLAGLPAAVI ARAEQVLAIL EKGDASSAAT RLADDLPLFA AARPRAGLPT PPPGPHPLAE ALNAINPDEM TPREALDALY RLKAVMKRE
|
| |