Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmar_0730 |
Symbol | |
ID | 8567368 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | + |
Start bp | 843836 |
End bp | 846250 |
Gene Length | 2415 bp |
Protein Length | 804 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | MutS2 family protein |
Protein accession | YP_003290016 |
Protein GI | 268316297 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCCCA CCGACGCAAC GCTGCACGGC TATCCCGACA CGCTTGAGAC GCGGCTGGGC TTCGACGTGG TGCGCGAGGC GCTGAAGGCG CAGACGCGCA GTCCACTGGG CGCCGAGGCC GTCGCGCAGC TCCGGCCGCT GCGGGATCTG GAGGCGGTCA GGGCCGAGCT GGCCCGTGTC GAGGAGTTGC AGCAGGCGTT GCGTTTCGAC GATCCTGTTC CGCTCGAAAA CCTGATCGAT CTACGGCCCT GGCTGGTGCA GGCCGCACCC GAAGGGTCCC GCCTCGAAGG CGAGGCGCTG GAGGCCGTAC GTCGCGTGCT GATGACGGTG CGCCTGCTGG CCCGCTACTT CCGCGAGCGG CGGAGCAAGT ATCCGGCGCT GGCCGTGCTG GCCGAACGCC TGACGCCGCT TCCGGAGCTG GAGCAGCGGC TGGCCGCCGT GGTGGACGAA GACGGCCAGG TGCGCGACGA CGCGTCGCCG GAACTGCGGC GCCTCCGCCG CCAGCTGGCC CTGCAGCGTC AGCGCCTGCG CGAAGCGCTG CTCGAAGCGC TCCGCGAGGC CATCCGGCAG GGCTACGCGA CCGAGGACCA GCCCACCATT CGCAACGGCC GCATGGTGAT TCCCGTGCGG GCCGAGGCGC GCCGCAAGGT ACCGGGCTTC GTGCACGACA CCTCGGCCTC CGGGCAGACG GTCTACATTG AGCCGGCCTC CTGTCTGGAC CTGAACAACG CGGTGCGCGA GCTGGAGCTG GCCGAACTGC GCGAAATCGA CCGCATCCTG CGCGAGGCCA CCGGCTGGCT CCGTCCGCAT CTTCCCGCGC TGAAGGCTTC GCTCGAAGTG CTGGGCCACT TCGATCTGCT GCAGGCCAAG GCGCGGCTGG CCGAACTGAT GGACGCCCAT GTGCCCGAAG TGGCCGCCGA CGGCGTGATC GAGCTGAAGC GCGCCCGCAA CCCGGTCCTT GTGCTGCACT TTCGGCGCCT TCAGGAGACG ACCGGTGAAG TGCGCGAGGT GGTGCCGCTC GACCTGACGC TGGGCCGCAC CTTCCACACG CTGATCATCA CCGGACCGAA CGCGGGCGGC AAGACCGTCG CCATGAAAAC CGTCGGCCTG CTGGTGCTGA TGCTGGCCTG CGGGCTTCCC ATTCCCGCCG ACCCGGCCTC GCACGTGTCG CTCTTCGATC AATTGCTGAT CGACATCGGC GACGAGCAGT CCGTGGAGGC CGACCTGTCC ACTTTCAGCG CGCACATGAC GCACATGGCC TACATGTTGG CCCGGGCCGA CGCGCGCACG CTCATTCTGA TCGACGAGGC GGGAACGGGG ACCGATCCGG ACGAAGGGGC GGCGCTGGCG CAGGCCATCC TCGAAGAGCT GATGCGGCGC GGCGCGCGCA CGATCGCCAC GACCCACCAC GGCGCGCTCA AGGTCTTCGC CTACGAAACC GAGGGCGTCG AGAACGGCTC CATGCAGTTC GACCAGGCCA CCCTCAGCCC GACGTATCGC TTCCAGCTCG GAGTGCCCGG TTCATCCTAT GCCTTCGAGA TCGCCCGGCG CATGGGGATT CCCGAACCGG TGCTGGCGCG GGCGCAGGCG CTCGTGGGTC GGCAGCAGGT GGCGCTGGAG GCGCTTGTGC GGACGCTGGA GGCGCGCAAC CAGGAGCTGG AAGCCCGGCT GGCAGCGCTG ACCGAAGAGC AGGCACGTCT GGAGCAGCTC CGGCGCGAAT ACGAGGCGCG TCGGGCGCAA CTCGAAGCCG AGACGGAGGC GATCCGGCAG CGCGCTCTGG AGGAGGCCGA ACAGCTGCTG AAAGAGGCCA ACGCGCGCAT CGAACGCACC ATCCGGGAAA TCAAAGAGGC GCAGGCCGAG CGGGAGGCCA CCCGGGCAGC GCGAGAGGCG CTGGAGCGTT TCCGCCGTCG CCTGCACGAG CAGCGGCGCC GGGCCCGTCC GAAGCCATCG GCCGCCGAGG AGCCGCGCTC GACGCTGGCC GTGGGCGATC AGGTGGTGCT CGACGAGGGC GGCACGCCGG CCGAAGTGCT GGCGCTGGAG GACGACGAGG CACTGATCGC CGTGGGCTCG CTGAAAATGC GCGTGCCGGT GAGCCGGTTG CGGCGGCTGA ACCGGGCGGC GCGGCGTGCG CAAACGCGAA CGACCACAGG CGCGACGCTT CCGGCCCTTC AGGCCCGCAC GCGCATCGAC GTGCGCGGCT ACCGGGTGGA TGAGGCGTTG CAGGCGGTCG AGCGACTCAT CGACGAGGCG GTGGCCAGTG GTGTGCGCGA GGTGGAAGTG CTGCACGGCA AAGGTACCGG CGCCCTGCGT CAGGCCATTC GCTCTTATCT GCAGGGCCGC CCCGAAGTGG AGCGCTTTGA AGATGCCCCG TGGGAGCAGG GCGGCCCCGG TGTGACCCGA ATCTGGCTGA AGTAA
|
Protein sequence | MSPTDATLHG YPDTLETRLG FDVVREALKA QTRSPLGAEA VAQLRPLRDL EAVRAELARV EELQQALRFD DPVPLENLID LRPWLVQAAP EGSRLEGEAL EAVRRVLMTV RLLARYFRER RSKYPALAVL AERLTPLPEL EQRLAAVVDE DGQVRDDASP ELRRLRRQLA LQRQRLREAL LEALREAIRQ GYATEDQPTI RNGRMVIPVR AEARRKVPGF VHDTSASGQT VYIEPASCLD LNNAVRELEL AELREIDRIL REATGWLRPH LPALKASLEV LGHFDLLQAK ARLAELMDAH VPEVAADGVI ELKRARNPVL VLHFRRLQET TGEVREVVPL DLTLGRTFHT LIITGPNAGG KTVAMKTVGL LVLMLACGLP IPADPASHVS LFDQLLIDIG DEQSVEADLS TFSAHMTHMA YMLARADART LILIDEAGTG TDPDEGAALA QAILEELMRR GARTIATTHH GALKVFAYET EGVENGSMQF DQATLSPTYR FQLGVPGSSY AFEIARRMGI PEPVLARAQA LVGRQQVALE ALVRTLEARN QELEARLAAL TEEQARLEQL RREYEARRAQ LEAETEAIRQ RALEEAEQLL KEANARIERT IREIKEAQAE REATRAAREA LERFRRRLHE QRRRARPKPS AAEEPRSTLA VGDQVVLDEG GTPAEVLALE DDEALIAVGS LKMRVPVSRL RRLNRAARRA QTRTTTGATL PALQARTRID VRGYRVDEAL QAVERLIDEA VASGVREVEV LHGKGTGALR QAIRSYLQGR PEVERFEDAP WEQGGPGVTR IWLK
|
| |