Gene Information |
Locus tag | Rcas_0062 |
Symbol | |
ID | 5537521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 76893 |
End bp | 79373 |
Gene Length | 2481 bp |
Protein Length | 826 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640892228 |
Product | MutS2 family protein |
Protein accession | YP_001430218 |
Protein GI | 156740089 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
|
|
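The coordinates and lengths listed above are mutually consistent, which a short calculation can confirm. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Rcas_0062.
length_bp = gene_length(76893, 79373)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")  # 2481 bp; 826 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.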
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.21825 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.439412 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTATCG CCCGCCAGAC ACTCGACACC CTCGAGTTTC CCAAAGTTCG CCAGCATCTC GCGCGTTACG CCGCATTCTC GGTTTCGCGC GAGATGGCGC TGAACCTGAT ACCATCGGTC GATCCGGTCG ATGTGCGGCG GCGTTTACGC CGGACTGATG AAGCGCGTCG TTTGCTCGAC GAGATGCCGG ATCTCACCAT CGGCGGCGCG CGTGATGTGC GCCCTGCCGC GGGCCTGGCG CGGCGCGGCG GGGTGTGTGA TGCGACGACA CTCCTCGAAA TCGCTGCAAC ACTTGCCGGT GCGCGGCGTC TGCGCGCGAC GCTGCTGAAA CTCGATCCAG ACCATTTTCT GCTCCTCCGC GAAATTGCCG CCGATCTGCC AGCGCTGCCG GCAATCGAAG ATGCCATTGG GCGCGCCATA GGCGACGATG GTCAGGTGCT CGATAGCGCC AGCCCAAAAC TGGCGCGCTT GCGCGCTGAA GTGCGGATTG CCTTCAATCG TTTGCAGGAA AAACTTCACA ATCTTATCAC AATCCACAGC GATGCGTTGC AAGAGCCGAT CATCACCGTG CGCAACGGGC GCTATGTCGT TCCGGTGAAA GCCACACACC GCCGCGCGAT TCGCGGGTTG GTGCACGATC AATCGGCAAG CGGCGCCACA CTCTACATTG AGCCATTAAC CATCGTGGAT CTGAACAATG CCTGGCGCGA AGCGCAACTC GCCGAGCAGA AAGAAGTCGA ACGCATTCTG GCGGAACTCT CGGCGCAGGT CGGCGATCAC GCCGATGCAA TCGTCACCGG CGTCGAGTCG CTGGCAACGC TTGATCTGGC ATTTGCTATG GCGCGCTATG CCGTTGCTAT GCGCTGTGTG ATGCCGGAGA TCGTTGATGC GCCGCCATCG CCTGATGAAC CGCTGCTCCT GCTCACTGCC GCACGCCATC CGTTGATCGA TCCGCAGCAG GTCGTGCCAA TCGATATGCG CCTCGGCGGT TCGTTCCGCA TACTGCTGAT CACCGGTCCC AACACCGGCG GCAAAACCGT CGCGCTTAAG ACAACTGGGC TGCTGGCGTT GATGGCGCAG GCGGGGATGC ATGTTCCGGC GGCTCATCCA TCGCGCGTGC CGGTCTTTAA GCAGATTTTC GCCGACATCG GCGATGAACA GAGCATCGAG CAGAGTCTTT CGACCTTTTC ATCGCACATG ACGAATATCA TCCGCATTCT GCGGGCGCTT GAAGGCGCAC CGAACGAGGA GAGCGCCGAA GCGTTCGCGG CGGACCCAAC ACCGACCGAA CCGGCATGTG CGGAACGCCT GCCGGCGCTG GTGTTGCTCG ATGAACTCGG CGCCGGCACC GATCCGGTGG AAGGTTCGGC GCTGGCGCGC GCGATTATCG AGCGCTTGCT CGAACTTGGC GTGCTCGGCG TCGCCACCAC GCACTACGCG GAACTGAAAG CGTTCGCCTA TGCCACACCC GGCGTCGAAA ACGCCTCGGT CGAGTTCGAT GTTGAAACCC TGGCGCCAAC CTATAAACTG ACGATTGGTT TGCCAGGGCG CTCGAATGCG CTGGCGATCG CCGCACGTTT GGGTCTGTCG CCGGCATTGA TCGAGCGTGC CCGCGCCAGT ATGGCGCGCC AGGATGTGCA GGTCGAGGAC CTGCTGGCGG GCATCCATCG TGAACGCGCC GCCGCCGCAG CCGAGTTGCA GCGCGCGATG GAAGTGCGCG CCGATGCCGA GAAGTACCGC GAACGACTTG CCGCCGAGTT GCGCGAATTC GAGGCGCGGC GCAGCGAAGC CTGGCAATCG 
GCGCGCGATG AGATCGAAGC AGAGTTGCGC CAGGCGCGCA GCGAAGTGCG GCGGTTGCGC GACGAGTTTC GTTCGGTGTC GGTTTCGCGC CGGTGGCTCG AAGAAGCGGA ACAACGCCTG CACGAAGTGC GCGCCAGCCT GCCCGAAACC CCGACCGGCG CACTTGCCGA ACGAGCGGTG ACAACCGTTG TCGAGCAGGG TCCGCGCCCG CTGCACCCCG GTGACGTTGT ACGGGTGCGT TCTGTAGGAT TAATCGGCGA GATTCTCTCA ATCGATGAAG AAGAGCAGAC TGCCGAGGTG CAGGTCGGCG GCTTCCGAAT GCAGGCCGAT CTTGCCGAAC TGACGCGCGA AAAGCGCGGC GATGGAAAAG CAGAGCAGCC TGACCGCCCG GCATATGAGT CGCGTGGCGC GTCGATCCCT GCGCCGCGTG ATGTGTCGCT CGAACTCGAT ATGCGCGGCT GGCGGGCTGC TGATGCGCGT GACCGGCTTG ATCGCTATCT GAATGATGCC TATCTTGCCG GATTGCCATG GGTGCGCATC ATCCACGGCA AAGGTACCGG TGCGCTGCGG CAGGCAGTGC GCGATCTGCT CAAAGAGCAT AAACTGGTCG CATCGTTCAG CAGCGCCAGT GCAGCGGAAG GCGGCGAAGG CGTGACCATC GTGCGCCTTC AGGAGCGGTG A
|
Protein sequence | MAIARQTLDT LEFPKVRQHL ARYAAFSVSR EMALNLIPSV DPVDVRRRLR RTDEARRLLD EMPDLTIGGA RDVRPAAGLA RRGGVCDATT LLEIAATLAG ARRLRATLLK LDPDHFLLLR EIAADLPALP AIEDAIGRAI GDDGQVLDSA SPKLARLRAE VRIAFNRLQE KLHNLITIHS DALQEPIITV RNGRYVVPVK ATHRRAIRGL VHDQSASGAT LYIEPLTIVD LNNAWREAQL AEQKEVERIL AELSAQVGDH ADAIVTGVES LATLDLAFAM ARYAVAMRCV MPEIVDAPPS PDEPLLLLTA ARHPLIDPQQ VVPIDMRLGG SFRILLITGP NTGGKTVALK TTGLLALMAQ AGMHVPAAHP SRVPVFKQIF ADIGDEQSIE QSLSTFSSHM TNIIRILRAL EGAPNEESAE AFAADPTPTE PACAERLPAL VLLDELGAGT DPVEGSALAR AIIERLLELG VLGVATTHYA ELKAFAYATP GVENASVEFD VETLAPTYKL TIGLPGRSNA LAIAARLGLS PALIERARAS MARQDVQVED LLAGIHRERA AAAAELQRAM EVRADAEKYR ERLAAELREF EARRSEAWQS ARDEIEAELR QARSEVRRLR DEFRSVSVSR RWLEEAEQRL HEVRASLPET PTGALAERAV TTVVEQGPRP LHPGDVVRVR SVGLIGEILS IDEEEQTAEV QVGGFRMQAD LAELTREKRG DGKAEQPDRP AYESRGASIP APRDVSLELD MRGWRAADAR DRLDRYLNDA YLAGLPWVRI IHGKGTGALR QAVRDLLKEH KLVASFSSAS AAEGGEGVTI VRLQER
|
| |
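The sequences above are printed in blocks of ten characters, and the gene starts with GTG although the protein starts with M. The sketch below shows one way to strip the block spacing, compute a GC fraction, and translate with NCBI translation table 11, where GTG is an accepted start codon and an initiator codon is read as methionine. It is a sketch under stated assumptions rather than the database's own pipeline: the helper names are hypothetical, and only the first 30 bases of the gene are used as input.

```python
BASES = "TCAG"
# Amino acids of NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; a table 11 start codon in first position becomes M."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = "GTGGCTATCG CCCGCCAGAC ACTCGACACC"
print(translate(prefix))  # MAIARQTLDT
```

The output matches the first ten residues of the protein sequence in the record. Applied to the full gene sequence, the same function would emit a trailing `*` for the final TGA stop codon, which the protein sequence field omits.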