Gene Rcas_0062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0062 
Symbol 
ID5537521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp76893 
End bp79373 
Gene Length2481 bp 
Protein Length826 aa 
Translation table11 
GC content63% 
IMG OID640892228 
ProductMutS2 family protein 
Protein accessionYP_001430218 
Protein GI156740089 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.21825 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.439412 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTATCG CCCGCCAGAC ACTCGACACC CTCGAGTTTC CCAAAGTTCG CCAGCATCTC 
GCGCGTTACG CCGCATTCTC GGTTTCGCGC GAGATGGCGC TGAACCTGAT ACCATCGGTC
GATCCGGTCG ATGTGCGGCG GCGTTTACGC CGGACTGATG AAGCGCGTCG TTTGCTCGAC
GAGATGCCGG ATCTCACCAT CGGCGGCGCG CGTGATGTGC GCCCTGCCGC GGGCCTGGCG
CGGCGCGGCG GGGTGTGTGA TGCGACGACA CTCCTCGAAA TCGCTGCAAC ACTTGCCGGT
GCGCGGCGTC TGCGCGCGAC GCTGCTGAAA CTCGATCCAG ACCATTTTCT GCTCCTCCGC
GAAATTGCCG CCGATCTGCC AGCGCTGCCG GCAATCGAAG ATGCCATTGG GCGCGCCATA
GGCGACGATG GTCAGGTGCT CGATAGCGCC AGCCCAAAAC TGGCGCGCTT GCGCGCTGAA
GTGCGGATTG CCTTCAATCG TTTGCAGGAA AAACTTCACA ATCTTATCAC AATCCACAGC
GATGCGTTGC AAGAGCCGAT CATCACCGTG CGCAACGGGC GCTATGTCGT TCCGGTGAAA
GCCACACACC GCCGCGCGAT TCGCGGGTTG GTGCACGATC AATCGGCAAG CGGCGCCACA
CTCTACATTG AGCCATTAAC CATCGTGGAT CTGAACAATG CCTGGCGCGA AGCGCAACTC
GCCGAGCAGA AAGAAGTCGA ACGCATTCTG GCGGAACTCT CGGCGCAGGT CGGCGATCAC
GCCGATGCAA TCGTCACCGG CGTCGAGTCG CTGGCAACGC TTGATCTGGC ATTTGCTATG
GCGCGCTATG CCGTTGCTAT GCGCTGTGTG ATGCCGGAGA TCGTTGATGC GCCGCCATCG
CCTGATGAAC CGCTGCTCCT GCTCACTGCC GCACGCCATC CGTTGATCGA TCCGCAGCAG
GTCGTGCCAA TCGATATGCG CCTCGGCGGT TCGTTCCGCA TACTGCTGAT CACCGGTCCC
AACACCGGCG GCAAAACCGT CGCGCTTAAG ACAACTGGGC TGCTGGCGTT GATGGCGCAG
GCGGGGATGC ATGTTCCGGC GGCTCATCCA TCGCGCGTGC CGGTCTTTAA GCAGATTTTC
GCCGACATCG GCGATGAACA GAGCATCGAG CAGAGTCTTT CGACCTTTTC ATCGCACATG
ACGAATATCA TCCGCATTCT GCGGGCGCTT GAAGGCGCAC CGAACGAGGA GAGCGCCGAA
GCGTTCGCGG CGGACCCAAC ACCGACCGAA CCGGCATGTG CGGAACGCCT GCCGGCGCTG
GTGTTGCTCG ATGAACTCGG CGCCGGCACC GATCCGGTGG AAGGTTCGGC GCTGGCGCGC
GCGATTATCG AGCGCTTGCT CGAACTTGGC GTGCTCGGCG TCGCCACCAC GCACTACGCG
GAACTGAAAG CGTTCGCCTA TGCCACACCC GGCGTCGAAA ACGCCTCGGT CGAGTTCGAT
GTTGAAACCC TGGCGCCAAC CTATAAACTG ACGATTGGTT TGCCAGGGCG CTCGAATGCG
CTGGCGATCG CCGCACGTTT GGGTCTGTCG CCGGCATTGA TCGAGCGTGC CCGCGCCAGT
ATGGCGCGCC AGGATGTGCA GGTCGAGGAC CTGCTGGCGG GCATCCATCG TGAACGCGCC
GCCGCCGCAG CCGAGTTGCA GCGCGCGATG GAAGTGCGCG CCGATGCCGA GAAGTACCGC
GAACGACTTG CCGCCGAGTT GCGCGAATTC GAGGCGCGGC GCAGCGAAGC CTGGCAATCG
GCGCGCGATG AGATCGAAGC AGAGTTGCGC CAGGCGCGCA GCGAAGTGCG GCGGTTGCGC
GACGAGTTTC GTTCGGTGTC GGTTTCGCGC CGGTGGCTCG AAGAAGCGGA ACAACGCCTG
CACGAAGTGC GCGCCAGCCT GCCCGAAACC CCGACCGGCG CACTTGCCGA ACGAGCGGTG
ACAACCGTTG TCGAGCAGGG TCCGCGCCCG CTGCACCCCG GTGACGTTGT ACGGGTGCGT
TCTGTAGGAT TAATCGGCGA GATTCTCTCA ATCGATGAAG AAGAGCAGAC TGCCGAGGTG
CAGGTCGGCG GCTTCCGAAT GCAGGCCGAT CTTGCCGAAC TGACGCGCGA AAAGCGCGGC
GATGGAAAAG CAGAGCAGCC TGACCGCCCG GCATATGAGT CGCGTGGCGC GTCGATCCCT
GCGCCGCGTG ATGTGTCGCT CGAACTCGAT ATGCGCGGCT GGCGGGCTGC TGATGCGCGT
GACCGGCTTG ATCGCTATCT GAATGATGCC TATCTTGCCG GATTGCCATG GGTGCGCATC
ATCCACGGCA AAGGTACCGG TGCGCTGCGG CAGGCAGTGC GCGATCTGCT CAAAGAGCAT
AAACTGGTCG CATCGTTCAG CAGCGCCAGT GCAGCGGAAG GCGGCGAAGG CGTGACCATC
GTGCGCCTTC AGGAGCGGTG A
 
Protein sequence
MAIARQTLDT LEFPKVRQHL ARYAAFSVSR EMALNLIPSV DPVDVRRRLR RTDEARRLLD 
EMPDLTIGGA RDVRPAAGLA RRGGVCDATT LLEIAATLAG ARRLRATLLK LDPDHFLLLR
EIAADLPALP AIEDAIGRAI GDDGQVLDSA SPKLARLRAE VRIAFNRLQE KLHNLITIHS
DALQEPIITV RNGRYVVPVK ATHRRAIRGL VHDQSASGAT LYIEPLTIVD LNNAWREAQL
AEQKEVERIL AELSAQVGDH ADAIVTGVES LATLDLAFAM ARYAVAMRCV MPEIVDAPPS
PDEPLLLLTA ARHPLIDPQQ VVPIDMRLGG SFRILLITGP NTGGKTVALK TTGLLALMAQ
AGMHVPAAHP SRVPVFKQIF ADIGDEQSIE QSLSTFSSHM TNIIRILRAL EGAPNEESAE
AFAADPTPTE PACAERLPAL VLLDELGAGT DPVEGSALAR AIIERLLELG VLGVATTHYA
ELKAFAYATP GVENASVEFD VETLAPTYKL TIGLPGRSNA LAIAARLGLS PALIERARAS
MARQDVQVED LLAGIHRERA AAAAELQRAM EVRADAEKYR ERLAAELREF EARRSEAWQS
ARDEIEAELR QARSEVRRLR DEFRSVSVSR RWLEEAEQRL HEVRASLPET PTGALAERAV
TTVVEQGPRP LHPGDVVRVR SVGLIGEILS IDEEEQTAEV QVGGFRMQAD LAELTREKRG
DGKAEQPDRP AYESRGASIP APRDVSLELD MRGWRAADAR DRLDRYLNDA YLAGLPWVRI
IHGKGTGALR QAVRDLLKEH KLVASFSSAS AAEGGEGVTI VRLQER