Gene RoseRS_4014 details

Gene Information

Locus tag: RoseRS_4014
Symbol:
ID: 5210997
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 5021111
End bp: 5023597
Gene Length: 2487 bp
Protein Length: 828 aa
Translation table: 11
GC content: 64%
IMG OID: 640597603
Product: MutS2 family protein
Protein accession: YP_001278309
Protein GI: 148658104
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
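
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function name cds_lengths is hypothetical and not part of any database tool.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and encodes no residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


print(cds_lengths(5021111, 5023597))  # (2487, 828)
```

For this record, 5023597 - 5021111 + 1 = 2487 bp, which is 829 codons, or 828 residues once the stop codon is excluded.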


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.301308
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTATCG CTCTCCAGAC GCTCGAAACC CTCGAGTTTC CCAAAGTGCG CCTGCAACTC 
GCGCGGTACA CGGCGTTCTC CGCTTCACGT GAACTCGCAC TCAATCTGAC GCCGTCGGTC
GATCCCTTCG AGGTGCGTCG TCGTCTGCGC CTGACCGATG AGGCGCGTCG GCTGCTCGAC
GCGATGCCTG ATGTGAGTAT TGGCGGGGCG CGAGATGTGC GTCCAGCGGT TGGTCTGGCG
CGACGCGGCG GGGTGTGTGA TCCGGAGGCG CTGATCGAAA TCGCCGCCAC ACTTGCAAGC
GCCCGCCGTC TGCGCGCCAC GTTGCGGAAA CTCGACGCAG CATCCTTCCC GCTGCTGCAC
GAGACTGCCG TCGATCTGCC GCTGCTGCCG GAGGTTGAGG ATGCCGTCGC CCGCGCAATT
GGCGAGGACG GTCAGGTGCT CGACAGCGCC AGTCCGAAAC TGGCGCGATT GCGCTCTGAG
GTGCGCACGG CGTTCAACCG CCTCCAGGAA AAACTGCACA ACCTGATCAT GACGCACGGT
GATGTGTTGC AGGAACCGAT CATCACTGTG CGCAACGGGC GATATGTTGT GCCGGTGAAA
GCCACACACC GCCGCGCCAT TCGCGGTCTG GTGCACGACC AGTCGGCCAG CGGCGCGACG
CTGTACATCG AGCCGCTGAC GATCGTGGAA CTGAACAATG CCTGGCGGGA ACTGCAACTC
GCCGAACAGG CAGAGGTCGA ACGCATTCTG GCGGAACTTT CTGCGCTCGT CGGCGATCAT
GCCGGTGCGA TCACTGCTGG CGTCGAGGCG CTGGCGACGC TCGATCTGGC GTTCGCAATG
GCGCAGTATG CGGCGGCGAT GCGCTGCGTA ATGCCGGAGA TTGTCGATCC GCCGCTGCCG
CCTGATGAAC CGTTGCTGCT GCTGACCGCT GCGCGCCATC CGCTCCTCGA TCCGCAAAAG
GTCGTCCCGA TCGATATGCG CCTCGGCGGT CGGTTTCGCC TGCTGCTGAT CACCGGTCCC
AATACCGGCG GCAAGACCGT GGCGCTGAAA ACAACCGGTC TGCTGGCGCT AATGGCGCAG
GCGGGGATGC ACATCCCTGC ATCCCAACCG TCGCGCCTGC CGGTGTTCGC GCAAATCTTC
GCCGACATTG GCGATGAGCA GAGCATCGAG CAGAGCCTTT CGACCTTCTC GTCGCATATG
ACGAATATCA TCCGCATTCT GCGTGCGCTT GAAGACGCAC CGGACGTGGC GCCTGCCGAA
ACATCCGTCT CCGGCTCAAC CCAAACAATG CCTCCTGACA CGCAGCGCTT GGGGCGCATG
CCAGCGCTGG TGCTGCTCGA TGAACTCGGC GCCGGCACCG ATCCGGTCGA GGGTGCGGCG
CTGGCGCGCG CGATCATCGA GCGACTGCTC GAACTCGGTG TGCTGGGTGT GGCAACGACG
CACTACGCTG AATTGAAGGC GTTCGCCTAT GCCACACCCG GCGTTGAGAA TGCGTCGGTC
GAGTTCGACG TCGAAACACT TGCACCAACC TACAAACTGA CGATCGGTTT ACCAGGGCGC
TCGAATGCAC TGGCGATTGC GGCGCGCCTG GGACTCGCCC CCGACCTGGT TGAGCGCGCC
CGTGCAACAA TGGCGCGAGA GGATGTGCAG GTCGAAGATC TGCTGGCGGG TATCCATCGT
GAGCGCGATG CTGCGGCGGC AGAGTTGCAG CGAGCAATGG AAGTGCGCGC CGATGCCGAG
AAGTATCGTG ACCGACTTGC CGCTGAGTTG CGCGCATTTG AGGAGCAGCG TGATGAGGCC
TGGCAGGCTG CGCGTGAAGC GATCGAGGCA GAACTGCGCC AGGTGCGCAA CGAGGTGCGC
CGGTTGCGCG ACGAGTCTCG CTCGGTAGCG GCGTCGCGCC GATGGCTCGA AGAGGCGGAA
CGGCGTTTGC AGGAAGCGCG TGCAAGCCTG CCTGCTGTTC CGCCCGGTAA ACCCGCCGGA
CACCCGGCGC CCGCTGCTGA ACAGGTCGCG CGTCTGCAAC CCGGTGATGT CGTCCGCGTG
CGATCGGTCG GGTTAACCGG AGAAATCCTC TCGATCAACG AAGAAGATCA GACAGCCGAG
GTGCAGGTTG GAGGCTTTCG GATGCAGGCC GATCTCGCCG AACTGACGCG CGAAAAGCGT
GCGGCCGGGA ACGGTGGCGG TCAACCTGCC CGTCCCGCCT ATGAGTCGCG TGGCACATCG
CTTCCGGCGC CGCGTGACGT GTCGCTCGAA CTCGATATGC GGGGATGGCG CGCGGCTGAT
GTCGGCGAAC GACTCGACCG GTATTTGAAC GACGCTTACC TGGCTGGCCT GCCGTGGGTT
CGTATCATTC ACGGTAAAGG CACCGGCGCG CTGCGTCAGG CAGTACGCGA TACGCTCAAA
GACCACAAAC TGGTCGCGTC GTTCAGCAGC GCAAGCGCCA CAGAAGGCGG TGAGGGCGTT
ACGATTGTGC GCCTGCAGGA ACGGTGA
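
The GC content field in the gene information can in principle be recomputed from the gene sequence shown here. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks used on this page; gc_fraction is a hypothetical helper name, and only the first block of the sequence is used as input.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    # Remove the spaces and line breaks that separate the 10-base blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


first_block = "GTGGCTATCG"  # first 10 bases of the gene sequence
print(gc_fraction(first_block))  # 0.6
```

Passing the full gene sequence to the same function gives the value for the whole gene.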
 
Protein sequence
MAIALQTLET LEFPKVRLQL ARYTAFSASR ELALNLTPSV DPFEVRRRLR LTDEARRLLD 
AMPDVSIGGA RDVRPAVGLA RRGGVCDPEA LIEIAATLAS ARRLRATLRK LDAASFPLLH
ETAVDLPLLP EVEDAVARAI GEDGQVLDSA SPKLARLRSE VRTAFNRLQE KLHNLIMTHG
DVLQEPIITV RNGRYVVPVK ATHRRAIRGL VHDQSASGAT LYIEPLTIVE LNNAWRELQL
AEQAEVERIL AELSALVGDH AGAITAGVEA LATLDLAFAM AQYAAAMRCV MPEIVDPPLP
PDEPLLLLTA ARHPLLDPQK VVPIDMRLGG RFRLLLITGP NTGGKTVALK TTGLLALMAQ
AGMHIPASQP SRLPVFAQIF ADIGDEQSIE QSLSTFSSHM TNIIRILRAL EDAPDVAPAE
TSVSGSTQTM PPDTQRLGRM PALVLLDELG AGTDPVEGAA LARAIIERLL ELGVLGVATT
HYAELKAFAY ATPGVENASV EFDVETLAPT YKLTIGLPGR SNALAIAARL GLAPDLVERA
RATMAREDVQ VEDLLAGIHR ERDAAAAELQ RAMEVRADAE KYRDRLAAEL RAFEEQRDEA
WQAAREAIEA ELRQVRNEVR RLRDESRSVA ASRRWLEEAE RRLQEARASL PAVPPGKPAG
HPAPAAEQVA RLQPGDVVRV RSVGLTGEIL SINEEDQTAE VQVGGFRMQA DLAELTREKR
AAGNGGGQPA RPAYESRGTS LPAPRDVSLE LDMRGWRAAD VGERLDRYLN DAYLAGLPWV
RIIHGKGTGA LRQAVRDTLK DHKLVASFSS ASATEGGEGV TIVRLQER
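
The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11, GTG is an accepted start codon and is read as methionine when it initiates translation. The sketch below illustrates this on the first seven codons. It is an assumption-laden illustration, not the database's own method: the codon table is built from the standard amino-acid assignments, the start-codon set is limited to three common bacterial starts (table 11 lists more), and the names translate_cds and START_CODONS are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order (shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, reading an accepted start codon as M."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


print(translate_cds("GTGGCTATCG CTCTCCAGAC G"))  # MAIALQT
```

The output matches the first seven residues of the protein sequence above.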