Gene Rxyl_0292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_0292 
Symbol 
ID4117819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp299533 
End bp301887 
Gene Length2355 bp 
Protein Length784 aa 
Translation table11 
GC content72% 
IMG OID638035081 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_643080 
Protein GI108803143 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.518405 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGGAC GCTACGCCGA ACTCAAGGCC CAGCTTCCGC CCGGGACGAT CCTCTTCTAC 
CAGGTCGGCA CCTTCTTCGA GACCTTCGAG GAGGACGCCA AGACCGTCTC CCGCGAGCTC
TCGCTGCGGC TCACCAGCCG GGAGGCCGCC GGGGAGGGGA GGGTGCCGCT GGCGGGGGTC
CCCGGGCACG CCCTGCAGGA GCACGTGGCC GCGCTCTTGA GGAAGGGCCA CTCCGTGGCC
ATCGCCGAGC AGCGGCAGCA CCCCACCAAG CCGCGGCAGT TCACCCGCGA GATCACCCAG
ATTTTGACCC CCGGCACCGT GATCGAGGAC AACGTCCTCT CCGCCGGGAG GTCCAACTAC
CTGGCCACCT TCGTCGTCCG GGACGGGAAG GCCGGGATAG CGGTGGCGGA GGCCTCCACC
GGGGAGTTCT CCGGTACGGT GGTGCCGGAG GCGGAGCTGC CGGCGGAGCT GGAGCGCTGG
TCGCCGCGGG AGGTGGTGGT CCCCGAGCGG ACGGCCGCCG AGGATCTGCC CCGGCTGGAG
GCGCGGGTGA GCACCGCCCC CCGCTGGACC TTCGAGCCCT CCGCGGGGGA GCAGGCCCTG
CGGCAGCACT TCGGGGTGGC GAGCCTCAAG GGCTACGGGC TCGACGGGAG CCCGCAGCTC
GTGGCCGCGG CGGGGGCCCT CATCCGCTAC CTGAGCACCC TGCGCGGCGG CAGCCCGCCG
GAGCAGATCG TCTCCTTCAG GCGCTACGAC CCCGGCCAGG CGATGCTGCT GGACGCCGCC
ACCCGCAGGA ACTTGGGGCT GGAGGAGCTG ATCTCCACGG TGGACCGTAC CAGGACCCCG
ATGGGCCAGC GGACCCTGCG GCGCTGGCTG GAGCGGCCGC TCCTGGAGGC CTCCCGGATC
AACCAGCGGC TCGAGGCGGT GGACGCCCTC TTCCCGGACT ACATGCTGCG GGAGGAGGTG
CGCGAGCACC TCGGCGGCAT CCCGGACATC GAGCGGATCG CCACCAGGAT CGTGCGGCTC
TCGGCCTCCC CGGGCGACCT GCTCGCCCTG CGGGGGGCGC TCGAGGCGCT GGGGCCGCTG
CGGCGGGCGC TCGCCCCCGC CGCGCAGCGC AGCGAGCTCC TCCGGCGGGC GCTCTCGGCG
ATGGAGGAGC CGCCGGGGGT CAGGGAGCTC ATCGCCGAGG CCATCTCCGA GGAGGAGGGC
GAGATCATCC GCCCCGGCTA CTCCGCCGAG CTGGACGAGG CCCGCTCCTT CCGGGACGGG
GCCCACGAGT GGCTCACCCG CTTCGAGGCC GAGGAGCGCC TGAAGACCGG GCTCAAGACC
CTCAAGGTCG GCTACCGGGA CGGCGAGGGG TACTTTATAG AGGTGGGGGG GAAGGAGGCC
CACCGGGTTC CGCCCCACTA CGAGCACCGC AAGGCCCTCA AGCACAACGC CCGCTACGTC
ACCGTAGAGC TCAAGGAGCA CGAGTCCAGG ATGCTCACCG CCCGCGAGGA GGTCGAGCGG
CTGGAGCGCA GGATCCTGGG CGAGATCCGG GCGGCCGTCA AGGAGGCGGC GCCGCAGCTG
CAGCGGATCG CCCGGGCGGT GGCGGTGGTG GACGTGGTCG CCTCGTTCGC CGCCGCCGCG
GCCGAGCTGC GCTACTGCCG GCCGCGGGTC GCGGAGGAGC GGGGGATCCG GATCGTCTCC
GGGCGCCACC CGGTGGTCGA GCACGCCACC GAGACGCCCT TCGTGCCCAA CGACGCCCGG
ATAGACGGCG GCTCCCGCCT GCAGATCATC ACCGGCCCCA ACATGGCCGG GAAGTCCGTG
TATCTGCGGC AGGTGGCCCT GATAGTCCTG CTGGCCCAGA CCGGCTCCTA CGTGCCCGCG
GAGGAGGCCT CGCTGGGGGT CGTCGACCGG ATCTTCACCC GGGTCGGCGC GGAGGACCGG
CTGGCGAGCG GGGAGTCCAC CTTCATGGTG GAGATGACGG AGGCGGCGGG TATCCTGAAC
GGCGCCACGG AGCGCAGCCT GGTGATCCTG GACGAGGTGG GCCGGGGGAC CTCCACCTAC
GACGGGATGA GCCTGGCCTG GGCGATCGCC GAGTATCTGC ACGATGACGT GCAGGCCCTC
ACGCTCTTCG CCACCCACTA CCACGAGCTG ACACGGCTCG CCGACTCGCT CCCCGGCTGC
CGCAACCTGA AGGCGGTGGT TGAGGAGGTG GGCGGGGAGA TCGTGTTCCT GCACAGGATT
GAGCCCGGCG CCGAGTCCTC CTCCTACGGG GTGCACGTGG CCCGGCTGGC CGGGCTGCCG
CCGCGGGTGA CCGACCGGGC CCAGGAGATC CTCTCCCGCC TGGAGGCCGA AGGGGTGAAG
GGGTTCGCCG GGTGA
 
Protein sequence
MLGRYAELKA QLPPGTILFY QVGTFFETFE EDAKTVSREL SLRLTSREAA GEGRVPLAGV 
PGHALQEHVA ALLRKGHSVA IAEQRQHPTK PRQFTREITQ ILTPGTVIED NVLSAGRSNY
LATFVVRDGK AGIAVAEAST GEFSGTVVPE AELPAELERW SPREVVVPER TAAEDLPRLE
ARVSTAPRWT FEPSAGEQAL RQHFGVASLK GYGLDGSPQL VAAAGALIRY LSTLRGGSPP
EQIVSFRRYD PGQAMLLDAA TRRNLGLEEL ISTVDRTRTP MGQRTLRRWL ERPLLEASRI
NQRLEAVDAL FPDYMLREEV REHLGGIPDI ERIATRIVRL SASPGDLLAL RGALEALGPL
RRALAPAAQR SELLRRALSA MEEPPGVREL IAEAISEEEG EIIRPGYSAE LDEARSFRDG
AHEWLTRFEA EERLKTGLKT LKVGYRDGEG YFIEVGGKEA HRVPPHYEHR KALKHNARYV
TVELKEHESR MLTAREEVER LERRILGEIR AAVKEAAPQL QRIARAVAVV DVVASFAAAA
AELRYCRPRV AEERGIRIVS GRHPVVEHAT ETPFVPNDAR IDGGSRLQII TGPNMAGKSV
YLRQVALIVL LAQTGSYVPA EEASLGVVDR IFTRVGAEDR LASGESTFMV EMTEAAGILN
GATERSLVIL DEVGRGTSTY DGMSLAWAIA EYLHDDVQAL TLFATHYHEL TRLADSLPGC
RNLKAVVEEV GGEIVFLHRI EPGAESSSYG VHVARLAGLP PRVTDRAQEI LSRLEAEGVK
GFAG