Gene Information |
Locus tag | Rxyl_0292 |
Symbol | |
ID | 4117819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | + |
Start bp | 299533 |
End bp | 301887 |
Gene Length | 2355 bp |
Protein Length | 784 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 638035081 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_643080 |
Protein GI | 108803143 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
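
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the source record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above (Start bp, End bp).
length = gene_length_bp(299533, 301887)
print(length)                     # 2355
print(protein_length_aa(length))  # 784
```

Both values agree with the Gene Length (2355 bp) and Protein Length (784 aa) fields listed above.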
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.518405 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGGAC GCTACGCCGA ACTCAAGGCC CAGCTTCCGC CCGGGACGAT CCTCTTCTAC CAGGTCGGCA CCTTCTTCGA GACCTTCGAG GAGGACGCCA AGACCGTCTC CCGCGAGCTC TCGCTGCGGC TCACCAGCCG GGAGGCCGCC GGGGAGGGGA GGGTGCCGCT GGCGGGGGTC CCCGGGCACG CCCTGCAGGA GCACGTGGCC GCGCTCTTGA GGAAGGGCCA CTCCGTGGCC ATCGCCGAGC AGCGGCAGCA CCCCACCAAG CCGCGGCAGT TCACCCGCGA GATCACCCAG ATTTTGACCC CCGGCACCGT GATCGAGGAC AACGTCCTCT CCGCCGGGAG GTCCAACTAC CTGGCCACCT TCGTCGTCCG GGACGGGAAG GCCGGGATAG CGGTGGCGGA GGCCTCCACC GGGGAGTTCT CCGGTACGGT GGTGCCGGAG GCGGAGCTGC CGGCGGAGCT GGAGCGCTGG TCGCCGCGGG AGGTGGTGGT CCCCGAGCGG ACGGCCGCCG AGGATCTGCC CCGGCTGGAG GCGCGGGTGA GCACCGCCCC CCGCTGGACC TTCGAGCCCT CCGCGGGGGA GCAGGCCCTG CGGCAGCACT TCGGGGTGGC GAGCCTCAAG GGCTACGGGC TCGACGGGAG CCCGCAGCTC GTGGCCGCGG CGGGGGCCCT CATCCGCTAC CTGAGCACCC TGCGCGGCGG CAGCCCGCCG GAGCAGATCG TCTCCTTCAG GCGCTACGAC CCCGGCCAGG CGATGCTGCT GGACGCCGCC ACCCGCAGGA ACTTGGGGCT GGAGGAGCTG ATCTCCACGG TGGACCGTAC CAGGACCCCG ATGGGCCAGC GGACCCTGCG GCGCTGGCTG GAGCGGCCGC TCCTGGAGGC CTCCCGGATC AACCAGCGGC TCGAGGCGGT GGACGCCCTC TTCCCGGACT ACATGCTGCG GGAGGAGGTG CGCGAGCACC TCGGCGGCAT CCCGGACATC GAGCGGATCG CCACCAGGAT CGTGCGGCTC TCGGCCTCCC CGGGCGACCT GCTCGCCCTG CGGGGGGCGC TCGAGGCGCT GGGGCCGCTG CGGCGGGCGC TCGCCCCCGC CGCGCAGCGC AGCGAGCTCC TCCGGCGGGC GCTCTCGGCG ATGGAGGAGC CGCCGGGGGT CAGGGAGCTC ATCGCCGAGG CCATCTCCGA GGAGGAGGGC GAGATCATCC GCCCCGGCTA CTCCGCCGAG CTGGACGAGG CCCGCTCCTT CCGGGACGGG GCCCACGAGT GGCTCACCCG CTTCGAGGCC GAGGAGCGCC TGAAGACCGG GCTCAAGACC CTCAAGGTCG GCTACCGGGA CGGCGAGGGG TACTTTATAG AGGTGGGGGG GAAGGAGGCC CACCGGGTTC CGCCCCACTA CGAGCACCGC AAGGCCCTCA AGCACAACGC CCGCTACGTC ACCGTAGAGC TCAAGGAGCA CGAGTCCAGG ATGCTCACCG CCCGCGAGGA GGTCGAGCGG CTGGAGCGCA GGATCCTGGG CGAGATCCGG GCGGCCGTCA AGGAGGCGGC GCCGCAGCTG CAGCGGATCG CCCGGGCGGT GGCGGTGGTG GACGTGGTCG CCTCGTTCGC CGCCGCCGCG GCCGAGCTGC GCTACTGCCG GCCGCGGGTC GCGGAGGAGC GGGGGATCCG GATCGTCTCC GGGCGCCACC CGGTGGTCGA GCACGCCACC GAGACGCCCT TCGTGCCCAA CGACGCCCGG ATAGACGGCG GCTCCCGCCT GCAGATCATC ACCGGCCCCA ACATGGCCGG GAAGTCCGTG 
TATCTGCGGC AGGTGGCCCT GATAGTCCTG CTGGCCCAGA CCGGCTCCTA CGTGCCCGCG GAGGAGGCCT CGCTGGGGGT CGTCGACCGG ATCTTCACCC GGGTCGGCGC GGAGGACCGG CTGGCGAGCG GGGAGTCCAC CTTCATGGTG GAGATGACGG AGGCGGCGGG TATCCTGAAC GGCGCCACGG AGCGCAGCCT GGTGATCCTG GACGAGGTGG GCCGGGGGAC CTCCACCTAC GACGGGATGA GCCTGGCCTG GGCGATCGCC GAGTATCTGC ACGATGACGT GCAGGCCCTC ACGCTCTTCG CCACCCACTA CCACGAGCTG ACACGGCTCG CCGACTCGCT CCCCGGCTGC CGCAACCTGA AGGCGGTGGT TGAGGAGGTG GGCGGGGAGA TCGTGTTCCT GCACAGGATT GAGCCCGGCG CCGAGTCCTC CTCCTACGGG GTGCACGTGG CCCGGCTGGC CGGGCTGCCG CCGCGGGTGA CCGACCGGGC CCAGGAGATC CTCTCCCGCC TGGAGGCCGA AGGGGTGAAG GGGTTCGCCG GGTGA
|
Protein sequence | MLGRYAELKA QLPPGTILFY QVGTFFETFE EDAKTVSREL SLRLTSREAA GEGRVPLAGV PGHALQEHVA ALLRKGHSVA IAEQRQHPTK PRQFTREITQ ILTPGTVIED NVLSAGRSNY LATFVVRDGK AGIAVAEAST GEFSGTVVPE AELPAELERW SPREVVVPER TAAEDLPRLE ARVSTAPRWT FEPSAGEQAL RQHFGVASLK GYGLDGSPQL VAAAGALIRY LSTLRGGSPP EQIVSFRRYD PGQAMLLDAA TRRNLGLEEL ISTVDRTRTP MGQRTLRRWL ERPLLEASRI NQRLEAVDAL FPDYMLREEV REHLGGIPDI ERIATRIVRL SASPGDLLAL RGALEALGPL RRALAPAAQR SELLRRALSA MEEPPGVREL IAEAISEEEG EIIRPGYSAE LDEARSFRDG AHEWLTRFEA EERLKTGLKT LKVGYRDGEG YFIEVGGKEA HRVPPHYEHR KALKHNARYV TVELKEHESR MLTAREEVER LERRILGEIR AAVKEAAPQL QRIARAVAVV DVVASFAAAA AELRYCRPRV AEERGIRIVS GRHPVVEHAT ETPFVPNDAR IDGGSRLQII TGPNMAGKSV YLRQVALIVL LAQTGSYVPA EEASLGVVDR IFTRVGAEDR LASGESTFMV EMTEAAGILN GATERSLVIL DEVGRGTSTY DGMSLAWAIA EYLHDDVQAL TLFATHYHEL TRLADSLPGC RNLKAVVEEV GGEIVFLHRI EPGAESSSYG VHVARLAGLP PRVTDRAQEI LSRLEAEGVK GFAG
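
The sequences are shown in space-separated blocks of 10 characters. A minimal sketch of how such a blocked string might be cleaned and its GC content computed is given below. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not the gene-wide GC content reported in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase the result."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence above, used only as a short example.
seq = clean_sequence("ATGCTCGGAC GCTACGCCGA")
print(len(seq))         # 20
print(gc_percent(seq))  # 65.0
```

Applying `gc_percent` to the full cleaned gene sequence would be the way to compare against the GC content field in the record.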