Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3293 |
Symbol | |
ID | 3915940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 3512607 |
End bp | 3515183 |
Gene Length | 2577 bp |
Protein Length | 858 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640446078 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_498562 |
Protein GI | 87201305 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0310827 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGCGC AATATCTTGC GCTAAAGGAC CAGGCTGGCG ACTGCCTGCT GTTCTATCGC ATGGGCGACT TCTTCGAGCT TTTCTTCGAC GACGCGAAGG TCGCCGCACA GGTTCTCGAC ATAGCCCTCA CCAGCCGGGG CGAGCATGGC GGCGCGCCCA TTCCGATGTG CGGCGTGCCG GTCCATTCGG CCGAGGGATA CCTTGCCCGG CTGATCAAGG CGGGGTGCCG CGTCGCCATC GCGGAACAGG TGGAAACCCC CGAGGAAGCG AAGAAACGCG GGGGTTCCAA GGCTCTCGTC GCGCGCGACA TCGTCCGCTT CGTCACGGCC GGGACGCTGA CCGAGGAGGC GCTGCTCGAA CCGCGCCGGG CCAACGTGCT TGCGGCGGTG TGCGAAGTAC GCGGCCTCAT CGGCATCGCC GCCTGCGACA TCTCGACCGG CCGGATGGAG CTGGAGGAAT GCGCCGCCGA CCAGATTGGC GCGGCTCTTG CCCGGCTTGG TGCGAGCGAG ATCGTCGCAC CCGATTCCTG GGATGACCGG CCCTTCGATT GCGTTCCGCG TCCGAATCGG ACCTTCGCCA GCGAAGAGGG CGAAGCGCGA CTCAAGGCCG TGCACGGCGT GTCGACCCTC GACGGCTTCG GCCAGTTCAC CCGGGCGATG CTCTCGGCGG CCGGCGGGCT GGTCACCTAC CTCGACCATG TCGGGCGCGG CGCGCTGCCG CTCCTGCTGC CCCCGGTGGC GCGGGAAGCG GGCACCCACA TGGCAATGGA CGAGGCCACG CGGGCAAGCC TCGAGATCCT GAACAGTTCC ACCGGGACGC GGCGGGGCAG CCTTGTCGAG GCGATCGACC GCTGCGTCAC GGGCGCCGGC GCGCGCCTCC TGGCGGAAGA CCTCTCCGCG CCGCTGACCG ACGCACGCGC GATCAACCGC CGCCTCGAAA TGGTCAGCTG GCTCCACGAC GATCCGCTGC TGAGGGGCGA CATCCGCGCC ATCCTGCGCT CGCTGCCCGA CGTCGGGCGT GCGCTTGGCC GCGTCGTCGC GGGGCGTGGA AGCCCTCGCG ATCTCGGGCA ATTGCGCGAC GGCCTGTCCG AAGCGCGGCG GCTGCACGGT CTGCTGCACG CTCGCGCTGA CCGGCCCGAA CCGGTCCACG CGCTGCTCCC CTCGCTCGCC GGCCACGGCG CACTTTGCGA CCTCTATGCC CGTGCGCTGG TCCCTGCCCC TCCGACCGAA AGGTCACAGG GGGGCTACAT CGCCGAGGGC TACGACGCGG CGCTCGATGA ACTGCGCCGC ATATCGGGCA ACGCCCGCCG CGCGATTGCC GCGCTGGAGG CCAAGTACCG CGACGACACC GGGATCACTG CCCTCAAGAT CCGCCACAAC GGTGTGCTCG GCTATTTCAT CGAGGTTCCC GCAAAGCACG CCGACCGGTT GATGGCGCCC GATTCCGGTT TCACCCATCG CCAGACCATG GCTGGAGCCG TGCGTTTCAA CGCACTGGCG CTGCATGAGG AGGCGAGCCG CATCGCCGAG AGCGGCGGAC ACGCGCTGGC AGCGGAAGAA GCACACTTCG AGGACCTCGT CGGCCACGCG GTGCGCGCGA AGGAGGCGAT CGCGGCCACC GCCGCCGCGC TTGCGCGCAT CGACGTCGCC GCCGGTCAGG CCGAACGCGC TGCCGAAGGC GGCTGGGCCC TGCCGCGCGT GGTAGACGAG CCTTGTCTCG AAATAAGTGG CGGGCGCCAT CCGGTCGTGG AAGCGGCGCT TGCCGCCAAG GGCGAGCGCT TTGTCGCCAA CGACTGCGCG CTCGGGCCGC AGGACCGGCT GTGGCTGGTC GGAGGGCCTA ACATGGGCGG CAAGTCCACG TTCCTGAGGC AGAACGCGCT GATCGTACTG CTCGCCCAGG CAGGCGGCTT CGTTCCGGCA CGGTCGGCGA CAGTGGGCCT CGTCGACCGC CTGTTCAGCC GCGTCGGCGC ATCGGACAAT CTCGCGCGCG GCCGCTCGAC CTTCATGGTC GAGATGGTCG AGACGGCAGC GATCCTCAGC CAGGCAACGG ACCGCAGCTT CGTCATTCTC GACGAAGTCG GGCGCGGCAC TTCGACCTAC GACGGACTCG CGCTCGCCTG GGCGGTAGCC GAGGCGGTCC ACACCATCAA CCGCTGCCGC TGCCTTTTCG CCACGCACTA CCACGAACTC GCCCGCCTCG CCGAAAGCTG CGACGCCCTC TCGCTCCATC ACGTCCGCGC GCGCGAGTGG AAGGGCGACC TCGTCCTGCT GCACGAACTG GCCGATGGTC CGGCCGACAA GTCCTACGGC CTTGCCGTGG CCCGCCTCGC CGGCGTTCCC GCGCCCGTGA TCAAGCGCGC CAAGTCGGTG CTGGAGAAGC TGGAGAAAGG CCGCGCCGCC ACCGGCGGGC TGGCGGCCGG GCTCGACGAC CTGCCCCTCT TCGCCGCCGC CATCGAGGCC GCCGAGGAAA AGGTCGATGC CCTTCGCGAA CGCCTCAACG GCCTCGACAT CGACGCACTG TCCCCTCGCG AGGCTCTGGA CCTGCTCTAC GAACTGAAAG CCCAGGCCAA TGGTTGA
|
Protein sequence | MMAQYLALKD QAGDCLLFYR MGDFFELFFD DAKVAAQVLD IALTSRGEHG GAPIPMCGVP VHSAEGYLAR LIKAGCRVAI AEQVETPEEA KKRGGSKALV ARDIVRFVTA GTLTEEALLE PRRANVLAAV CEVRGLIGIA ACDISTGRME LEECAADQIG AALARLGASE IVAPDSWDDR PFDCVPRPNR TFASEEGEAR LKAVHGVSTL DGFGQFTRAM LSAAGGLVTY LDHVGRGALP LLLPPVAREA GTHMAMDEAT RASLEILNSS TGTRRGSLVE AIDRCVTGAG ARLLAEDLSA PLTDARAINR RLEMVSWLHD DPLLRGDIRA ILRSLPDVGR ALGRVVAGRG SPRDLGQLRD GLSEARRLHG LLHARADRPE PVHALLPSLA GHGALCDLYA RALVPAPPTE RSQGGYIAEG YDAALDELRR ISGNARRAIA ALEAKYRDDT GITALKIRHN GVLGYFIEVP AKHADRLMAP DSGFTHRQTM AGAVRFNALA LHEEASRIAE SGGHALAAEE AHFEDLVGHA VRAKEAIAAT AAALARIDVA AGQAERAAEG GWALPRVVDE PCLEISGGRH PVVEAALAAK GERFVANDCA LGPQDRLWLV GGPNMGGKST FLRQNALIVL LAQAGGFVPA RSATVGLVDR LFSRVGASDN LARGRSTFMV EMVETAAILS QATDRSFVIL DEVGRGTSTY DGLALAWAVA EAVHTINRCR CLFATHYHEL ARLAESCDAL SLHHVRAREW KGDLVLLHEL ADGPADKSYG LAVARLAGVP APVIKRAKSV LEKLEKGRAA TGGLAAGLDD LPLFAAAIEA AEEKVDALRE RLNGLDIDAL SPREALDLLY ELKAQANG
|
| |