Gene Information |
Locus tag | Xaut_0035 |
Symbol | |
ID | 5424135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xanthobacter autotrophicus Py2 |
Kingdom | Bacteria |
Replicon accession | NC_009720 |
Strand | + |
Start bp | 34202 |
End bp | 36967 |
Gene Length | 2766 bp |
Protein Length | 921 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640879280 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001414951 |
Protein GI | 154243993 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
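
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the stop codon. Both assumptions agree with the values in this record but are not stated by it. The function name `gene_and_protein_length` is hypothetical.

```python
def gene_and_protein_length(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1          # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1        # drop the stop codon
    return gene_length, protein_length


# Values taken from the record for Xaut_0035
gene_bp, protein_aa = gene_and_protein_length(34202, 36967)
print(gene_bp, protein_aa)  # 2766 921
```
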
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.892203 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTCAG ACCGCGCCCG GACCGCCACC CCAGACCTTC CGCAGGACTC CCCCGAGCCC GCCAGCGTCG CTCCCGCCCC CGCGCGGGCG GAGGAGGCGC GTGTCACGCC CATGATGGCG CAGTATCTGG AGATCAAGGC GGCCAATCCG GACAGCCTGC TATTTTACCG CATGGGCGAT TTCTACGAGC TGTTCTTCGC CGACGCGGAA GCCGCCTCGC AGGCGCTCGG CATCGTGCTG ACCAAGCGCG GCAAGCACCT GGGCGAGGAC ATTCCCATGT GCGGCGTGCC CATCGACCGC GCCGAGGAAT ATCTGCACAA GCTCATCGCT CTGGGCTTCC GCGTGTCGGT GTGCGAGCAG CTGGAGGACC CGGCGGAGGC GAAGAAGCGC GGACCGAAAT CCGTGGTGCG GCGGGACGTG ACCCGCCTCG TCACCCCCGG CACGATCACC GAGGACGCGC TGCTGGATGC CCGGCGCGAG AACGTGCTGG CGGCGCTCGC CCGCGTGCGG GCCGGATCGG GGCCGGAGGA TTTCGCCTAC GCGCTGGCCT ATACCGATAT GTCCACCGGC AGCTTCCGGG TCACCGCCAC CGCGCGTGAC GACCTCTCCG GCGACCTCGC CCGCCTCGAT CCGGCGGAGA TCCTGGTCTC CGACGCGGTG CTGGACGACG GCGAACTGCG CGCGCTGCTG CGGGCCTTTC CGGCGGTGAC GCCCCTGCCG CGCCAGTCCT TCGACGGAGC GGGGGCGGAA AAGCGCCTCG CCGATTTCTT CGGCGTGGCG GCGCTCGACG CCTTCGGCAC CTTCGCCCGC GCCGAGCTGA TCGCGGCGGC GGCCATCGTC GCCTATGTGG ATCGCACCCA GCTGGGCGCC AAGCCGCTGC TCTCCCGTCC GGTGAAGGAA GCGGAAGGCG GCATCATGGC CATCGATGCC GGCACGCGGG CCAATCTGGA GCTGGTGCGC ACCACCTCCG GCGAGCGGCG CGGCTCCCTG CTCGCCGCCG TGGATCGCAC GGTGACGGCG GCCGGCGCGC GCCTCATCGC CCGCCGCATC GCCGAGCCGC TTACGGACCT TGCCGCCATC CGTGCCCGGC ACGACGGCGT CGCCCATCTG GTGGAGGAGG GGGAATTGCG GCGCGAGCTG CGCGCCCGGC TCTCCCGCGC CCCCGACATG GCCCGAGCAG TCACCCGCCT CGCCCTCCAG CGCGGCGGCC CGCGCGACCT TGCTGCCGTG CGCGATGCGC TGGACGGTGC CCTCGCCATC GCCGGCCTGT TCGCCGCCGC GCCACCGGCG GACCTTGCCC GCTCAGCGGC GGCGCTGGCG CGGGTGCCCC ATGCCCTGGT GCTCGACCTC GCCTCGGCGC TCGCCGAGAG CCTGCCGCCC CTCCGTCGCG ACGGCGGCTT CATCCGCGAA GGCTGCGACG CCGAGCTGGA TGCGACGCGT GCCCTGCGCG ACGAGAGCCG GCGCGTGGTG GCGGCGCTGG AGCGGCGCTA CGTGGACGAG ACCGGCGTGC GCGCCCTGAA GATCCGGCAC AATGCGGTGC TCGGCTATTT CGTCGAGGTC TCCGCCCAGA ACGCCGACCG CCTGCGCGAG GCGCCACACG ACGCCGTTTT CGTCCACCGC CAGACCATGG CGGGGGCGGT GCGCTTCTCG TCCGTCGAGC TGGGCGATCT TGAGAGCCGT ATCGCCAGCG CCGGCGAGCG GGCGCTGGGG CTGGAGCAGG CCATCTTCGA CCGGCTGGCT GCCGCCGTGG TGGCCGAGAC CGAGACCATC CGCGCCGCCG CGGAGGCGCT GGCGGAACTC 
GATGTGGCTG CGGGCTTTGC AGAGCTTGCG GCGGTGGAGA ACCATGTGCG CCCGCACATG GAGCCGGGGG TCGCCTTCGC CATCGCCGGC GGGCGCCATC CGGTGGTGGA GCAGGCCCTC GCCAAAGAGG GCGGGCCGTT CGTGCCCAAC GATTGCGACC TCTCCCCGCC GGAGGGGTTC GAGGACGGGC GCATCGTGCT GGTCACCGGG CCGAACATGG CCGGCAAGTC CACCTTCCTG CGCCAGAACG CGCTCATCTG CGTGCTGGCG CAGGCCGGCG CCTTCGTGCC CGCCCGGTCC GCGCGCATTG GCGTGGTGGA CCGCCTGTTC TCCCGCGTGG GCGCGGCGGA TGACCTCGCC CGCGGGCGTT CCACCTTCAT GGTGGAGATG GTGGAGACCG CTGCCATCCT GAACCAGGCC ACCGCCCGCT CCCTGGTCAT CCTCGACGAG ATCGGGCGCG GCACCGCCAC CTTCGACGGC ATGTCCATCG CCTGGGCGAG CCTTGAGCAC CTGCACGAGG TGAACCGCTG CCGGGCGCTG TTCGCCACAC ATTTCCACGA ACTGACCGCC CTGTCCCAGC GCTGCAAGCG GCTCTCCAAC GCCACGGTGA AGGTCACCGA ATGGCATGGC GACGTGATCT TCCTGCATGA GGTGGTGCCG GGGGCGGCGG ATCGTTCCTA CGGCATCCAG GTGGCCAAGC TCGCCGGCCT GCCGGAGGCG GTGATCACCC GCGCCAAGGC GGTGCTGGCG GAGCTGGAAG CGGCCGAGCG CGCCTCTCCG GCCCAGAAGC TCATCGATGA TCTGCCCCTG TTCGCGGTGC GCCCGAAGCC TGCGGCCGCC GCATCAGCGG ACCCGAAGGC CGAGGCGACG CTCTCCGCCC TCGACGGCAT CGACCCCGAC AGCCTGAGCC CGCGCGAGGC GCTGGATGCG CTCTATCGGC TGAAGGGCCT CCGGCGCGAG GGGTGA
|
Protein sequence | MSSDRARTAT PDLPQDSPEP ASVAPAPARA EEARVTPMMA QYLEIKAANP DSLLFYRMGD FYELFFADAE AASQALGIVL TKRGKHLGED IPMCGVPIDR AEEYLHKLIA LGFRVSVCEQ LEDPAEAKKR GPKSVVRRDV TRLVTPGTIT EDALLDARRE NVLAALARVR AGSGPEDFAY ALAYTDMSTG SFRVTATARD DLSGDLARLD PAEILVSDAV LDDGELRALL RAFPAVTPLP RQSFDGAGAE KRLADFFGVA ALDAFGTFAR AELIAAAAIV AYVDRTQLGA KPLLSRPVKE AEGGIMAIDA GTRANLELVR TTSGERRGSL LAAVDRTVTA AGARLIARRI AEPLTDLAAI RARHDGVAHL VEEGELRREL RARLSRAPDM ARAVTRLALQ RGGPRDLAAV RDALDGALAI AGLFAAAPPA DLARSAAALA RVPHALVLDL ASALAESLPP LRRDGGFIRE GCDAELDATR ALRDESRRVV AALERRYVDE TGVRALKIRH NAVLGYFVEV SAQNADRLRE APHDAVFVHR QTMAGAVRFS SVELGDLESR IASAGERALG LEQAIFDRLA AAVVAETETI RAAAEALAEL DVAAGFAELA AVENHVRPHM EPGVAFAIAG GRHPVVEQAL AKEGGPFVPN DCDLSPPEGF EDGRIVLVTG PNMAGKSTFL RQNALICVLA QAGAFVPARS ARIGVVDRLF SRVGAADDLA RGRSTFMVEM VETAAILNQA TARSLVILDE IGRGTATFDG MSIAWASLEH LHEVNRCRAL FATHFHELTA LSQRCKRLSN ATVKVTEWHG DVIFLHEVVP GAADRSYGIQ VAKLAGLPEA VITRAKAVLA ELEAAERASP AQKLIDDLPL FAVRPKPAAA ASADPKAEAT LSALDGIDPD SLSPREALDA LYRLKGLRRE G
|
| |
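
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table from the NCBI compact amino-acid string, which is the same for translation table 11 and the standard code (the two differ only in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 nucleotides of the gene sequence are used here rather than the full record.

```python
from itertools import product

# NCBI compact form of the codon table: codons are ordered with bases
# T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence from the record
print(translate("ATGAGCTCAG ACCGCGCCCG GACCGCCACC"))  # MSSDRARTAT
```

The output matches the first ten residues of the protein sequence listed above. Running the same function on the complete gene sequence should, under the same assumptions, reproduce the full 921-residue protein.
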