Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Coch_2031 |
Symbol | |
ID | 8368492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Capnocytophaga ochracea DSM 7271 |
Kingdom | Bacteria |
Replicon accession | NC_013162 |
Strand | + |
Start bp | 2431741 |
End bp | 2433912 |
Gene Length | 2172 bp |
Protein Length | 723 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644984485 |
Product | DNA mismatch repair protein MutS domain protein |
Protein accession | YP_003142136 |
Protein GI | 256820857 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATACAA AAACACTTGA AGACTTAGAG TTCCCCATTG TACTCTCTCA TTTGTCTGAC CTTTGCCTTA CTGAACTCGG CAAGAAATAC GCCTTACGTA TCAAGCCTTT CGACAATCAG GAGACGCTGT TATTGGCACT GAATCAAACC AATGAATACC TATCGTCTTT TGATAATAAC AACACAATCC CTTCACACTA TTGCGAATCT ATCACTTCTG AAATTAAACT TCTATCTATT GAAAACGCCT TGTTAGAAGT GTCGAGCATT AGGAAGATTC ATCGTATCAG CGAGATTGTA AACACTCAAA TTCTCTTCTT TAAGAAGTTC AAAACGCTAT ACCCTACCCT TTTCGAAACG GCTGATAGTA TCGAGTATAC TACCGAACTG CTGAACGCTA TCGATAAAGT GTTGGACAAG TACGGCGAGA TAAAAAACGA AGCCTCTCCT ACGCTTGGCG ATATACGTCG CGAGCTGAGC GCCTTGAAAG GAAAACTCAA CGAAAGTTTC AACCGTGCTC TCGCCGAGTA TAACACTGTC GACTATCTCG ATGATATCCG CGAGACCGTA GTAGAAAACC GCCGCGTATT GGCAGTGAAA GCAATGTACC GCCGAAAAGT ACAAGGCACC GTATGGGGGA GCTCAAAAAC GGGTAGCATA GTGTATATAG AACCGCGACA AACCGAAATC TACTCTCGCG AACTCTCCAA TCTGCTTTAT GACGAAAAAG AGGAAATCCA GCGTATATTA AGAGAACTTA CCGCCTTCAT CAGTCAGTTT GCCGACCTGC TGAAAGACTA TCAGCGATAC ATCACAGCCG TTGATATTAT TTGCGCCAAA GCCAAGTACG CTCACCAAAT GAACGCACTT TTGCCAGAAA TCACTCAGGA GCGCGAACTA TTCCTCCGTG AAGCATATCA CCCTTTGCTG TATCTAAACA ATGCCAAAAA AGGTGTTACT ACCTTTCCTC AAACCATAGA ACTGAATGAT GAAAATCGTA TTATCGTCAT CTCGGGACCT AATGCGGGCG GAAAGAGTAT TACGCTTAAA ACCATTGGCT TATTGCAATT AATGTTACAA AGTGGAATGC TCGTTCCGGT GCATCATCGT TCTAAAATGT GTTTGTTTGA ACGCATTCTT ACCGATATCG GCGATAACCA ATCTATTGAA AACCACCTTA GCACATACAG CTACCGACTC AAAAATATGA ATTACTTCCT CAAAAAATGC AACCACCGCA CCTTGTTTTT GATAGATGAG TTCGGGACAG GTAGCGACCC CGAATTAGGA GGTGCTTTGG CTGAAATATT CTTAGAAGAG TTCTACCACC GAAAGGCTTT TGGGGTGATT ACCACCCATT ACACCAATTT AAAAATGCTG GCAGATGAAT TGCCCCACGC GAGCAATGCC AATATGCTTT TCAACGATAA AACCTTAGAG CCCATCTATA AGCTAATAAT AGGCGAAGCG GGAAGTTCTT TCACTTTTGA AGTGGCGCAG AAAAACGGCA TTCCGTTTAG CCTTATCAAT CGGGCTAAGA AAAAAATAGA GAAAGGAAAA GTGCGTTTTG ATGCCACTAT CGCCAAATTG CAGAAAGAGC GCTCCAAAAT GGAAAAAACT GCCGAAACCC TGAAAGATGA GGAAACCAAA GCCCGCGAAG AAGCCAAACG TTTGGAGGAA CTTAATGATA AAGTAAAAAG CAAGCTCATA AATTATCAAG AACTCTATGA CCAAAGCCAA CGTATGATTA CCTTAGGTAG TAAGGTAGAC CAGATAGCCG AACGTTATTT CTACGATGGT AAGCGTCGCC CTTTGGTATC CGAATTTCTC AAACTCATTG AAATGGAAAA TGCTAAGCGC AAACAAATAA GCAAAGAGGA ACGCACCAAG CAAAAAGAGG AGAAAAAAAC TACTACCGAA GAAGTTCATA AGCAAATGGA AGCTATCCGC CAGCAACGCA AAGAAGAGAA GAAAGAGCGT ATTGCCCAAG AACGCGCCGA AAAGGAAAAG CTACAACGCG TCCTTTCAGT AGGCGACCGT GTGCGCATTA AAGACAGCCG AAGCGTGGGA AGTATCGATA AGATTGAAAA AGGCAAAGCC ATAGTAAATT ACGGGGCGTT TACTACCTCT GTTTCTTTGG ACGAATTGGA GCTCGTACAG AAAATCCGAT AA
|
Protein sequence | MNTKTLEDLE FPIVLSHLSD LCLTELGKKY ALRIKPFDNQ ETLLLALNQT NEYLSSFDNN NTIPSHYCES ITSEIKLLSI ENALLEVSSI RKIHRISEIV NTQILFFKKF KTLYPTLFET ADSIEYTTEL LNAIDKVLDK YGEIKNEASP TLGDIRRELS ALKGKLNESF NRALAEYNTV DYLDDIRETV VENRRVLAVK AMYRRKVQGT VWGSSKTGSI VYIEPRQTEI YSRELSNLLY DEKEEIQRIL RELTAFISQF ADLLKDYQRY ITAVDIICAK AKYAHQMNAL LPEITQEREL FLREAYHPLL YLNNAKKGVT TFPQTIELND ENRIIVISGP NAGGKSITLK TIGLLQLMLQ SGMLVPVHHR SKMCLFERIL TDIGDNQSIE NHLSTYSYRL KNMNYFLKKC NHRTLFLIDE FGTGSDPELG GALAEIFLEE FYHRKAFGVI TTHYTNLKML ADELPHASNA NMLFNDKTLE PIYKLIIGEA GSSFTFEVAQ KNGIPFSLIN RAKKKIEKGK VRFDATIAKL QKERSKMEKT AETLKDEETK AREEAKRLEE LNDKVKSKLI NYQELYDQSQ RMITLGSKVD QIAERYFYDG KRRPLVSEFL KLIEMENAKR KQISKEERTK QKEEKKTTTE EVHKQMEAIR QQRKEEKKER IAQERAEKEK LQRVLSVGDR VRIKDSRSVG SIDKIEKGKA IVNYGAFTTS VSLDELELVQ KIR
|
| |