Gene Information |
Locus tag | Noc_0919 |
Symbol | |
ID | 3707309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 1010149 |
End bp | 1012740 |
Gene Length | 2592 bp |
Protein Length | 863 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637737427 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_342961 |
Protein GI | 77164436 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
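The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions on the replicon, and that Protein Length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Noc_0919.
length = gene_length(1010149, 1012740)
print(length, protein_length(length))
```

Under these assumptions the result agrees with the listed Gene Length (2592 bp) and Protein Length (863 aa). The strand does not affect the calculation, since the length depends only on the span covered.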
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGCAA ATGACCCTCA GAAACCCCAT ACGCCTATGA TGCAGCAGTA TCTGCGGATA AAGGCAGAGT ATCCCAATAC GCTCTTGCTC TATCGCATGG GGGACTTCTA CGAATTGTTT TATGACGATG CTCAACGCGC CTCGGAACTA TTGGATATCG CGCTTACCAG CCGTGGCCGA TCGGCCGGCG AGCCCATTCC CATGGCAGGA ATTCCCTACC ATGCGTTGGA TTCCTATTTG GCAAGGTTAG TGCGCCAGGG GGAGTCGGTA GCCATCTGCG AGCAGATAGG CAATCCGGCG GCAAGCAAGG GTCCCGTGGA ACGCCAAGTG GTGCGAATCA TTACCCCCGG AACGGTGACC GAGGAAGCCC TCTTGGAAGC CCGCCGGGAT AATCTGCTAG CCGCACTTCA GAAAGAGGGG GATGTTTTTG GATTTGCTGT GCTTGATCTT TGCAGTGGGC GCTTCAATAT TCTAGAAGTA GCTAGTGAAT CGGCGGCTAC TAGCGAATTA GCCCGCATCC GGCCAGCGGA ACTTTTGGTA AGCGAAGACC TAGCGCTTAT CCTGGTCGAT TCCAAAACTG AGGCGGTGGT GCGACCCTTG CCTCCCTGGT ATTTTGATAG GGAAAGCGCC CAGCGCCAGC TATGTCGGCA GTTTGGGACT CAAGACCTAG CCGGTTTTGG CTGCGAGGAA ATGAAAACCG CAATTGCCGC CGCCGGGTGC CTGCTGCATT ATGTCCAGGA TACCCAACGC ACCCAATTTC CCCACATTCA CGCACTCCAA GTCGAGCGAC AAGAAACCAG CATTATTCTG GACCCCAGCA CTCGGCGCAA TCTAGAATTA GAAGAAAGCC TGAGCGGCGA TTCGGGCCGT AATACCTTAA TCGCGGTACT GGACCATACG GCAACCGCCA TGGGCAGCCG CCTGCTACGG CGCTATCTCC ACCGTCCTCT GCGGGATCAA ACCCTGCTCA AACAACGCCA ACAGGCGCTT GCTACTCTCC TAGAAGGAGG ACTGAGCGAT GTTTTACAAA CATTACTCCG GGGAATAGGC GATATTGAAC GCATTCTCTC CCGCGTGGCC TTGCGTTCCG CCCGTCCGCG AGATCTCGTC CAATTTCGGC AAGCCTTGGG TCTATTACCC AAGATCCAAG AGAGCTTGTT GCAGTTAAAC AGAGACAGCC TTTTACTTCA GTCGCTACAA GAAGATTTGG GTCCCTTTCC CAATCTCCAT GAACTGTTAC AACGGGCCAT TTGTGAAAAT CCGCCGGTGC TCATTAGAGA TGGCGGGGTA ATTGCCCTCG GCTTTGACTC GGAACTAGAT GAATTGCGGC ATTTAAGCGG CAATGCCGGG CAATTTTTAG TGAAATTGGA GCAGCGGGAG CGGGAGCGCA CCAAAATCCC AACTCTCAAG GTAGGCTACA ACAAAGTTCA TGGCTACTAT CTTGAGATCA CACGTGCTCA GGCCCATCAA GCGCCTCCTG ACTATATTCG TCGTCAAACC TTAAAGGGGG CGGAACGCTA TATTACCCCG GAATTGAAAG GCTTTGAAGA CCAGGTGTTA AGCGCCCGAG AACGGGCGCT GGCGCGGGAA AAAGCCCTTT ACGAGGAGCT ATTGGAACAA TTTATGGAAC CCCTCCCCGC TTTGCGGGCC TGCGCCAACG CCTTGGCGGA GCTGGATGTG CTTCATAACC TGGCCGAGCG GGCTAAAACC TTGGAATATG TGGCACCCCT ATTGAGCGAT CAGCCAGGGA TATTTATCGA AAGGGGCCGC CACCCGGTGG TGGAACAAAC CCTAGAGGAT 
CCTTTTGTGC CTAATGATCT GACTCTCCAT GAAGCACGGC GGATGCTGAT CATTACCGGC CCCAATATGG GAGGAAAGTC TACGTACATG CGCCAGACAG CCTTAATTGT CTTGCTCGCC CATATTGGCA GTTTTGTGCC AGCCCGCCGG GCTGTCATTG GCCCTATTGA CCGAATTTTT ACCCGTATCG GCGCAGCCGA TGATCTTGCC GGGGGACGCT CCACTTTTAT GGTCGAAATG ACCGAGACCG CCAACATTCT ACATAATGCG ACTGAGCACA GCTTAGTCTT GCTGGATGAG GTTGGCCGGG GCACCAGTAC TTTCGACGGT CTATCTCTCG CTTGGGCAGT GGTCTCCCAC CTGGCAAACA AAGTGCGTTC CCTAACGCTA TTTGCGACCC ATTACTTTGA GCTCACCACT CTTCCCGAGT GTCTTCCCGG CGTGGTTAAT CTTCACCTTA CAGCAACCGA GCATAAGGAG CATATCGTTT TTCTCCATGC GGTGAAAGAA GGTCCTGCCA GCCAAAGTTA TGGCCTTCAG GTAGCCGCAT TGGCGGGTGT GCCCCAGGAG ATCATTGCCC AGGCGCGGCA ACAGCTTATG GAATTAGAAA ACAATACTTG GCAAAAATCG ATCAATGGAG GCGGCCCTCA ACTAGACTTG CTTGCGCCCC CTGCGGATCA TCCTGCCGTT CAAATACTAC AAGATTTAGA CCCCGATGAA CTTACTCCCC GGCAAGCTTT AGAGAAACTC TACGAACTCA AACAGCTATT AGACCTTGCT GTTACACACT AA
|
Protein sequence | MPANDPQKPH TPMMQQYLRI KAEYPNTLLL YRMGDFYELF YDDAQRASEL LDIALTSRGR SAGEPIPMAG IPYHALDSYL ARLVRQGESV AICEQIGNPA ASKGPVERQV VRIITPGTVT EEALLEARRD NLLAALQKEG DVFGFAVLDL CSGRFNILEV ASESAATSEL ARIRPAELLV SEDLALILVD SKTEAVVRPL PPWYFDRESA QRQLCRQFGT QDLAGFGCEE MKTAIAAAGC LLHYVQDTQR TQFPHIHALQ VERQETSIIL DPSTRRNLEL EESLSGDSGR NTLIAVLDHT ATAMGSRLLR RYLHRPLRDQ TLLKQRQQAL ATLLEGGLSD VLQTLLRGIG DIERILSRVA LRSARPRDLV QFRQALGLLP KIQESLLQLN RDSLLLQSLQ EDLGPFPNLH ELLQRAICEN PPVLIRDGGV IALGFDSELD ELRHLSGNAG QFLVKLEQRE RERTKIPTLK VGYNKVHGYY LEITRAQAHQ APPDYIRRQT LKGAERYITP ELKGFEDQVL SARERALARE KALYEELLEQ FMEPLPALRA CANALAELDV LHNLAERAKT LEYVAPLLSD QPGIFIERGR HPVVEQTLED PFVPNDLTLH EARRMLIITG PNMGGKSTYM RQTALIVLLA HIGSFVPARR AVIGPIDRIF TRIGAADDLA GGRSTFMVEM TETANILHNA TEHSLVLLDE VGRGTSTFDG LSLAWAVVSH LANKVRSLTL FATHYFELTT LPECLPGVVN LHLTATEHKE HIVFLHAVKE GPASQSYGLQ VAALAGVPQE IIAQARQQLM ELENNTWQKS INGGGPQLDL LAPPADHPAV QILQDLDPDE LTPRQALEKL YELKQLLDLA VTH