Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0670 |
Symbol | |
ID | 3785155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 765489 |
End bp | 768197 |
Gene Length | 2709 bp |
Protein Length | 902 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637810752 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_411369 |
Protein GI | 82701803 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.48055 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCAAT CCAGTAAAGC CAGACTTTCA ACCGATCCCA CAGTATTTGA AGCGGTATTG AATAACCATA CCCCCATGAT GCAGCAATAC CTGCGCATCA AGGCGCAGCA TCCGGATATG CTGATGTTTT ACCGGATGGG GGATTTCTAT GAACTGTTCT TTGACGATGC GGAAAAGGCA GCGAAGCTGC TCGACATCAC CCTGACCCGT CGCGGCACTT CGGCGGGAGA GCCGATCAAG ATGGCTGGTG TGCCTTACCA TGCGGCGGAA CAGTATCTGG CAAAGCTCGT CAAGCTTGGA GAATCGGTCG TCATCTGCGA ACAGGTGGGC GATCCCGCCA CTTCGAAAGG ACCGGTAGAA CGCCAGGTGA CACGCATCAT CACCCCCGGC ACCCTGACCG ATGCTGCGCT CCTGGAGGAC AAGCGCGACA GCGCCCTGCT TGCCTTGCTC GTGCATGAAT CCACCCTGGG GCTGGCGTGG CTGAATCTTG CAGCAGGGCA ATTTTCCGTG ATGGAGACTT CGGTGAACAA TCTCACAGCC GAACTCGAAC GCCTGAAGCC TGCCGAGATT CTTTTGCCGG AATCGCTGAA TCTTGCCGGG ATCAACGACA GGGTAATACA GGAGAAGTTA TGCGTGAAGC ATTTGCCGGC ATGGCAGTTC GATACCGCCG CGGCTGTGCG CAATCTCTCC CGGCAGTTCG GTACCCATGA CCTTTCCGGT TTCGGCTGCG AGGATCTGGA TGTTTCTCTC GGCGCCGCAA GTGCGTTGCT GGATTATACC CGGCTGACGC AGGGCGCCAG CATAGGTCAT ATCAAGGGGT TGCGGGTTGA GCGGGAGGAT ACCTATCTGC GCATGGACGC CACCACTCGC CGCAATCTGG AGATCTCCGA AACTATACGA GGTGACGCGG CGCCCACTTT ATTGTCCCTG CTGGATACCT GTTCGACCAA CATGGGCAGC CGACTGCTGT GCCACTGGCT GCACCACCCG CTTCGCGACC GCGGGCTGAT CCAGAACCGG CTCAATGGTG TATCTTTTTT GATGGGGGAA GCAGGATCAG GCCCCTGCCT TTCGGTGCGC GACTGCTTGA AGCGCGTGAC GGATATCGAG CGCATTACTG CCCGTATCGC CCTGAAATCG GCACGGCCAC GGGACTTATC CGGGCTGCGC GACAGCCTGA AACGGCTGCC CGCAGTCAAC AACGCCGTTG CCGGTACCGC TACTACAAGT AGCGGCGGCA GTGACGTAAG CGCGCATGTC GCGGCGCTCA TCCACTCGAT GGCGCCAGAC AATGCTCTCG TTGCGCTGCT GGAGAAATCG CTGAAGGAAG AACCGGAGGT GATGCTGCGC ACCGGGGGCG TGATTGCCGA TGGCTACGAT GCCGAATTGG ATGAACTGCG CGCGATACAC AACAATTGCG ATGAATTCCT GCTGCAACTC GAAACCCGGG AAAAGGCCCG TACCGGTATT GCGAATCTCA AGGTGGAATA CAACCGTTTG CACGGTTTTT ACATTGAAGT GACGCATGCG CACACCGAGA AAATCCCCGA CGACTATCGG CGCAGACAGA CGCTGAAGAA TGCGGAGCGC TACATTACGC CTGAGCTCAA AGCTTTCGAG GAAAAGGCGC TTTCTGCCCA GAGCCGGGCA CTGGAGCGGG AGAAATTGCT GTATGGCGAG CTGCTGGATA TGCTCTCCCA ATATATCGAC CATCTGCAGC AGGTTGCACG CAGCGTGGCA GAACTGGATG TCCTTGCGAC CTTTGCCGAA CGCGCGCTGG CACTTGACTA CAGCCTGCCC CTTTTTACCA GTGACAGTGT TATCGAAATT CAGGCAGGGC GGCATCCGGT AGTTGAAAAA CAAGTGGACA GCTTCATCGC CAATGATGTC CAGCTTGGCG CCCGCACGGG TGGCAGACGG CAGATGCTCG TCATTACCGG GCCCAACATG GGCGGGAAGT CTACCTACAT GCGCCAGGTT GCCCTGATTG CGCTACTCGC CCATTGCGGG AGTTTTGTTC CCGCGAGAAG CGCGCTTATT GGACCGCTCG ATCAGCTTTT CACGCGGATC GGCGCATCCG ACGATCTGGC GGGAGGGCGC TCCACCTTCA TGATGGAAAT GACCGAGGCG GCAAATATCC TGCACAACGC CACGGCGCAA AGCCTGGTGC TGATGGATGA AGTGGGCCGG GGAACCTCTA CGTTCGATGG ACTGGCGCTC GCTTTCGCAA TCGCCCGTTA TCTGCTGGAA AAGAACCGTA GCTACACCCT ATTCGCCACA CATTATTTCG AATTGACGCG GCTTGCGGAG GAGTTTGCAC AGGTCGCCAA TGTGCACCTG CGCGCGGTGG AGCACAAACA TCATATCGTG TTCCTGCACG CCGTCAACGA GGGGCCGGCC AGCCAGAGCT ACGGTCTCCA GGTGGCGGCA TTGGCCGGAG TGCCTGATCC GGTAATAAGA ACAGCGAGAA GATATCTGCT GAAACTCGAG CAGGAAGCGT TGAGCAATCA GCCGCAAGGA GACTTGTTCT CCAGGGACGA CCTCTTCTGG AAGCAGGACA GGATGCCGGA AGGTTCCGTT GACAAAAATG ACAGCGCCCC GGAGCATCCC GTACTTGCAC TGTTACGCAC TATCGTTCCC GACGACTTGA GCCCGAAACA GGCCCTGGAG CAGCTCTACG GCTTGAAGAA GGCGGCAGAG AAAGAATAG
|
Protein sequence | MSQSSKARLS TDPTVFEAVL NNHTPMMQQY LRIKAQHPDM LMFYRMGDFY ELFFDDAEKA AKLLDITLTR RGTSAGEPIK MAGVPYHAAE QYLAKLVKLG ESVVICEQVG DPATSKGPVE RQVTRIITPG TLTDAALLED KRDSALLALL VHESTLGLAW LNLAAGQFSV METSVNNLTA ELERLKPAEI LLPESLNLAG INDRVIQEKL CVKHLPAWQF DTAAAVRNLS RQFGTHDLSG FGCEDLDVSL GAASALLDYT RLTQGASIGH IKGLRVERED TYLRMDATTR RNLEISETIR GDAAPTLLSL LDTCSTNMGS RLLCHWLHHP LRDRGLIQNR LNGVSFLMGE AGSGPCLSVR DCLKRVTDIE RITARIALKS ARPRDLSGLR DSLKRLPAVN NAVAGTATTS SGGSDVSAHV AALIHSMAPD NALVALLEKS LKEEPEVMLR TGGVIADGYD AELDELRAIH NNCDEFLLQL ETREKARTGI ANLKVEYNRL HGFYIEVTHA HTEKIPDDYR RRQTLKNAER YITPELKAFE EKALSAQSRA LEREKLLYGE LLDMLSQYID HLQQVARSVA ELDVLATFAE RALALDYSLP LFTSDSVIEI QAGRHPVVEK QVDSFIANDV QLGARTGGRR QMLVITGPNM GGKSTYMRQV ALIALLAHCG SFVPARSALI GPLDQLFTRI GASDDLAGGR STFMMEMTEA ANILHNATAQ SLVLMDEVGR GTSTFDGLAL AFAIARYLLE KNRSYTLFAT HYFELTRLAE EFAQVANVHL RAVEHKHHIV FLHAVNEGPA SQSYGLQVAA LAGVPDPVIR TARRYLLKLE QEALSNQPQG DLFSRDDLFW KQDRMPEGSV DKNDSAPEHP VLALLRTIVP DDLSPKQALE QLYGLKKAAE KE
|
| |