Gene Information |
Locus tag | Nmul_A2282 |
Symbol | |
ID | 3785098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2595348 |
End bp | 2596991 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637812370 |
Product | DNA mismatch repair protein MutS-like |
Protein accession | YP_412966 |
Protein GI | 82703400 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
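The coordinate, gene length, and protein length fields above are arithmetically related, and the short sketch below checks that they agree. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration only; they are not part of any database API.

```python
# Values copied from the Start bp and End bp fields of the record above.
START_BP = 2595348
END_BP = 2596991


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1644, matching "Gene Length"
print(protein_length(length_bp))  # 547, matching "Protein Length"
```

Under these assumptions both computed values match the record. Because the gene is not spliced, no intron lengths need to be subtracted before dividing by three.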
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTCGC CCTCTGCTCC CAATTCATAC TGGAATCAAA AACCTGTTCT CTCCAGTGGA CAGAAGCCCA GCGGTATTCG AGACAGCCGG CCGAAGGAAG ACGACTACAG TGTACTGGAT GCGAAAACGT TCGATGCAAT CGAAGCCGAT GAGTTATTCG ATTCGGTCAA TCACGCTATT ACTCATGCGG GGCAATCCGT GCTCTATCGC TCGCTTGCGC GACCAGAGAC AGATGCGGCG CTGATTCAGA AGAAACAGGA AGCAGTAAGG GAACTTGCGT CAAATTCAAT GCTTCGTGAA GCGCTTCAGC ACTTCGTCGA TACGATGAAA GGGGGAGAAC GGTCTCTTTA TCATTTACTC TACGGTACTT TTACCGGGGG AATTGCCATA GACAATTCCA GGGGCGGTAA AGATGAAATG GAATTCAGCG GTTACGGCTA CCACCAGTTC ATCGATGGCA CGGGATTTGC AATCGACATG GTCGAGACGA TGGAAAATCT ACCTCAGCCG CAAAGTGCTT ATCTCCGGGA GCTATTTCAG GCTGTTCGCG ACTTTGGCAA ATCGAGGATT TATTCTCTGA TGCGCGGCCC GGTCTATGTC TCGGAAGGAA AATTCAAGAC CCGGGAAGAA AAACCGCACT ATCTGCCGCT TTCGCGCTTC ACGCCCTCCA TGTTCAAGCC GATTCCGGTA TTTTTCGCTG TGGCTGCGCT CGGTGCGGCG CTTTATTTTT TTCAGGGGTT AATGGCGAGT TTCGGCATCG CTTATCTCGG CTATGGGATA CTTGTGCTGG CAGTGCCGAT TTTACCCGTT GTTCTGATCG CAATTGCCGC CTCCGATCGC GATACTGTCA TCTATCCACT GCGCAAGCTG TTCAAGCACA GCCCTGAACT GGCACGATTG GTGGATGTGC TGGGAATGCT CGACGAACTG CTATCCTTCC ATCGCTATTC GGCAGCCTTC GGCGGCAAGA TGACTTTGCC GGAAATATCG GAAAGCGAGC GGCACACCCT GGAAGTCAGG GAAGCGCGGA ATCCCGTACT CGCGAGGACC AACTCCAATT ATGTGCCGAA TGATGTTTCG CTCGATGATG CCGGCCGCTT GCTGGTCATT ACCGGACCCA ACAGCGGCGG CAAAACTGCG TATTGCAAGA CAGTCGTCCA GATTCAACTG CTGGGACAGA TAGGTTGCTA TATTCCCGCC GCTGCCGGGT GGCTGGTGCT GGCTGAGCAT GTTTTCTACC AGGTACCGGA CCCCGGCCAT CTGGATGAGG GCATGGGGCG CTTTGGCCAC GAGTTGAAAC GCACGCGCGA AATTTTCTTC AATTCCACGC CGCGCAGTCT CGTCGTGCTG GATGAGCTTT CGGAGGGAAC GACATTCGAA GAGAAGATGG ATCTCTCGGA GTACGTTCTC GCTGGTTTTT ACAAGCTTGG CGCAAGCACC TTGCTGGTTA CCCATAACCA CGAGCTATGC GAGCGCTTGC AGGACAAGGG GATAGGCCGC TATCTCCAGG GTGAATTTTC GCTGCAGGGC CCGACCTATC GACTGATTCC GGGTGTTTCC CGGGTGAGTC ATGCGGATCG GGTTGCGGCT GCCCTGGGTT TCAGTAAGGA AGATGTGGAA AAGCACCTGG CCAGCCGCAG CTAG
|
Protein sequence | MNSPSAPNSY WNQKPVLSSG QKPSGIRDSR PKEDDYSVLD AKTFDAIEAD ELFDSVNHAI THAGQSVLYR SLARPETDAA LIQKKQEAVR ELASNSMLRE ALQHFVDTMK GGERSLYHLL YGTFTGGIAI DNSRGGKDEM EFSGYGYHQF IDGTGFAIDM VETMENLPQP QSAYLRELFQ AVRDFGKSRI YSLMRGPVYV SEGKFKTREE KPHYLPLSRF TPSMFKPIPV FFAVAALGAA LYFFQGLMAS FGIAYLGYGI LVLAVPILPV VLIAIAASDR DTVIYPLRKL FKHSPELARL VDVLGMLDEL LSFHRYSAAF GGKMTLPEIS ESERHTLEVR EARNPVLART NSNYVPNDVS LDDAGRLLVI TGPNSGGKTA YCKTVVQIQL LGQIGCYIPA AAGWLVLAEH VFYQVPDPGH LDEGMGRFGH ELKRTREIFF NSTPRSLVVL DELSEGTTFE EKMDLSEYVL AGFYKLGAST LLVTHNHELC ERLQDKGIGR YLQGEFSLQG PTYRLIPGVS RVSHADRVAA ALGFSKEDVE KHLASRS