Gene Nmul_A0670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0670 
Symbol 
ID3785155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp765489 
End bp768197 
Gene Length2709 bp 
Protein Length902 aa 
Translation table11 
GC content58% 
IMG OID637810752 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_411369 
Protein GI82701803 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.48055 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCAAT CCAGTAAAGC CAGACTTTCA ACCGATCCCA CAGTATTTGA AGCGGTATTG 
AATAACCATA CCCCCATGAT GCAGCAATAC CTGCGCATCA AGGCGCAGCA TCCGGATATG
CTGATGTTTT ACCGGATGGG GGATTTCTAT GAACTGTTCT TTGACGATGC GGAAAAGGCA
GCGAAGCTGC TCGACATCAC CCTGACCCGT CGCGGCACTT CGGCGGGAGA GCCGATCAAG
ATGGCTGGTG TGCCTTACCA TGCGGCGGAA CAGTATCTGG CAAAGCTCGT CAAGCTTGGA
GAATCGGTCG TCATCTGCGA ACAGGTGGGC GATCCCGCCA CTTCGAAAGG ACCGGTAGAA
CGCCAGGTGA CACGCATCAT CACCCCCGGC ACCCTGACCG ATGCTGCGCT CCTGGAGGAC
AAGCGCGACA GCGCCCTGCT TGCCTTGCTC GTGCATGAAT CCACCCTGGG GCTGGCGTGG
CTGAATCTTG CAGCAGGGCA ATTTTCCGTG ATGGAGACTT CGGTGAACAA TCTCACAGCC
GAACTCGAAC GCCTGAAGCC TGCCGAGATT CTTTTGCCGG AATCGCTGAA TCTTGCCGGG
ATCAACGACA GGGTAATACA GGAGAAGTTA TGCGTGAAGC ATTTGCCGGC ATGGCAGTTC
GATACCGCCG CGGCTGTGCG CAATCTCTCC CGGCAGTTCG GTACCCATGA CCTTTCCGGT
TTCGGCTGCG AGGATCTGGA TGTTTCTCTC GGCGCCGCAA GTGCGTTGCT GGATTATACC
CGGCTGACGC AGGGCGCCAG CATAGGTCAT ATCAAGGGGT TGCGGGTTGA GCGGGAGGAT
ACCTATCTGC GCATGGACGC CACCACTCGC CGCAATCTGG AGATCTCCGA AACTATACGA
GGTGACGCGG CGCCCACTTT ATTGTCCCTG CTGGATACCT GTTCGACCAA CATGGGCAGC
CGACTGCTGT GCCACTGGCT GCACCACCCG CTTCGCGACC GCGGGCTGAT CCAGAACCGG
CTCAATGGTG TATCTTTTTT GATGGGGGAA GCAGGATCAG GCCCCTGCCT TTCGGTGCGC
GACTGCTTGA AGCGCGTGAC GGATATCGAG CGCATTACTG CCCGTATCGC CCTGAAATCG
GCACGGCCAC GGGACTTATC CGGGCTGCGC GACAGCCTGA AACGGCTGCC CGCAGTCAAC
AACGCCGTTG CCGGTACCGC TACTACAAGT AGCGGCGGCA GTGACGTAAG CGCGCATGTC
GCGGCGCTCA TCCACTCGAT GGCGCCAGAC AATGCTCTCG TTGCGCTGCT GGAGAAATCG
CTGAAGGAAG AACCGGAGGT GATGCTGCGC ACCGGGGGCG TGATTGCCGA TGGCTACGAT
GCCGAATTGG ATGAACTGCG CGCGATACAC AACAATTGCG ATGAATTCCT GCTGCAACTC
GAAACCCGGG AAAAGGCCCG TACCGGTATT GCGAATCTCA AGGTGGAATA CAACCGTTTG
CACGGTTTTT ACATTGAAGT GACGCATGCG CACACCGAGA AAATCCCCGA CGACTATCGG
CGCAGACAGA CGCTGAAGAA TGCGGAGCGC TACATTACGC CTGAGCTCAA AGCTTTCGAG
GAAAAGGCGC TTTCTGCCCA GAGCCGGGCA CTGGAGCGGG AGAAATTGCT GTATGGCGAG
CTGCTGGATA TGCTCTCCCA ATATATCGAC CATCTGCAGC AGGTTGCACG CAGCGTGGCA
GAACTGGATG TCCTTGCGAC CTTTGCCGAA CGCGCGCTGG CACTTGACTA CAGCCTGCCC
CTTTTTACCA GTGACAGTGT TATCGAAATT CAGGCAGGGC GGCATCCGGT AGTTGAAAAA
CAAGTGGACA GCTTCATCGC CAATGATGTC CAGCTTGGCG CCCGCACGGG TGGCAGACGG
CAGATGCTCG TCATTACCGG GCCCAACATG GGCGGGAAGT CTACCTACAT GCGCCAGGTT
GCCCTGATTG CGCTACTCGC CCATTGCGGG AGTTTTGTTC CCGCGAGAAG CGCGCTTATT
GGACCGCTCG ATCAGCTTTT CACGCGGATC GGCGCATCCG ACGATCTGGC GGGAGGGCGC
TCCACCTTCA TGATGGAAAT GACCGAGGCG GCAAATATCC TGCACAACGC CACGGCGCAA
AGCCTGGTGC TGATGGATGA AGTGGGCCGG GGAACCTCTA CGTTCGATGG ACTGGCGCTC
GCTTTCGCAA TCGCCCGTTA TCTGCTGGAA AAGAACCGTA GCTACACCCT ATTCGCCACA
CATTATTTCG AATTGACGCG GCTTGCGGAG GAGTTTGCAC AGGTCGCCAA TGTGCACCTG
CGCGCGGTGG AGCACAAACA TCATATCGTG TTCCTGCACG CCGTCAACGA GGGGCCGGCC
AGCCAGAGCT ACGGTCTCCA GGTGGCGGCA TTGGCCGGAG TGCCTGATCC GGTAATAAGA
ACAGCGAGAA GATATCTGCT GAAACTCGAG CAGGAAGCGT TGAGCAATCA GCCGCAAGGA
GACTTGTTCT CCAGGGACGA CCTCTTCTGG AAGCAGGACA GGATGCCGGA AGGTTCCGTT
GACAAAAATG ACAGCGCCCC GGAGCATCCC GTACTTGCAC TGTTACGCAC TATCGTTCCC
GACGACTTGA GCCCGAAACA GGCCCTGGAG CAGCTCTACG GCTTGAAGAA GGCGGCAGAG
AAAGAATAG
 
Protein sequence
MSQSSKARLS TDPTVFEAVL NNHTPMMQQY LRIKAQHPDM LMFYRMGDFY ELFFDDAEKA 
AKLLDITLTR RGTSAGEPIK MAGVPYHAAE QYLAKLVKLG ESVVICEQVG DPATSKGPVE
RQVTRIITPG TLTDAALLED KRDSALLALL VHESTLGLAW LNLAAGQFSV METSVNNLTA
ELERLKPAEI LLPESLNLAG INDRVIQEKL CVKHLPAWQF DTAAAVRNLS RQFGTHDLSG
FGCEDLDVSL GAASALLDYT RLTQGASIGH IKGLRVERED TYLRMDATTR RNLEISETIR
GDAAPTLLSL LDTCSTNMGS RLLCHWLHHP LRDRGLIQNR LNGVSFLMGE AGSGPCLSVR
DCLKRVTDIE RITARIALKS ARPRDLSGLR DSLKRLPAVN NAVAGTATTS SGGSDVSAHV
AALIHSMAPD NALVALLEKS LKEEPEVMLR TGGVIADGYD AELDELRAIH NNCDEFLLQL
ETREKARTGI ANLKVEYNRL HGFYIEVTHA HTEKIPDDYR RRQTLKNAER YITPELKAFE
EKALSAQSRA LEREKLLYGE LLDMLSQYID HLQQVARSVA ELDVLATFAE RALALDYSLP
LFTSDSVIEI QAGRHPVVEK QVDSFIANDV QLGARTGGRR QMLVITGPNM GGKSTYMRQV
ALIALLAHCG SFVPARSALI GPLDQLFTRI GASDDLAGGR STFMMEMTEA ANILHNATAQ
SLVLMDEVGR GTSTFDGLAL AFAIARYLLE KNRSYTLFAT HYFELTRLAE EFAQVANVHL
RAVEHKHHIV FLHAVNEGPA SQSYGLQVAA LAGVPDPVIR TARRYLLKLE QEALSNQPQG
DLFSRDDLFW KQDRMPEGSV DKNDSAPEHP VLALLRTIVP DDLSPKQALE QLYGLKKAAE
KE