Gene Nmul_A2282 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2282 
Symbol 
ID3785098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2595348 
End bp2596991 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content54% 
IMG OID637812370 
ProductDNA mismatch repair protein MutS-like 
Protein accessionYP_412966 
Protein GI82703400 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCGC CCTCTGCTCC CAATTCATAC TGGAATCAAA AACCTGTTCT CTCCAGTGGA 
CAGAAGCCCA GCGGTATTCG AGACAGCCGG CCGAAGGAAG ACGACTACAG TGTACTGGAT
GCGAAAACGT TCGATGCAAT CGAAGCCGAT GAGTTATTCG ATTCGGTCAA TCACGCTATT
ACTCATGCGG GGCAATCCGT GCTCTATCGC TCGCTTGCGC GACCAGAGAC AGATGCGGCG
CTGATTCAGA AGAAACAGGA AGCAGTAAGG GAACTTGCGT CAAATTCAAT GCTTCGTGAA
GCGCTTCAGC ACTTCGTCGA TACGATGAAA GGGGGAGAAC GGTCTCTTTA TCATTTACTC
TACGGTACTT TTACCGGGGG AATTGCCATA GACAATTCCA GGGGCGGTAA AGATGAAATG
GAATTCAGCG GTTACGGCTA CCACCAGTTC ATCGATGGCA CGGGATTTGC AATCGACATG
GTCGAGACGA TGGAAAATCT ACCTCAGCCG CAAAGTGCTT ATCTCCGGGA GCTATTTCAG
GCTGTTCGCG ACTTTGGCAA ATCGAGGATT TATTCTCTGA TGCGCGGCCC GGTCTATGTC
TCGGAAGGAA AATTCAAGAC CCGGGAAGAA AAACCGCACT ATCTGCCGCT TTCGCGCTTC
ACGCCCTCCA TGTTCAAGCC GATTCCGGTA TTTTTCGCTG TGGCTGCGCT CGGTGCGGCG
CTTTATTTTT TTCAGGGGTT AATGGCGAGT TTCGGCATCG CTTATCTCGG CTATGGGATA
CTTGTGCTGG CAGTGCCGAT TTTACCCGTT GTTCTGATCG CAATTGCCGC CTCCGATCGC
GATACTGTCA TCTATCCACT GCGCAAGCTG TTCAAGCACA GCCCTGAACT GGCACGATTG
GTGGATGTGC TGGGAATGCT CGACGAACTG CTATCCTTCC ATCGCTATTC GGCAGCCTTC
GGCGGCAAGA TGACTTTGCC GGAAATATCG GAAAGCGAGC GGCACACCCT GGAAGTCAGG
GAAGCGCGGA ATCCCGTACT CGCGAGGACC AACTCCAATT ATGTGCCGAA TGATGTTTCG
CTCGATGATG CCGGCCGCTT GCTGGTCATT ACCGGACCCA ACAGCGGCGG CAAAACTGCG
TATTGCAAGA CAGTCGTCCA GATTCAACTG CTGGGACAGA TAGGTTGCTA TATTCCCGCC
GCTGCCGGGT GGCTGGTGCT GGCTGAGCAT GTTTTCTACC AGGTACCGGA CCCCGGCCAT
CTGGATGAGG GCATGGGGCG CTTTGGCCAC GAGTTGAAAC GCACGCGCGA AATTTTCTTC
AATTCCACGC CGCGCAGTCT CGTCGTGCTG GATGAGCTTT CGGAGGGAAC GACATTCGAA
GAGAAGATGG ATCTCTCGGA GTACGTTCTC GCTGGTTTTT ACAAGCTTGG CGCAAGCACC
TTGCTGGTTA CCCATAACCA CGAGCTATGC GAGCGCTTGC AGGACAAGGG GATAGGCCGC
TATCTCCAGG GTGAATTTTC GCTGCAGGGC CCGACCTATC GACTGATTCC GGGTGTTTCC
CGGGTGAGTC ATGCGGATCG GGTTGCGGCT GCCCTGGGTT TCAGTAAGGA AGATGTGGAA
AAGCACCTGG CCAGCCGCAG CTAG
 
Protein sequence
MNSPSAPNSY WNQKPVLSSG QKPSGIRDSR PKEDDYSVLD AKTFDAIEAD ELFDSVNHAI 
THAGQSVLYR SLARPETDAA LIQKKQEAVR ELASNSMLRE ALQHFVDTMK GGERSLYHLL
YGTFTGGIAI DNSRGGKDEM EFSGYGYHQF IDGTGFAIDM VETMENLPQP QSAYLRELFQ
AVRDFGKSRI YSLMRGPVYV SEGKFKTREE KPHYLPLSRF TPSMFKPIPV FFAVAALGAA
LYFFQGLMAS FGIAYLGYGI LVLAVPILPV VLIAIAASDR DTVIYPLRKL FKHSPELARL
VDVLGMLDEL LSFHRYSAAF GGKMTLPEIS ESERHTLEVR EARNPVLART NSNYVPNDVS
LDDAGRLLVI TGPNSGGKTA YCKTVVQIQL LGQIGCYIPA AAGWLVLAEH VFYQVPDPGH
LDEGMGRFGH ELKRTREIFF NSTPRSLVVL DELSEGTTFE EKMDLSEYVL AGFYKLGAST
LLVTHNHELC ERLQDKGIGR YLQGEFSLQG PTYRLIPGVS RVSHADRVAA ALGFSKEDVE
KHLASRS