Gene Nmag_0330 details

Gene Information

Locus tag: Nmag_0330
Symbol:
ID: 8823151
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 319912
End bp: 322665
Gene Length: 2754 bp
Protein Length: 917 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: DNA mismatch repair protein MutS
Protein accession: YP_003478482
Protein GI: 289580016
COG category:
COG ID:
TIGRFAM ID:
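
The start, end, and length fields above are tied together by simple arithmetic, which the following sketch checks. The function names are illustrative only, and the sketch assumes 1-based inclusive coordinates and a single terminal stop codon; the record itself does not state either convention.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(319912, 322665)
print(length)                  # 2754
print(protein_length(length))  # 917
```

Both results agree with the Gene Length and Protein Length values listed in the record.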


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACCCGG CGCTTGGCCC GCCTGACGCG ATGGCCGAGA AACGCGACGA GCTGACGCCA 
ATGATGCGTC AGTACCACGA CCTCTGTGCC CGCTACGACG ACGCGATCGT CCTCTTTCAG
GTCGGTGACT TCTACGAGAC CTTCTGCGGT GCCGCCGAGC GCAGCGCTCG CCTCCTCGAG
ATCGCCCTGA CTAGCCGCGA GGACAGCACC GGCGAGTATC CGATGGCCGG CATTCCGATC
GACAACGCCG AATCATACAT CGAGGACCTG CTGGAGGCCG GCTACCGCGT CGCCGTCGCG
GACCAGGTCG AAGAGCCCGG CGAAACCTCG GGGGTCGTCG AACGCGCCGT CACTCGCGTC
ATCACACCCG GCACACTCAC CGAGGACGAA CTCCTCGCGA GCGACGACAA CAACTTCGTC
GCGGCCGTCG CTCGCGGCCG CGGCATCGAC ACGCGACCCG CAGATGACGA ACTCGCCCTC
GCTCTCCTCG ACGTCTCGAC GGGCGACTTT CTGGCGACGA GTTCCGCCGC CAACGAGGCG
GTCGCGGACG AAGTGAGTCG CTTCGCACCC GCCGAAGCCG TCGTCGGCCC GAACGCACCC
GCTGACGTAC TCCCAGACGA CTGCATGGTG ACGCCGTTCG ACGAGCACGT TTTCGACCGC
GAGCGCTCCG CATCGATCCT TTCCGAATAC TTCGGTGAGC CAGACGCGCT GCTCGCGAGC
GACGCCGAAA TTCGAGCCTG CGGCGCGTTG CTCGCCTACG CCGAGTACGC TCGCGGCGGC
GAGCACGAGG GCGAGAAAGG GGATGGGAAT GGGGATGAGG AGAGGGATGG CGAGAGCGAC
GACACCAGCA AAGCCGGGCG TTCCCACAAG ATCGGCGAGA CCGACCGACT CGAGTACCTC
ACGCATCTCA CCCGCTACGA TCCCCGGGAG TACCTCCTGC TCGACGCCGT CGCGCTGCGC
AGTCTCGAAC TGTTCGAGCC CCGCGCCGTC AACGGCCGCG ACGACGCGAC GCTCGTCGGC
GTCCTCGACG AGACCGCCTG TGCCCTGGGC GGTCGGACGC TTCGTGACTG GCTTCGCCGG
CCGCTACTCG AGTCCCACCG AATCGAGGCG CGCCTCGACG CGGTAGAAGA GCTCACGGGG
TCAGTTCAGA CGCGCGAGCA CTGCCACGAG TTGTTGCGCG ACGTGTACGA TCTGGAGCGT
CTGATTGGGC GCATCTCCCG TGAGCGAGCG AACGCGCGGG ATCTGCGCTC GCTTCGGGAC
ACGCTCGCTG TCGTGCCCGA GATACGATCC CAACTGGCTG ACGCGGACTG CGACCGCCTG
CGCGACCTCC ACGAGGATCT CGACCCGCTC GCCGACGTGC GGGAGCTGAT CGACGACGCC
GTCGTCACGG ATCCGCCGAT CGAGATTACC GAGGGCGGTA TCATCGCCGA GGGGTACGAT
TCCGATCTGG ACGACCTCCG CGGAACTGCG CGGGACGGCA AGCAGTGGAT CGACGATCTG
GAAGCCCGGG AGCGCGAGCG AACCGGTATC GATTCGCTGA AGGTCGGCTA CAACTCCGTC
CACGGCTACT ATATCGAGGT GACGAACCCC AATCTCGACG CCGTACCGGA GAACTACCAG
CGCCGCCAGA CGCTGAAGAA CTCGGAGCGG TTCGTCACGC CGGAACTCAA GGAACGCGAG
GACGAAATCG TCGGCGCGGA AGAGCGCGCG GACGAACGCG AGTACGAACT CTTCTGTGCG
GTTCGCAGTG ACATCGGCGA CGAAGTCGAA CGTGTGCAGG GGCTGGCCGA CGCGCTCGCG
ACGCTCGACG CGCTCGTCTC GCTTGCGACC GTCGCAGCCC AGTACGACTA CTGCCGGCCC
GAGATTCTCG ACCCAGACGC AGACCACATC GACGGCGGAG TGCAGATCGA CATCACGGGT
GGCCGCCACC CTGTCGTCGA GCGCACCCAG GAGTCGTTCG TCCCGAACGG CGCACAGTTC
GATTCGGAGC AGCGCCTGGC GGTGATCACG GGCCCCAACA TGTCCGGAAA GTCGACCTAC
ATGCGTCAGG TCGCCCAGCT CGTCCTGCTC GCACAGGTCG GTAGCTTCGT CCCTGCGGAG
TCGGCACGGC TCACGCCCGT CGAGCGGGTC TTTACCCGCG TCGGCGCGAG CGACGATATC
GCCGGCGGGC GCTCGACGTT CATGGTCGAG ATGGACGAAC TCGCGACCAT CCTTCGGGAC
GCTGACGAGC GCTCGCTCGT CCTGCTGGAC GAGGTCGGTC GAGGAACGTC GACCGCGGAC
GGCCTCGCCA TCGCGCAGGC CATTACCGAA CACCTCCACG ACGCGGTCGG CGCGACGACG
CTCTTTGCGA CCCACCACCA TCCGCTGACT GAACTCGCCG AGGAACTCCC GAACGCGTTC
ACGCTCCACT TCGAGGTGGA ACAGACGGAC GGCGAGGTCG TCTTCCACCA CGAGATCGAA
CCCGGCGCAG CCACCGGCTC CTACGGTGTC GAAGTCGCGA CCGCAGCAGG CGTTCCCGAG
GCTGTCGTCG ACCGGTCGCG AGAACTGGTC GACGCCAACG CTGCCGCGGA GACCACAGCC
GAAACGGAGC CGACAGCCGA CGGCGGCACC AGCGAACTCG AGTCGACGGA GTCAGACCCA
GTCCCCGACA GCGTCGCCGC TGCACTCCGC GACCTCAACG TCGGGCACAT GACTCCCGTC
GAGGCGCTCA CCGAACTCGA CCGACTGCAA CGACTCCTCG AGGAGGAGTC GTAG
 
Protein sequence
MDPALGPPDA MAEKRDELTP MMRQYHDLCA RYDDAIVLFQ VGDFYETFCG AAERSARLLE 
IALTSREDST GEYPMAGIPI DNAESYIEDL LEAGYRVAVA DQVEEPGETS GVVERAVTRV
ITPGTLTEDE LLASDDNNFV AAVARGRGID TRPADDELAL ALLDVSTGDF LATSSAANEA
VADEVSRFAP AEAVVGPNAP ADVLPDDCMV TPFDEHVFDR ERSASILSEY FGEPDALLAS
DAEIRACGAL LAYAEYARGG EHEGEKGDGN GDEERDGESD DTSKAGRSHK IGETDRLEYL
THLTRYDPRE YLLLDAVALR SLELFEPRAV NGRDDATLVG VLDETACALG GRTLRDWLRR
PLLESHRIEA RLDAVEELTG SVQTREHCHE LLRDVYDLER LIGRISRERA NARDLRSLRD
TLAVVPEIRS QLADADCDRL RDLHEDLDPL ADVRELIDDA VVTDPPIEIT EGGIIAEGYD
SDLDDLRGTA RDGKQWIDDL EARERERTGI DSLKVGYNSV HGYYIEVTNP NLDAVPENYQ
RRQTLKNSER FVTPELKERE DEIVGAEERA DEREYELFCA VRSDIGDEVE RVQGLADALA
TLDALVSLAT VAAQYDYCRP EILDPDADHI DGGVQIDITG GRHPVVERTQ ESFVPNGAQF
DSEQRLAVIT GPNMSGKSTY MRQVAQLVLL AQVGSFVPAE SARLTPVERV FTRVGASDDI
AGGRSTFMVE MDELATILRD ADERSLVLLD EVGRGTSTAD GLAIAQAITE HLHDAVGATT
LFATHHHPLT ELAEELPNAF TLHFEVEQTD GEVVFHHEIE PGAATGSYGV EVATAAGVPE
AVVDRSRELV DANAAAETTA ETEPTADGGT SELESTESDP VPDSVAAALR DLNVGHMTPV
EALTELDRLQ RLLEEES
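
The sequences above are printed in blocks of ten characters, six blocks per line. As a rough sketch, a listing in this layout could be joined back into one contiguous string and its GC fraction computed as shown below. The helper names are hypothetical, and the example uses only the first line of the gene sequence rather than the full record.

```python
def join_blocks(listing: str) -> str:
    """Remove all whitespace from a block-formatted listing and uppercase it."""
    return "".join(listing.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing (excerpt only, not the whole gene).
excerpt = """
ATGGACCCGG CGCTTGGCCC GCCTGACGCG ATGGCCGAGA AACGCGACGA GCTGACGCCA
"""
seq = join_blocks(excerpt)
print(len(seq))               # 60
print(seq.startswith("ATG"))  # True
print(f"GC fraction of excerpt: {gc_fraction(seq):.0%}")
```

Applied to the complete gene sequence, the same two helpers would give a value to compare against the GC content field in the Gene Information section.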