Gene Nmag_1289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1289 
Symbol 
ID8824121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1318170 
End bp1320467 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content67% 
IMG OID 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_003479431 
Protein GI289580965 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGACA CACAACCACC GACCGACGAA ACCGAGATTC ACCAGTTAGA CGAGGACACC 
GTCGCCCGCA TCGCCGCCGG CGAGGTCGTC GAGCGGCCCG CAAGCGCCGT CAAGGAACTC
GTCGAGAATA GCCTCGACGC CGGGGCGTCG AGTATCGACG TCACCGTCGA GGCCGGCGGC
ACCGACCTCG TCCGCGTCGC GGACGACGGC CACGGCATGA CTGAAGCCGA CCTTCGCGCT
GCCGTCCGCC AGCACACGAC GAGCAAGATC AGCGGCCTCG ACGACCTCGA GTCGGGTGTT
GCCACCCTCG GCTTTCGCGG CGAAGCCCTG CACACTATCG GCTCCGTCTC GCGCCTGACC
ATCCAGTCGC GCCCGCAGGA CGGCGACGGC GCGGGCACCG AACTCGTCTA CGAGGGCGGC
ACCGTCGAAT CGGTCTCGCC GACCGGCTGT CCCGCGGGAA CGACAGTCGA GGTCGCGGAT
CTCTTCTACA ACACACCTGC CCGACGAAAG TTCCTCAAGA CGACGGCGAC CGAGTTTGCC
CACGTCAACC GCGTCGTCAC CCGCTACGCG CTCGCCAACC CCGAGGTGGC CGTCTCGCTG
ACCCACGACG GCCGCGAGGT GTTCTCGACG ACCGGCCAGG GCGACCTTCA GGCCGCCGTG
CTCGCCGTCT ACGGCCGCGA GGTCGCCTCC GCGATGATCC CCGTCGACGC CGACGGCGAG
GAACTGCCGC CGGGGCCACT CGAGTCCGTC GCTGGTCTCG TCTCCCATCC CGAGACGAAT
CGCGCGAGCC GAGAGTATCT GGCGACGTAC GTCAACGACC GTGCGGTGAC CTCGGACGCG
CTTCGAGAAG GAATCATGGG TGCCTACGGG ACGCAACTCG GCGGAGACCG CTACCCCTTC
GTCGTGCTCT TTCACGAGGT CCCAGGCGAC GCCGTGGACG TGAACGTCCA CCCGCGAAAG
CGGGAGGTCC GCTTCGACGA CGACGATGCA GTGCGCCGGC AGGTCGATTC GGCGGTCGAG
AGCGCCCTCC TCGAGCATGG CCTGCTTCGC TCGCGAGCGC CGCGCGGTCG CTCGGCACCG
GGAGAGGCGC AGGTGACGCC GACACAGGAG GAACTCGCGG AGCGATCAGG TGCTGGAACG
GAGACGAGTG CAGCGGAACC GAGTGCAGCG GAACCGAGTA CGTCCGAAGG CGACGCCGAC
GAGTCGGGGG TACTCGAGAG CGGTGACACA GCGAGCGGGA CGAACGAGGA GACAGCCACG
AGCGCTGCCG GTGGCGATCG GAAACCGACG ACGCCGGATC GAGCGGAAGA TCCGAAATCG
GCAGAGTCGG TGGACGCGAC GAAGTCGCAA GCGAGTGAGT CGACGGAATC GGCTGACGCC
GAGTTCCAGT CGGCTGCAGC GTTCCGTCCA GATGACGATT CGACGAGTGA CATCTCCGCC
GCATCGTCCA GTTCGCCGAG TGGTGTGGGC GGCCAGGCCG ATCCAGATCC AGATACAGAT
ACAGAAACGA ATACGGATAC GGATTCGGCT CCAGACCCGA AACCAAATCC GGAATCCGCG
GCCGCCTCCT CGATGGAGAC GCCGGGAGCC ACTGAGACGA AGGATGGAAA CAGCAAGTTC
GACGCCACAA CGGAACAACG GACGCTCGCA GGCGACGCTG CGACAGGCGG AGACTACGAC
CACGAGTTCG ACTCACTGCC ACCCCTACGG GTACTCGGAC AGCTCAGCGA CACCTACCTC
GTCTGCGAAA CCGACGACGG TCTCGCACTG ATCGACCAGC ACGCGGCCGA CGAACGGGTC
AACTACGAGC GCCTGCGTAC TGCGTTCGAC GAGGACTCGT CCGCGCAGGC GCTGGCCTCG
CCGGTCGAAC TCGAGTTGAC CGCCGCCGAA GCGGAGGCGT TCGCGGCCTA CCGCGATGCG
CTCGCCCAAC TCGGATTTTA CGCGGATCGG GTCGACGACC GGACGGTGGC GGTGACGACA
GTGCCGGCGG TGTTCGAGAA GACGCTCGAT CCGGAGCAGC TTCGAGACGT GCTCGTCTCG
TTCGTCGAGG GCGACCGCGA GGCAGGGGCG GAGACGGTCG ACGCGCTGGC GGACGAGTTT
ATCGGCGACC TGGCGTGTTA TCCCTCTATT ACGGGCAACA CGTCGCTGAC GGAGGGCTCC
GTCGTCGACC TGCTCGCAGC GCTCGACGAC TGTGAGAACC CGTACGCTTG CCCGCACGGG
CGACCGGTGG TCGTACAGTT CGACGAGGCC GAAATCGAGG ATCGGTTCGA GCGAGATTAT
CCGGGCCACA GCGGCTGA
 
Protein sequence
MTDTQPPTDE TEIHQLDEDT VARIAAGEVV ERPASAVKEL VENSLDAGAS SIDVTVEAGG 
TDLVRVADDG HGMTEADLRA AVRQHTTSKI SGLDDLESGV ATLGFRGEAL HTIGSVSRLT
IQSRPQDGDG AGTELVYEGG TVESVSPTGC PAGTTVEVAD LFYNTPARRK FLKTTATEFA
HVNRVVTRYA LANPEVAVSL THDGREVFST TGQGDLQAAV LAVYGREVAS AMIPVDADGE
ELPPGPLESV AGLVSHPETN RASREYLATY VNDRAVTSDA LREGIMGAYG TQLGGDRYPF
VVLFHEVPGD AVDVNVHPRK REVRFDDDDA VRRQVDSAVE SALLEHGLLR SRAPRGRSAP
GEAQVTPTQE ELAERSGAGT ETSAAEPSAA EPSTSEGDAD ESGVLESGDT ASGTNEETAT
SAAGGDRKPT TPDRAEDPKS AESVDATKSQ ASESTESADA EFQSAAAFRP DDDSTSDISA
ASSSSPSGVG GQADPDPDTD TETNTDTDSA PDPKPNPESA AASSMETPGA TETKDGNSKF
DATTEQRTLA GDAATGGDYD HEFDSLPPLR VLGQLSDTYL VCETDDGLAL IDQHAADERV
NYERLRTAFD EDSSAQALAS PVELELTAAE AEAFAAYRDA LAQLGFYADR VDDRTVAVTT
VPAVFEKTLD PEQLRDVLVS FVEGDREAGA ETVDALADEF IGDLACYPSI TGNTSLTEGS
VVDLLAALDD CENPYACPHG RPVVVQFDEA EIEDRFERDY PGHSG