Gene Information |
Locus tag | Nmag_1289 |
Symbol | |
ID | 8824121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 1318170 |
End bp | 1320467 |
Gene Length | 2298 bp |
Protein Length | 765 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | DNA mismatch repair protein MutL |
Protein accession | YP_003479431 |
Protein GI | 289580965 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
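The coordinate and length fields in the record above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the listed gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information fields above.
start_bp = 1318170
end_bp = 1320467
gene_length_bp = 2298
protein_length_aa = 765


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # True True
```

Under these assumptions the record is internally consistent: the span covers 2298 bp, which is 766 codons, that is 765 residues plus one stop codon.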
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGACA CACAACCACC GACCGACGAA ACCGAGATTC ACCAGTTAGA CGAGGACACC GTCGCCCGCA TCGCCGCCGG CGAGGTCGTC GAGCGGCCCG CAAGCGCCGT CAAGGAACTC GTCGAGAATA GCCTCGACGC CGGGGCGTCG AGTATCGACG TCACCGTCGA GGCCGGCGGC ACCGACCTCG TCCGCGTCGC GGACGACGGC CACGGCATGA CTGAAGCCGA CCTTCGCGCT GCCGTCCGCC AGCACACGAC GAGCAAGATC AGCGGCCTCG ACGACCTCGA GTCGGGTGTT GCCACCCTCG GCTTTCGCGG CGAAGCCCTG CACACTATCG GCTCCGTCTC GCGCCTGACC ATCCAGTCGC GCCCGCAGGA CGGCGACGGC GCGGGCACCG AACTCGTCTA CGAGGGCGGC ACCGTCGAAT CGGTCTCGCC GACCGGCTGT CCCGCGGGAA CGACAGTCGA GGTCGCGGAT CTCTTCTACA ACACACCTGC CCGACGAAAG TTCCTCAAGA CGACGGCGAC CGAGTTTGCC CACGTCAACC GCGTCGTCAC CCGCTACGCG CTCGCCAACC CCGAGGTGGC CGTCTCGCTG ACCCACGACG GCCGCGAGGT GTTCTCGACG ACCGGCCAGG GCGACCTTCA GGCCGCCGTG CTCGCCGTCT ACGGCCGCGA GGTCGCCTCC GCGATGATCC CCGTCGACGC CGACGGCGAG GAACTGCCGC CGGGGCCACT CGAGTCCGTC GCTGGTCTCG TCTCCCATCC CGAGACGAAT CGCGCGAGCC GAGAGTATCT GGCGACGTAC GTCAACGACC GTGCGGTGAC CTCGGACGCG CTTCGAGAAG GAATCATGGG TGCCTACGGG ACGCAACTCG GCGGAGACCG CTACCCCTTC GTCGTGCTCT TTCACGAGGT CCCAGGCGAC GCCGTGGACG TGAACGTCCA CCCGCGAAAG CGGGAGGTCC GCTTCGACGA CGACGATGCA GTGCGCCGGC AGGTCGATTC GGCGGTCGAG AGCGCCCTCC TCGAGCATGG CCTGCTTCGC TCGCGAGCGC CGCGCGGTCG CTCGGCACCG GGAGAGGCGC AGGTGACGCC GACACAGGAG GAACTCGCGG AGCGATCAGG TGCTGGAACG GAGACGAGTG CAGCGGAACC GAGTGCAGCG GAACCGAGTA CGTCCGAAGG CGACGCCGAC GAGTCGGGGG TACTCGAGAG CGGTGACACA GCGAGCGGGA CGAACGAGGA GACAGCCACG AGCGCTGCCG GTGGCGATCG GAAACCGACG ACGCCGGATC GAGCGGAAGA TCCGAAATCG GCAGAGTCGG TGGACGCGAC GAAGTCGCAA GCGAGTGAGT CGACGGAATC GGCTGACGCC GAGTTCCAGT CGGCTGCAGC GTTCCGTCCA GATGACGATT CGACGAGTGA CATCTCCGCC GCATCGTCCA GTTCGCCGAG TGGTGTGGGC GGCCAGGCCG ATCCAGATCC AGATACAGAT ACAGAAACGA ATACGGATAC GGATTCGGCT CCAGACCCGA AACCAAATCC GGAATCCGCG GCCGCCTCCT CGATGGAGAC GCCGGGAGCC ACTGAGACGA AGGATGGAAA CAGCAAGTTC GACGCCACAA CGGAACAACG GACGCTCGCA GGCGACGCTG CGACAGGCGG AGACTACGAC CACGAGTTCG ACTCACTGCC ACCCCTACGG GTACTCGGAC AGCTCAGCGA CACCTACCTC GTCTGCGAAA CCGACGACGG TCTCGCACTG ATCGACCAGC ACGCGGCCGA CGAACGGGTC 
AACTACGAGC GCCTGCGTAC TGCGTTCGAC GAGGACTCGT CCGCGCAGGC GCTGGCCTCG CCGGTCGAAC TCGAGTTGAC CGCCGCCGAA GCGGAGGCGT TCGCGGCCTA CCGCGATGCG CTCGCCCAAC TCGGATTTTA CGCGGATCGG GTCGACGACC GGACGGTGGC GGTGACGACA GTGCCGGCGG TGTTCGAGAA GACGCTCGAT CCGGAGCAGC TTCGAGACGT GCTCGTCTCG TTCGTCGAGG GCGACCGCGA GGCAGGGGCG GAGACGGTCG ACGCGCTGGC GGACGAGTTT ATCGGCGACC TGGCGTGTTA TCCCTCTATT ACGGGCAACA CGTCGCTGAC GGAGGGCTCC GTCGTCGACC TGCTCGCAGC GCTCGACGAC TGTGAGAACC CGTACGCTTG CCCGCACGGG CGACCGGTGG TCGTACAGTT CGACGAGGCC GAAATCGAGG ATCGGTTCGA GCGAGATTAT CCGGGCCACA GCGGCTGA
|
Protein sequence | MTDTQPPTDE TEIHQLDEDT VARIAAGEVV ERPASAVKEL VENSLDAGAS SIDVTVEAGG TDLVRVADDG HGMTEADLRA AVRQHTTSKI SGLDDLESGV ATLGFRGEAL HTIGSVSRLT IQSRPQDGDG AGTELVYEGG TVESVSPTGC PAGTTVEVAD LFYNTPARRK FLKTTATEFA HVNRVVTRYA LANPEVAVSL THDGREVFST TGQGDLQAAV LAVYGREVAS AMIPVDADGE ELPPGPLESV AGLVSHPETN RASREYLATY VNDRAVTSDA LREGIMGAYG TQLGGDRYPF VVLFHEVPGD AVDVNVHPRK REVRFDDDDA VRRQVDSAVE SALLEHGLLR SRAPRGRSAP GEAQVTPTQE ELAERSGAGT ETSAAEPSAA EPSTSEGDAD ESGVLESGDT ASGTNEETAT SAAGGDRKPT TPDRAEDPKS AESVDATKSQ ASESTESADA EFQSAAAFRP DDDSTSDISA ASSSSPSGVG GQADPDPDTD TETNTDTDSA PDPKPNPESA AASSMETPGA TETKDGNSKF DATTEQRTLA GDAATGGDYD HEFDSLPPLR VLGQLSDTYL VCETDDGLAL IDQHAADERV NYERLRTAFD EDSSAQALAS PVELELTAAE AEAFAAYRDA LAQLGFYADR VDDRTVAVTT VPAVFEKTLD PEQLRDVLVS FVEGDREAGA ETVDALADEF IGDLACYPSI TGNTSLTEGS VVDLLAALDD CENPYACPHG RPVVVQFDEA EIEDRFERDY PGHSG
|
| |
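The sequences above are printed in blocks of ten characters, so the spaces have to be removed before any computation. The following is a rough sketch of how the GC content and the translation could be recomputed from the gene sequence. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard code (the two tables differ in their permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical. Only the first and last few blocks of the gene sequence are used as input, to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order (assumed valid for table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three and last two blocks of the gene sequence listed above.
gene_start = "ATGACCGACA CACAACCACC GACCGACGAA"
gene_end = "CCGGGCCACA GCGGCTGA"

print(translate(gene_start))  # MTDTQPPTDE
print(translate(gene_end))    # PGHSG
```

Both fragments agree with the corresponding ends of the listed protein sequence. Passing the full gene sequence to `gc_content` would give the value to compare against the GC content field of the record.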