Gene Information |
Locus tag | Nmag_1284 |
Symbol | |
ID | 8824116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 1311389 |
End bp | 1314079 |
Gene Length | 2691 bp |
Protein Length | 896 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_003479426 |
Protein GI | 289580960 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
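The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based, inclusive coordinates and a terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Nmag_1284.
length = gene_length_bp(1311389, 1314079)
print(length)                     # 2691
print(protein_length_aa(length))  # 896
```

Both results match the "Gene Length" and "Protein Length" rows.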
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAGCC AGGCTCCGAG TATGACAGAG GCGACGGGCA TCGTCGGGGA GTTCTTCTCG CTCAAGGAGG GCACCGACGC GGAGTTGCTG GCGATGCAGT GTGGCGATTT CTACGAGTTC TTCGGCGAGG ACGCGGAAAC GGTCAGCGAC GAACTCGATC TCAAAGTATC CCAGAAGTCC TCACACGGCT CGTCGTACCC GATGGCCGGC GTGCCATTGG ACGACCTGAC GCCGTATCTC AAGGCCCTGG TCGAGCGCGG CTACCGGGTC GCCGTCGCCG ACCAGTACGA AACCGACTCC GGCCACGCGC GAGAGATTGT CCGCGTCGTG ACGCCCGGGA CGCTACTCGA GACGAGTGAT GCAGACGCGC AGTATTTGGC GGCGGTAGTG GACGGTGGCT CTGGCTCGAG TAGCGGGTCG ACGGACGCGC GCTACGGCCT CGCCTTCGCC GACGTGACCA CCGGCCGCTT CCTCGTCGCC GAAGCCGCAG ACACCGACGA GGCGCTGACG GAACTCTACC GGTTCGATCC CGTCGAGGTA CTTCCCGGGC CCGAGAGCAG AACTGACGAC GACCTGCTCG GAACCGTTCG GGAGCGCATC GACGCAACCC TCACGCTCCA CGAGACGGAG GCGTTCGCAC CGAAGCGCGC CGACCACGCA GTCCAGGAAC AGTTCGGCAG CGAGACCGTC GATCGACTGG CGGTCGGCGA GGCAGCCATC GCCGCCGCGG GCGCGATCCT TTCCTACGTC GAGGAGACCG GCGCGGGCGT ACTCGCCTCG ATGACACGGA TCCAGTCCCA CCACGGCGAC GATCACGTCA CCCTCGACGC GACGACCCAG CGCAACCTCG AGCTGACCGA AACGATGCAG GGCGAACGCG ACGGCTCGCT GTTCGCGACG ATCGACCACA CGGAGACCAG CGCCGGCGGG CGGCTCCTCA AGGAGTGGCT CCAGCGCCCT CGTCGCGCGC TCGACACGCT CGAACAGCGC CAGGAGAGCG TCGCCGCGCT CGCCTCGGCC GCGCTCGCTC GCGACGAGAT GCAGGACACG CTCGGTGAGG CCTACGACCT CGCGCGGCTG GCCTCGAAGG CAACCCACGG CAGCGCAGAC GCTCGCGACC TCGTCGCCGT GCGCGAGACG CTCGCCGTGC TGCCGGCGCT CGCGGAGACA ATTGAATCCG CGCCAGACCT GGCTGATTCG CCGCTCGCCG AAATCATCGA CCGGCCGGAC CGCGAGACCG CACGCGAGCT TCGGGAGGCA CTCGAGAACG CCATCGCTTC GGACCCCCCG TCGACGGTGA CCCAGGGTGA ACTCTTCCAG TACGGCTACG ACGATGACCT GGACGAGGTC ATCGACAGTC ACGAGGAGGT CAAGCAGTGG CTCGACACGC TCGCCGAGCG AGAGAAGCGC CAGTACTCCC TCTCGCACGT CACCGTCGAC CGGAACAAGA CCGACGGCTA CTACATCCAG GTCGGCAAGT CCGCCGCCGA CGGCGTCCCC GACCATTACG AGCAGATCAA GACGCTCAAG AACTCGAAGC GGTTCACGAC CGACGAACTC GCGGAGAAGG AACGCGACGT GCTCCGACTC GAGGAGAAAC GAGGGGAACT CGAGTACGAA CTCTTCGAGG AGCTCCGGGA AGAGGTCGCC GAGCAGGCGG AACTCCTGCA GGACGTGGGC CGGGCGCTCG CGACGGTCGA CGCGCTGGCG AGTCTGGCGA CCCACGCGGC CGAGAACCGG TGGGTCAAGC CCGACCTGCA CCACGGTGAC GCGCTCGACA TCGACCAGGG TCGCCACCCC 
GTCGTCGAGC AGACGACCGA GTTCGTTCCG AACGATGTCC GACTAGACGG CGAGCAGCGA GGCTTCCTGG TCGTTACCGG CCCCAACATG TCCGGAAAGT CGACCTACAT GCGCCAGGTC GCCTGTATCG TTCTGCTGGC CCAGATCGGG AGTTTTGTCC CGGCCAAGGA GGCCGAGATC GGCCTCGTCG ACGGTATCTT CACCCGCGTC GGTGCGCTCG ACGAACTCGC ACAGGGCCGG TCGACGTTCA TGGTCGAGAT GAGCGAACTC TCGAACATCC TCCACGCGGC GACCGACGAG TCGCTGGTCA TTCTGGACGA AGTCGGCCGC GGCACGGCGA CGTACGATGG CATCTCGATC GCCTGGGCGG CCACCGAGTA CCTCCACAAC GAAGTCGCAG CGAAGACGCT GTTCGCAACG CACTACCACG AACTGACCGG GCTCGCGGAG AACCTGCCGC GCGTCGCCAA CGTCCACATT GCAGCGGACG AACGCGACGG CGACGTGACC TTCCTCCGAA CCGTCCGGGA CGGTCCGACT GACCGCTCGT ACGGGATCCA CGTCGCCGAT CTGGCGGGCG TCCCCGGCCC CGTCGTCGAC CGCGCGCAGG ACGTCCTCGA GCGCCTCCGC GAGGAGAAGG CTATCGAGGC GAAGGGAGGA CACACGACCG AGCCGGTCCA GACGGTGTTC GATGTGGGAA GCGGCCAGTT CCGCGGGCCG GCGAACGCGG ATGGCGGTGA ACCGGACGAT GCGCCGGACG AGAGTGAGCG TGGGCCTGAC CCCGAGACCG AGGCCGTACT CGAGGACCTC GAGGAGCTCG ACGTGAACGC GACGCCGCCG GTTGAGCTTA TGTCCAAGGT ACAGGAATGG CAAGAGAACC TCTCTGAGTG A
|
Protein sequence | MESQAPSMTE ATGIVGEFFS LKEGTDAELL AMQCGDFYEF FGEDAETVSD ELDLKVSQKS SHGSSYPMAG VPLDDLTPYL KALVERGYRV AVADQYETDS GHAREIVRVV TPGTLLETSD ADAQYLAAVV DGGSGSSSGS TDARYGLAFA DVTTGRFLVA EAADTDEALT ELYRFDPVEV LPGPESRTDD DLLGTVRERI DATLTLHETE AFAPKRADHA VQEQFGSETV DRLAVGEAAI AAAGAILSYV EETGAGVLAS MTRIQSHHGD DHVTLDATTQ RNLELTETMQ GERDGSLFAT IDHTETSAGG RLLKEWLQRP RRALDTLEQR QESVAALASA ALARDEMQDT LGEAYDLARL ASKATHGSAD ARDLVAVRET LAVLPALAET IESAPDLADS PLAEIIDRPD RETARELREA LENAIASDPP STVTQGELFQ YGYDDDLDEV IDSHEEVKQW LDTLAEREKR QYSLSHVTVD RNKTDGYYIQ VGKSAADGVP DHYEQIKTLK NSKRFTTDEL AEKERDVLRL EEKRGELEYE LFEELREEVA EQAELLQDVG RALATVDALA SLATHAAENR WVKPDLHHGD ALDIDQGRHP VVEQTTEFVP NDVRLDGEQR GFLVVTGPNM SGKSTYMRQV ACIVLLAQIG SFVPAKEAEI GLVDGIFTRV GALDELAQGR STFMVEMSEL SNILHAATDE SLVILDEVGR GTATYDGISI AWAATEYLHN EVAAKTLFAT HYHELTGLAE NLPRVANVHI AADERDGDVT FLRTVRDGPT DRSYGIHVAD LAGVPGPVVD RAQDVLERLR EEKAIEAKGG HTTEPVQTVF DVGSGQFRGP ANADGGEPDD APDESERGPD PETEAVLEDL EELDVNATPP VELMSKVQEW QENLSE
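The sequence fields can be cross-checked against the "GC content" and "Translation table" rows with a minimal sketch. It assumes the space-separated 10-base blocks are purely cosmetic, and it relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ only in permitted start codons, which this sketch does not handle). All function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Drop the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGGAAAGCC AGGCTCCGAG TATGACAGAG"))  # MESQAPSMTE
```

Passing the full "Gene sequence" value to `gc_content` and `translate` should allow a comparison with the reported 67% GC content and with the "Protein sequence" value.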