Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_2471 |
Symbol | |
ID | 8825324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 2528043 |
End bp | 2532512 |
Gene Length | 4470 bp |
Protein Length | 1489 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | peptidase S8 and S53 subtilisin kexin sedolisin |
Protein accession | YP_003480593 |
Protein GI | 289582127 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAATGGT GCCATACACG CTGTGTGCGT GGTGGAACGA ACACAATCAA AGACAACTAC CAAATGACAG ACAGACGAGA ACAATGGACG GCCATCCTGT TCACGGCCCT CATGGTGCTT TCGCTCGTTG CGATCGCAGG CACCGGCCTG GCAGGCAGTG CGGCCGCAGC CGATGGAATC GACACACAAG CAGTCGACGA TGAGACACCA GCAACGATCG ACCCCGAACT TGAGGAGGCG ACCGGAACTG AGCAAGTGTA TATCGTCCTC GATCAGTACG AGGGCGACCT GAGTGACGAT CGCGACCGCG CGATCGCCCA ACTCCAAGAG CACGCCGAGA CGTCGCAGGC GTCCGTCACG GCATCTCTCG ACACGATGGC GAACGTGGAC GTGGTCAACG ACTACTGGAT CACGAACGCA GTTCGCGCCG AAGTCGACGC GTCGGAAGTC GACACGGCTG AACTCGCCTC GATCGAGGGC GTCCAGTCGG TCGAGCTCCG CCCGGACTAC GAAGTGCCGG AGCCGGAACC ATCCGTCGAG CCAGCCGTTG AACCTGACGA AGACGACTAC ACGTACGGTC TCGATCAGGT CAACGCCTCC GAGACCTGGG CTGACTTTAA CACCCAGGGA GAGAACGTGA AGGTCGCAGT GCTCGACACT GGCTTCGACA TCGATCACCA GGATCTGGAC CTCTACACCG AGGATGCAGA TGACCCGACC TACCCAGGTG GCTGGGTCGA GATTGACGAG GACGGTGACC CAGTCGAGGG CTCCGAGCCA TACGACACGC ACTATCACGG CACGCACGTC GGTGGCACGG TTGGTGCAGC CGCACCGGCT GACGACGATA CGCCGGCCTA CGGTGTCGCG CCGAACGTCG ACCTGCAGCA CGGCCTCGTG CTGCCGGACG GTTCGGGCGC AGACTCCGAC ACCATCGCTG GCTTCGAGTA CGTCGTCGAA GAGATGGACT CCGACGTGGT CAGCATGAGC TTCGGCGCTG GCTGTGGCCT CTTCGGTCCA GTGTACGAAG ACGCCTGGAT TCCCGCAATC CAGAACGCCA ACGACATGGA CGTCGTCACG GTCACCTCCT CGGGTAACTC CGGCGAGGGC TGTGTCGGCT CGCCAGCGAA CACCTACGAC TCGTTCAGCA TCGGTGCCTC GAACGAAGCC GGCGATATTA CCGACTTCTC CAGCGGTAAC ACCATCGAAG CCCACAACTG GGACAACCCC GACCCAGAGT GGCCCGACGA GTGGATCAAG CCAGACGTCT CCGCACCCGG CGAAGACGTC CTGAGCGCGA TGCCAGACGA CGAGTACGAC TACCTTTCGG GTACGTCGAT GTCCGCACCG CACGTCTCCG GCGTGATCGC CCTCATGCTC TCTGCCAACG ACGACCTCAC GCAGGAGGAA ATCGAGGAGA CCCTCGAGGA GACCGCCTGG AAGCCTGACG GCGAACCCGA CGAAAAGGAC GTTCGCTACG GCCACGGCAT CGTCGACGCT TACGCCGCTG TCGACGAAGT TGCTGCGGGT GCACTCGAGT ACGAACTCGG TGACGTCGAC CAGGACGGCG ATGTAACGGT CCAGGACGTG CAGCTAACCC AGCAGTACCT CCAGGATATG GATCCTGAGC CGTTCGCCGA GGACCTCGCA GACATGAACC GCGACGGTGA GGTCACGACG GACGACCTGA GCCTGCTCCA GCAGAAGGTA CAGGGCATGC TCGACGAGGG TGAGATCGAC ATTACGGGAC TCGACGTGCC CGACGAAGTC GACGATGGTG AGATGTTCGA CGTCACCGTC GACCTCGAAA ACCTCGGTGA GGAGGGTGCT GTCCAGGAAG TTGACCTCTT CCTTGGCGAC GACGATGAGC CGGTCGACAC CGAAGTCGTC GACATGGCAG CACCCGGCGT CGACGACCCG ATTGACCACC CAGCAGAGAC GACGATCACG TTCGAACTCG ACGCGGGTGA CCTCGACGGC GGTGACCACA CTATCACCGT CGAAACCGAG GACGCGGACG CGAGCGACAC GGTCACAGTC CTTGCGTCGA ACTTCGAACT CTCGAACCTC GACGCACCCG ATGAAGCCGA CCGCGGTGAC GAGATTACCG TCAGCGCGGA CGTCACGAAC ACGGGCAACG TCGATGACAC CCAGTCCGTC GAGTACCGCT TCGACGACCT CGGCGACGCA CTGGACACAC AGAACGTCAC GCTCGAACCC GACGAGAACG CGACAGTCTC CTTCGATGTC GACACAGAGA ACGTCACTGA AGGAACCTAC GACCACGGTG TCTTCACCGA GGATGACGAG GCAACCGCCG CGATCGACAT CCTCGAACCG TTCTTCAGCG TCGACCTCAC TGACGCGCCC GAGGAACTCG CACCGGGCGA CTCCTACAGC GTCAACGCAA CCGTCGAAAA CACCGGCGAC GCGTCGGACG AACAGACAAT CGCCTACGAA CTCGGCGACG GCGGCGGCGA CGTTGCCGTC GTCGACGTCG CAGCCGACAC GAGCCAGGGC GACGCCGTTG CCAGCGTGCT CGACGAGCGA CTCAACAACG ACACCTACGA TGTCGACATG GTCGTCGCAG ACGACCTGCT CGACGAGATG GACGACTACG ACACGTTCGT CGTCCAGCGA CTCGGTAGCG ACACACTCGC CGAGGACTTC CTCGACGAAC TCGAGAGCGC CCAGAGTGCG GTCTATCTCG ACTCCCACCA GGGTGGCTCC GCAGAAGCCT ACGCAGACGG TATCTACCGC CTGAATAACG TCCGTGAGGA CCCCGAAGAA CGCGACTCTG AGGGAATCGG AACCTCGAGC GATCCGGTAA CGATCGACAT CGAACAGAAC CACCCAATCT TCACGGGTGT CGGTGACGCC GGCGACGAGG TTGAGATGTT CACCGGCACC ACAACCTGGG GGAGCTGGTT CGACGACTAC AGCGGTACCA CGCTTGCGGA TGCCGACTTC GGCGACGGCT ACGCCGGCCC GTCGGTTGCC GTCAGCGACG ACGGCAGCGA ACTGCTGCTC ACGGCGACCG CACGCGACTT CTTCACCGAC GAAGAGGACT TCACCGAGGA AGCGAACCAG TTGCTCGCGA ACTCCGTCGA ATTCACCACG ACGGGAAGCG TGACCGCAGA TGCAGAAACC GCTGACAGCT CGTCTGTCGT CCACCTCGAA CCGGGCGAGA GCGAGGACCT GACGTTCTCC GACGTCGTGC CGGAGGACAC CGAAGCGGGT GACGCAAGCC ACATCGTTGC CAGCGAGGAC GAACAGGACT TCGCCCCGGT CACGATCACC GGCGACGAAG CGAACTGGAA CGTCAAGGGA ACGGTCTCCG ACGACGCTAC CGACGAACCG ATCGAAAACG CTAGCGTCGA ACTCGAAGCC GACAACGAAA CGTACACGAA CGTGACCGAC GCTGACGGCG AGTTCGGTCT CGCCGACGTG CCTGCTGGCG AACACGACCT CACGGTCGAC GCCGAAGGCT ACGCGAGCCA CACTGACGCC GTCCACGTTC CCGAGAACGA CACCGCCACG GTCGACGTTT CCCTGGAGCA GGCTGCAGGA GCGATCAGCG GCGACGTGAT GGCAAGTGAC GACGACGCAC CAGTCGAGAA CGCAACCATC GTCGCCGAGA ACGACGACGG CGATGTCCAC GAGGCAACGA CCGACGAGAA CGGCTCGTAC GAACTCGACG GCGTCTCGGC AGGCACGTAC GTCGTCAACG TCGTCGACAC ACCACCGGGC TACGAACTCG ACGAGATCGT TTCCGTCGCA CCCGGCGAAC ACGTCGACGA TGTCGACTTC GTCGTCGACC GCACCGCCGG TTCGATCGAA GGCACCGTCA CGAACGCCGC TGGCGTCCCA ATCGCTGACG CGAACGTGAT TGACGCCGAC GACGGCGCGT TCAACGTGAC GACCGCCGAG GACGGCTCCT ACGAAATCGA GGACGTCACG CCCGGCACGA ACGCGCTCCG CGCGGTCGCT GATGGCTACG ACGACTCGAA CGTCGAGTTC GTCGACGTTG AGACCGGCGA GACGACGACC GCGAACCTCA CGCTCGGCAC CTACTTCGAG GTCGACGACC TCGCAGCACC TGACACCGCC GAGCAGGGTG AGGAGATCAC CGTCAACGCG ACGGTCACCA ACACCGGCGA GCAGGAAGAC ACCCGAACGG CGTTCTACTT CCCGCCGGGC ACTGACTTCG GCACTGACGT CATCGACTAC CAGCCTGAAC TGGCCGAGAC GGTCACACTC GAGGGCGGCG AGTCGACGAC GGTCGAATTC ACCTACGAAA TTCCGGCAGA CGATGAGTCG GGCGAGTACG AACACGGTAT CTCGGCTGAC GAGGTCGAGT CGACGACGAT CACGATCGAG GCTGCTGACG ACGGTGACGC CAGTATCGCC CACGACGTGA CCGACGTCGC TGGACCGAAC GCGAATCAGC TGGGCGCTGC TCCTGCCTGA
|
Protein sequence | MEWCHTRCVR GGTNTIKDNY QMTDRREQWT AILFTALMVL SLVAIAGTGL AGSAAAADGI DTQAVDDETP ATIDPELEEA TGTEQVYIVL DQYEGDLSDD RDRAIAQLQE HAETSQASVT ASLDTMANVD VVNDYWITNA VRAEVDASEV DTAELASIEG VQSVELRPDY EVPEPEPSVE PAVEPDEDDY TYGLDQVNAS ETWADFNTQG ENVKVAVLDT GFDIDHQDLD LYTEDADDPT YPGGWVEIDE DGDPVEGSEP YDTHYHGTHV GGTVGAAAPA DDDTPAYGVA PNVDLQHGLV LPDGSGADSD TIAGFEYVVE EMDSDVVSMS FGAGCGLFGP VYEDAWIPAI QNANDMDVVT VTSSGNSGEG CVGSPANTYD SFSIGASNEA GDITDFSSGN TIEAHNWDNP DPEWPDEWIK PDVSAPGEDV LSAMPDDEYD YLSGTSMSAP HVSGVIALML SANDDLTQEE IEETLEETAW KPDGEPDEKD VRYGHGIVDA YAAVDEVAAG ALEYELGDVD QDGDVTVQDV QLTQQYLQDM DPEPFAEDLA DMNRDGEVTT DDLSLLQQKV QGMLDEGEID ITGLDVPDEV DDGEMFDVTV DLENLGEEGA VQEVDLFLGD DDEPVDTEVV DMAAPGVDDP IDHPAETTIT FELDAGDLDG GDHTITVETE DADASDTVTV LASNFELSNL DAPDEADRGD EITVSADVTN TGNVDDTQSV EYRFDDLGDA LDTQNVTLEP DENATVSFDV DTENVTEGTY DHGVFTEDDE ATAAIDILEP FFSVDLTDAP EELAPGDSYS VNATVENTGD ASDEQTIAYE LGDGGGDVAV VDVAADTSQG DAVASVLDER LNNDTYDVDM VVADDLLDEM DDYDTFVVQR LGSDTLAEDF LDELESAQSA VYLDSHQGGS AEAYADGIYR LNNVREDPEE RDSEGIGTSS DPVTIDIEQN HPIFTGVGDA GDEVEMFTGT TTWGSWFDDY SGTTLADADF GDGYAGPSVA VSDDGSELLL TATARDFFTD EEDFTEEANQ LLANSVEFTT TGSVTADAET ADSSSVVHLE PGESEDLTFS DVVPEDTEAG DASHIVASED EQDFAPVTIT GDEANWNVKG TVSDDATDEP IENASVELEA DNETYTNVTD ADGEFGLADV PAGEHDLTVD AEGYASHTDA VHVPENDTAT VDVSLEQAAG AISGDVMASD DDAPVENATI VAENDDGDVH EATTDENGSY ELDGVSAGTY VVNVVDTPPG YELDEIVSVA PGEHVDDVDF VVDRTAGSIE GTVTNAAGVP IADANVIDAD DGAFNVTTAE DGSYEIEDVT PGTNALRAVA DGYDDSNVEF VDVETGETTT ANLTLGTYFE VDDLAAPDTA EQGEEITVNA TVTNTGEQED TRTAFYFPPG TDFGTDVIDY QPELAETVTL EGGESTTVEF TYEIPADDES GEYEHGISAD EVESTTITIE AADDGDASIA HDVTDVAGPN ANQLGAAPA
|
| |