Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_1144 |
Symbol | |
ID | 8823975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 1162523 |
End bp | 1164613 |
Gene Length | 2091 bp |
Protein Length | 696 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | |
Product | glycoside hydrolase 15-related protein |
Protein accession | YP_003479290 |
Protein GI | 289580824 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACACG ACTATCCACC CCTTCGAGAC TACGGCAGTA TCGGCAACGA CGACCGGTGT GCACTGGTCA GCAGATACGG CTCGATCGAC TGGTGTTGTT TCCCCCACCT CGAGTCACCG AGCGTGTTCG CCCGACTGCT CGATGCGACT GACGGGGGAC ACTTTACCGT CTCGCCGACG GCCGACGACT TCGAGTCGAG TCATCAGTAC GCCGACCGAA CGAACGTCCT TCAAACTACT TTCGAGACGG AATCGGGCCA GGTGACGCTG ACCGACTTCA TGCCCATCCA GAACGGGGAT GCAGCGGAGC AACAATCTCG CAACCAGGAT TCCGACGACC AGCACCGGGA CCAACACCAA CACCCACACC AACACCCACA CCAGCACCAG CACCAACACC CACACCCCCA GCACGCAATC TACCGCCAAC TCGAGTGCGA CCGCGGTTCG ATGGAGTGCC AGGTCGTTTT CGAACCCCGA CTCGAGTACG CGCGAGTGAC GCCGGCACTC GAGACGAGTG ATGGTGGCGT CACAGCGGTT CGTGGGGGTG GTCGCAGCGA ACCGGGCAAT GATGATGATG ATGATGATGA TGGTGTCGAT GGCCATGTCG ATGACAATGC CGACTGGCTC GAGCAGCGCC ACCAGCCGCT ACACTACGTC GGCGATGTCG ATCTCGAAAT TAACGAGGCG GCGGCGAAAG CGACTGGAAC AGTCACGCTC GAAGCCGGCG ATACCTGCTG GCTCGGGGTC CAGTACGGCG GCGAGGAGCC ACAGGGATCG CCGTCGTATC AGGAGTGGCT CGACGAGACG AAGCACTACT GGCGCGAGTG GGTCGGCGAT CGGGAGGGAG TCGCAGAGTC AGTCTCGGAA CGGTGGCACG AGATGGTCAT CCGTTCGGAA CTGGTGCTGA AACTCCTGAT TCACCACGAG ACGGGTGCAA TTCCCGCGGC TGCAACCACG TCGGTCCCGG AGGAAATCGG CTCCGAACGC ACCTGGGACT ACCGGTATAA CTGGATTCGG GACGCGAAGT TCACCGTACA GGCGTTGCAC GACACCGGCC ACCGCCAGGA GGCCAGAGAC TACTTCGACT GGTTCGTCGG TATCGCAACG GATCATCCCA CCGAAATCCG GCCGCTGTAC GGACTTCATG GCGAATACGA CGACGATCTC GAAGAACGGA CGCTCGATCA TCTCTCCGGC TATCGCGATA CTGGTCCGGT TCGGATCGGC AACGGTGCAG CATCACAGCT CCAACTCGAC GTCTACGGCA CGTTCGTCCA GGCAATCTAC GAAACCATCC AGTTCGACGA GGAAGCCGAA CTGAGCGAGG ACAGCTGGGA CGCCGTCATC GAGAGCATCA ACCACGTCTG TCGCAACTGG GACCAGCCGG ACGCCGGAAT CTGGGAGTTC CGCGACGAAC ATCGTCACTT CCTGCACTCG AAGCTGCTGT GCTGGGTCGC CCTCGACCGC GGTATCGCAC TGGCTGAGGC AAACGACTTC GACGCACCGC TCGAGCACTG GAGAGACACA CGCGACGAGG TTCGTGGCGC TATCGAAACG CGCGGCTACA GCGAGGAGGC CGGCAGCTTC GTCCAATACT TCGACAGTGA CGAGGCGATC GACGCGACGG CACTTCTGAT CCCCATTTAC GAGTTTCTCC CGCCGGAGGA CGAGCGCGTC CAGTCGACCA TCGACACCGT CCTCGAGAAG CTCACGACGG ACGATGGGCT CGTCGTGCGC TTCGTAGATA CCGACGTACG GGAAGACGAG GAAGAGGGGT TCTTGCTCTG TTCGTTCTGG CTCATCGACG CGCTCGTCCT CTCGAACCGA CTCGAACTCG CCACAGAGTA CTTCGAATCG CTACTCGAGT ACACCTCGCC GCTCGGGCTG TATTCGGAGA AGGTCGACCC CGATAGCGGT CGACTGCTCG GGAATTTCCC GCAGGCGTTC TCGCATCTGG GGTTGATCAA CAGCGTGAGT TACCTCGCGC GGGCGATCGA TGCGGAGGGG GACGTTTCAC CCGAGGATTT CCATCCGGAG AACGTGGAGA CGTTGTTTCG ACGGGGTGAC GAGGATGTCG TGACTGAGTA A
|
Protein sequence | MEHDYPPLRD YGSIGNDDRC ALVSRYGSID WCCFPHLESP SVFARLLDAT DGGHFTVSPT ADDFESSHQY ADRTNVLQTT FETESGQVTL TDFMPIQNGD AAEQQSRNQD SDDQHRDQHQ HPHQHPHQHQ HQHPHPQHAI YRQLECDRGS MECQVVFEPR LEYARVTPAL ETSDGGVTAV RGGGRSEPGN DDDDDDDGVD GHVDDNADWL EQRHQPLHYV GDVDLEINEA AAKATGTVTL EAGDTCWLGV QYGGEEPQGS PSYQEWLDET KHYWREWVGD REGVAESVSE RWHEMVIRSE LVLKLLIHHE TGAIPAAATT SVPEEIGSER TWDYRYNWIR DAKFTVQALH DTGHRQEARD YFDWFVGIAT DHPTEIRPLY GLHGEYDDDL EERTLDHLSG YRDTGPVRIG NGAASQLQLD VYGTFVQAIY ETIQFDEEAE LSEDSWDAVI ESINHVCRNW DQPDAGIWEF RDEHRHFLHS KLLCWVALDR GIALAEANDF DAPLEHWRDT RDEVRGAIET RGYSEEAGSF VQYFDSDEAI DATALLIPIY EFLPPEDERV QSTIDTVLEK LTTDDGLVVR FVDTDVREDE EEGFLLCSFW LIDALVLSNR LELATEYFES LLEYTSPLGL YSEKVDPDSG RLLGNFPQAF SHLGLINSVS YLARAIDAEG DVSPEDFHPE NVETLFRRGD EDVVTE
|
| |