Gene Information |
Locus tag | Nmag_2046 |
Symbol | |
ID | 8824889 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 2085690 |
End bp | 2087339 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | glycosyl transferase family 2 |
Protein accession | YP_003480178 |
Protein GI | 289581712 |
COG category | |
COG ID | |
TIGRFAM ID | |
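The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; the record does not state either convention. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Nmag_2046.
length = gene_length_bp(2085690, 2087339)
print(length)                     # 1650
print(protein_length_aa(length))  # 549
```

Under these assumptions the 1650 bp span corresponds to 550 codons, one of which is the stop codon, which matches the reported protein length of 549 aa.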
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.421423 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCCAA TCATGGTCGC GCCGGTCGTC GTCGGCCGCG AGCGTGCACG GTTTGTGGTC CGCCAGCTCC AGTCGGTCAA GTGGCGCTCG GTCGCGCTGT TTCTCTCCGC ACTGGTGCCG TTCTGGCTCT TGATCAACGT CATGACACCA GAGACGCTGT ACGCCATTGG GCTCCTTGGT GTCCTGTTAC TGGCCTATCT CGGCTGGTCG TACTACGTCG TCGAACACGC CCCACGGTTC CGGGCCGTCT TCGGAACGCG AGGGCTGGCG GTTGCCGTCG GTAGCTACCT CGGACTCATC GTCTGGCTCG GCTGGCTCAC CCCGTCGCCG CTCGCGCTCG TCCACCTCGG CGCCGTCGTG TTGATCTTCG TCTACTACTG GTTCATCGCG TTCATCGCGC TCTTCCACGA CCAGATCGGC CGTTCGAAGT ACGTCCCGAA CCCACCCTAC CCGCAGATTA CCGTCCTCAT TCCGGCCTAC AACGAGGAGG GCTACGTCGG CCGAACGATC CAGTCCCTGC TCGACGCCAA CTACCCCGCA GACGCACTCG AGATCATCGC AGTCGACGAC GGCAGCACCG ACGACACGCT CGCTGAGGCG AGCGCCTTCG CCGCCGCGAG CGAGCAGGTC TCAGTCGTCA GCAAGGCAAA CGGCGGCAAG TACTCCGCGC TGAACTACGG CCTCCTGTTT GCCGCCGGCG ACATCATCGT CACCGTCGAC GCCGACAGTA TCGTCGACCG AGATGCCCTG AAACACATCG TCGCCCCGTT CGCCGCCGAC GACGACATCG GCGCCGTCGC GAGTAACGTC ACCATCTGGA ACCGCGACTC GCTCATCACC CGCTGCCAGC AACTCGAGTA CACCATCGGT GTGAACATCT ACCGGCGCGC GCTCGATTAC TTCGGCATCG TGATGGTCGT CCCCGGTTGT CTGGGCGCGT ACCGCCGCGA GGTGCTCTCG GAGGTGTTCG CCTACGATCC CGACACGCTC ACGGAGGATT TCGACGTGAC GATGAAGGTG CTTCGGGCTG GCTACCGCGT CTCCGTGAGC GATGCGCGGG TCTACACCGA AGCACCGGCG ACCTGGGGCG ATCTCTACCG CCAGCGCCTG CGCTGGTACC GCGGCAACTA CATGACGATT ATCAAGCACT GGTCGGTCGT GACGGACTCG TCGTACGGCT ACTTGAACCG GATCGCGCTT CCGTTCCGGC TGGTCGAGAT GTTCTTCCTG CCGTTTGCGA GTTTCGTCGT GCTTGCGTAC ATCCTCTGGC TGATCGCTGC CGGCCACGTC CTCACCGTGT TCGCGGTGTT TGTCTTCTTC ACGAGCATCG TCTTCCTGAT CGCGGCGCTC GGTATCCAGA TCGAAGGCGA GGACTGGCGG CTGCTCGTCT ACGCACCGCT GCTCGTTGTC GGCTACAAGC AGTTCCACGA CGCGCTCAAC GTCAAGTGTC TGTTCGACGT GCTCACGAGC CCAGAACTCG GCTGGACGCG CGCAGCGCGC ATCGAGCAGG TTGTGGAGGC TCCGGACGCA GGTGCAATAC CAGAGCCAGC TGTAAGTGCA TCGCCGTCTC CAGCCCCAAC TCCTGAGGCA GGACCACCAA CAGAAACGGA GACAGAGAGC GAGACCGTTG CCGACACGGA CTCGAAGTAA
|
Protein sequence | MNPIMVAPVV VGRERARFVV RQLQSVKWRS VALFLSALVP FWLLINVMTP ETLYAIGLLG VLLLAYLGWS YYVVEHAPRF RAVFGTRGLA VAVGSYLGLI VWLGWLTPSP LALVHLGAVV LIFVYYWFIA FIALFHDQIG RSKYVPNPPY PQITVLIPAY NEEGYVGRTI QSLLDANYPA DALEIIAVDD GSTDDTLAEA SAFAAASEQV SVVSKANGGK YSALNYGLLF AAGDIIVTVD ADSIVDRDAL KHIVAPFAAD DDIGAVASNV TIWNRDSLIT RCQQLEYTIG VNIYRRALDY FGIVMVVPGC LGAYRREVLS EVFAYDPDTL TEDFDVTMKV LRAGYRVSVS DARVYTEAPA TWGDLYRQRL RWYRGNYMTI IKHWSVVTDS SYGYLNRIAL PFRLVEMFFL PFASFVVLAY ILWLIAAGHV LTVFAVFVFF TSIVFLIAAL GIQIEGEDWR LLVYAPLLVV GYKQFHDALN VKCLFDVLTS PELGWTRAAR IEQVVEAPDA GAIPEPAVSA SPSPAPTPEA GPPTETETES ETVADTDSK
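The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, with a GC-fraction helper and a basic sanity check for a complete coding sequence. The helper names are hypothetical, the toy input is not taken from the record, and the stop codons are those of translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(display_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(display_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop.

    The start codon is not checked because table 11 permits
    alternative start codons in addition to ATG.
    """
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy input in the same block layout as the record.
toy = clean_sequence("ATGGCC GGT TAA")
print(toy)                          # ATGGCCGGTTAA
print(gc_fraction(toy))             # 0.5
print(ends_like_complete_cds(toy))  # True
```

Applying `clean_sequence` to the Gene sequence field and passing the result to `gc_fraction` gives a value that can be compared with the GC content listed in the Gene Information table.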