Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_0884 |
Symbol | |
ID | 8823713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | - |
Start bp | 899436 |
End bp | 904172 |
Gene Length | 4737 bp |
Protein Length | 1578 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003479030 |
Protein GI | 289580564 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0424461 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACCAC GCACACGCGG CGCACTCGTC GTCGCGCTCG TCGCCGGCTT GCTCGTCACG AGCCTGATCG CACTGCCGCT GCTCGACGGC GGCCTGTCGG GACTCGGTGA CGACTCGGCA CAGGGAGACC AGCCCGGCGA GCAGACACAG GCGTCACCGC CGGTCGCAGC CGCCGACGAG ACGGCAACCG CCAGCCCTGC CGGGATGGGT GAGCACACTG CGCCGCTTGC GCCACTCGAG TACCCTGGCC CTGCTCAAGC CCAGGTTGAT GGGGCGGACG GTGGAGCGGT GGCGGACGGT GGTGCGGTGG CCGACGGAGC GGCGGTGGGC GATGCAGCTG CAGACGCTGA GTCGCCAGCG CAGCCACAGG CACAGGCACA GGCACAGACG CTCCAGGAAC ACGAGGACGC AGTCGAAACT GGCGTCGACC AGGGAATCGA ACTCGTACAG GCCCAGGGCG TCGAGGTGAG CCAGGAACAG CGCGCGGCCG CAGTCGAGGC GGCCAGCGAG TCGGCCGAAC AGCACCAGGA AGCAGAGGTC GAGCAGGTGC AGGAGGCGAC GGCTGGGGCA GTTCACGGGA CGCTCATTCA GGAGCAGCGC GTGAACGTAA CCCAGATTCA ACACGCCGTC GGCGGCGCGA CGGACGGCGC GCTCGCCCAG CACCAGACGG CGACCGCGAG CCAACTCCAG AGTGCGACCT GGGGTGCGAC CCATGGCGCG ATTGCGCAGG AACAGCGCGT TACCGTCGAG CAGTTACAGG TTGCTACCTT CGGCGCGGCG GCAGGTGCTG CGAGTGAGGC CGCGGAGTAC GAGGTCGAGG AGCAGCCGAA GATTCAGGAG GCAGCACAGG GCGCGGCCTA CGGCGTCCTG ACGCAGTACC AGAAACTGAC GGCCGAACAG CGCCAGACGG TTACGATCGA GCACGTCCAG CACGCAGCCA TCGGTGCCGC CTCGGGTGCA CTCGAGGGCA GCTCGGAGAT TGCACTCGAG CAGGAACAGC AGATCGACAT CGAGATGGAG CAGCGCCAGG CGGTGACGAT CAAGGAGATC CAGACTGCCG CGAAGGGAGC GGCCAAGGGT GCGCTGGTCC AGAAGCAGAC GGTGACGGTC GAGCAGACCC AGTCCGCGGC GTGGGGCGCG AGCGCGGGTG CGCTCAAGCA GGTCCAGTCG GTCCACGTCG AGCAGGTCCA GCAGATCACG ACGCCGAAGA TCGAGGAGGC TGCGAAGGGG GCGGCGACGG GGGCGATCAC GCAGTCCCAA GAGGCGACTG TCGAACAGAT CCAGGCGGCG GCTGATGGCG GCGCACAGGG TGTGCTCGTC CAGCGCCAGG ACGTGTCGGT CACGCAGATC CAGTCGGCCG CGACGGGTGC CTCGAAGGGG GCAGTCGCGT CGGCGATCCA GTACCAGGTC GTCGAAATCG AGCAGATTCA GTCCGTCGCG TTCGGCGCTG GCGAGGGCGC GGTGATCCAG AAGCAGGTCG TCGACATCAC GCAGGTCCAG CACCTCGCGA TGGGAAGCGC AGAGGGCGCG CTCACGCAAC ACCAGGAGGC GACCGTCACA CAGCTTCAGG TCGCGGCGTC GACCGCGAGC CAGGAGACGG CGCGGGCGAT CCAGGAACAG CGGATCAGCG TCACGCAACT CCAGTTGCTG ACCGCAGAGA CCGCGGCCGA CGCGACGGGG TACGCCGTCG ACCAGGGCAT CGACGACGGC GCACAGCTCG TCCAGTACGT CGAAATCGAA CTCGTCCAGC GCATCGAACT GATCGACGAA CTCGAGGGCA CCGCCTCGCT CTCGTTCCCC GATCAGAACA CGACCGGCGA GACGGTCAAT ATCGCTAGCG TCGACCTCTC CGAAGGCGGC TTCGTCGCGG TCTACGACGA CACGACCGCG GCGCTCGATC CCGAGGACGT GATTGGCGTC TCCGGCTACC TCGAACCCGG CGAACACGGA GACGTGGAAA TCGAACTCGA GGAGCCCCTC GAAGACGACC GCTCGCTCGT CGCGGCGGTC CACCACGACA CGACCGACGA CGAGACGTTC CGGTACGTCG AACGCGACGG CGGCGACGAC GAGCCGTACG TCACGGAGGG CGGCGCACCG GTGCTCGATA TGGCCTTCAT CATGGTCGAC CCCGAGGAGC CTGACGAGCC CGAGGCCGAG GCCGAGCTTT CGGTGAGCGA CCAGACCGGC GACGGCGAGA CGCTCACGGT CGACGAAGCC AACGCAACCG TCGACTACGT CGTGAGCGCC GTCTACGACG ACCAGCGGGT CGACAGCGAG ACTATCGAAG CCAACGAGAC CGTCTCCGAA CTCGAACTCG ACCTCGAGCC GCCGATCGAA TCGGACGGAC CGGTCTCAGT TGCCGTCCGC GCTGGCGCGG ACGACGAGGT GCTCGCGAGT GACACCATCG AGTACACGCT CGATGACCCG TTCGACCCCG AGTCGACGCT CAGCGTGAGC GACCAGACCG GCGACGGCAC CAGCGTCACC ATCGACGAAG CCAACGCCTC TGTCGAGTAC GCGCTCACCG TGACCGACGG CGACGGTGAC GGCGAGCCGC TCACCGAAAC CGAACCCTTC CCCGCCGGCG AGGCCATCGA AAACGAATCG ATCGACCTCG AAACGCCACT CGAGGAGAAC GCACTCCTCG AGGTCTCACT CGTCGCCACC GAGGAAAACC AGACGCTCGA GACGGCGTCA CTCGAGTACA CCGTCGACGA GGAGTTCCAG GTCGAATTCG TCAACTGTAC GCGCGCGGAG GTGACGGGCT CGTTCGAGGA AGGTGAGACG GTCGCCGCGA GCACGGGCTT CTACGCGGCC AGTGGGTTTG GCAATACGAT TATCGAAGAC TTCGTGACCG TCGGTGACCA GGTCGAGGCA CCGTTCACGG GGACGATTGT CTTCGAGATC GGTGCGGAAG ACGACTTCGA GGGAGCTGAA GATGGTGAGG GCACCATCAC GGTCGGTGTG CCCGACTACG GCACGTTCGG GACCTACATC TCCGGCATTA GCTCGGACGA AGCGATTCCG TTCGCCAGTA TCGACCATCC GAATCCGCAG GGACAGGAGT GCAACGAGGA CGCACGGCCC GAGGAGCCGT CGATCTCCGT CGCAGAGACG GAGCCGGGTG AGCCGACCGG CGACGACGGC GAGTGGGCGT GGGGTGAGGA CACGATCGAT GTCACCTTTA GCTCCGAGAA CCCGAACGAA GAGGCACTGC CCGGCGTGAG CGAGTTCGTC GAGGGAACGA CCGAAGACGA ACCGGTCGGT GCACTCGAGC CCGGCAACGA GACGTTCACC GTCGAGTGGA CGCCCGCAGA CGAAGACGAG CGGCTCGTCT GGGAGTTCGG CCTCCAGTCG TTCGGCTACG AGGAGCCGCT GCTCGCCGAG ACGGATCCCG CCGGCGAAGT CGTCGACATC CCCGAGCCGG ACGACCCCGC CGAGTTCGAG GTCGAAATCA CTGACACCAA CTCGCCGGTC ACACAGGGCG ACGACCTCGA GGTCGAGGCG CTCGTCGAGA ACGTCGGCGA CGAACCCGGC GAGCAAGAGA TTGAACTCAC ACTCGACGAC ACCGCGGTCG ACGCGGAGAC GCTCGAACTC GGGACGAACC CGAACGAAAC GGTCACGCTC GAAGCGGACA CCACTGAGTT CGAGCCCGGC GAGTACACCG CGACGGTCGA AAGCGAGAAC GACACCGACG AGACGCCGGT CACGATCGAG GAGCCAGCCG ACCCCGCCGA GTTCGAGGTC GAAATCACTG ACACCAACTC GCCCGTCGAA CAGGGCGACG ATCTCGAGGT CGAGGCACTC GTCGAGAACG TCGGCGACGA GGCCGGCGAG CAGGAACTCG AGTTCGCCCT CGACGAGACG CTGGTCGATT CGGAGCCGGT GGCACTCGAG TCCGATGCGA GCGAGACGGT TGCGTTCAGT ACGCCGACCG ACGAGCTCGC ACCTGGTGAG TACACGGCAA CGGTCGCGAG CGAGAACGAG ACCGACGAGG CGGTTGTGAC GGTGACCGAG CCGGCAGCCG CGGAGTTCTC GCTGGTGGAT CTCTCGGCAC CGCCGTCTGA CGTGGCCGGC CAGCCGACGA CTGTTGTCGC GACGATTATG AATACGGGCG ACGACGGCGA CACGCAGACG GTTACGTACA GCGTAGACGG AGAGGTAATC GAAGAGCGGA GCGTCTCGCT CGAACCGGGC GACACGACGG TCGAGCAGTT CTCGCCGACG CTGCCGGAGG GCGAGTCGGA CCACACGGTC GCAACCGAGG ATGCGGAGCA GACGGTGACG ATCGAGGGGC TTGCGGTGTT GCCGGGCGAA GAGGCAGATG ATGGAATCGC TGGCGAAGAG CAGCCAGCCG AACCGGAACC TGACGGGGAA TCACCGGATG ACGGCTCGCC GGACGAGGAG ATGCCTACTG AGCCGCCGGC AAACGGCGAC CAGGCCGGTG CGGATGCGGA GAATCAAGAG CAAAATGGCG CTGAGACGGA TGGACAAGAG CAGGAACAGG AACAGGAACA GGAGCCGAAC GGAGTCGAAA CCGATGACGG TGCAGCCGAC GACGGTGCAG CCGACGACGG TGACGAGACG CTCGGAACCA ACGGCGGCGA CGACGCTGGC GGCGAGGACG CTACCGCCGC CGACAACGGT CCCGATACCG AAGAGAATGA CGGCGACGAG AACACAGCCG ACACCACGGA CACTACCTCA GAACCGGCTG CGGCAGTGAC GGCGTAA
|
Protein sequence | MGPRTRGALV VALVAGLLVT SLIALPLLDG GLSGLGDDSA QGDQPGEQTQ ASPPVAAADE TATASPAGMG EHTAPLAPLE YPGPAQAQVD GADGGAVADG GAVADGAAVG DAAADAESPA QPQAQAQAQT LQEHEDAVET GVDQGIELVQ AQGVEVSQEQ RAAAVEAASE SAEQHQEAEV EQVQEATAGA VHGTLIQEQR VNVTQIQHAV GGATDGALAQ HQTATASQLQ SATWGATHGA IAQEQRVTVE QLQVATFGAA AGAASEAAEY EVEEQPKIQE AAQGAAYGVL TQYQKLTAEQ RQTVTIEHVQ HAAIGAASGA LEGSSEIALE QEQQIDIEME QRQAVTIKEI QTAAKGAAKG ALVQKQTVTV EQTQSAAWGA SAGALKQVQS VHVEQVQQIT TPKIEEAAKG AATGAITQSQ EATVEQIQAA ADGGAQGVLV QRQDVSVTQI QSAATGASKG AVASAIQYQV VEIEQIQSVA FGAGEGAVIQ KQVVDITQVQ HLAMGSAEGA LTQHQEATVT QLQVAASTAS QETARAIQEQ RISVTQLQLL TAETAADATG YAVDQGIDDG AQLVQYVEIE LVQRIELIDE LEGTASLSFP DQNTTGETVN IASVDLSEGG FVAVYDDTTA ALDPEDVIGV SGYLEPGEHG DVEIELEEPL EDDRSLVAAV HHDTTDDETF RYVERDGGDD EPYVTEGGAP VLDMAFIMVD PEEPDEPEAE AELSVSDQTG DGETLTVDEA NATVDYVVSA VYDDQRVDSE TIEANETVSE LELDLEPPIE SDGPVSVAVR AGADDEVLAS DTIEYTLDDP FDPESTLSVS DQTGDGTSVT IDEANASVEY ALTVTDGDGD GEPLTETEPF PAGEAIENES IDLETPLEEN ALLEVSLVAT EENQTLETAS LEYTVDEEFQ VEFVNCTRAE VTGSFEEGET VAASTGFYAA SGFGNTIIED FVTVGDQVEA PFTGTIVFEI GAEDDFEGAE DGEGTITVGV PDYGTFGTYI SGISSDEAIP FASIDHPNPQ GQECNEDARP EEPSISVAET EPGEPTGDDG EWAWGEDTID VTFSSENPNE EALPGVSEFV EGTTEDEPVG ALEPGNETFT VEWTPADEDE RLVWEFGLQS FGYEEPLLAE TDPAGEVVDI PEPDDPAEFE VEITDTNSPV TQGDDLEVEA LVENVGDEPG EQEIELTLDD TAVDAETLEL GTNPNETVTL EADTTEFEPG EYTATVESEN DTDETPVTIE EPADPAEFEV EITDTNSPVE QGDDLEVEAL VENVGDEAGE QELEFALDET LVDSEPVALE SDASETVAFS TPTDELAPGE YTATVASENE TDEAVVTVTE PAAAEFSLVD LSAPPSDVAG QPTTVVATIM NTGDDGDTQT VTYSVDGEVI EERSVSLEPG DTTVEQFSPT LPEGESDHTV ATEDAEQTVT IEGLAVLPGE EADDGIAGEE QPAEPEPDGE SPDDGSPDEE MPTEPPANGD QAGADAENQE QNGAETDGQE QEQEQEQEPN GVETDDGAAD DGAADDGDET LGTNGGDDAG GEDATAADNG PDTEENDGDE NTADTTDTTS EPAAAVTA
|
| |