Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0422 |
Symbol | |
ID | 5773164 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 378343 |
End bp | 379758 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641316052 |
Product | 3-isopropylmalate dehydratase large subunit |
Protein accession | YP_001581756 |
Protein GI | 161527930 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0065] 3-isopropylmalate dehydratase large subunit |
TIGRFAM ID | [TIGR00170] 3-isopropylmalate dehydratase, large subunit |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0695199 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAAAA CACTTTTTGA AAAAATTTGG GATGCACATG TTGTAGTTGG AAAAGAAAAC GGCCCATCTT TGATTTACAT CGATAGACAT CTAGTTCACG AAGTAACTTC TCCTCAGGCC TTTGATGGTC TTAGAATGAA TAACAGAAAG GTTAGGAGAC CTGATCTTAC CATTGCAACA ATGGATCATA ATGTTCCTAC AACTGATAGG GGGCTTCCAA TTTTAGATCA AACATCATCT GTACAAATTC AAACACTAGA AAAAAACTGC CAAGATTTCG GAATTAAACT ATTTGATATT AACAGTCCTA ATCAAGGAAT AGTTCATGTC ATTGGGCCTC AACTTGGAAT CACTTTACCT GGTTCCACTA TTGTTTGTGG TGACAGTCAC ACTTCTACTC ATGGTGCATT TGGTGCTCTT GCATTTGGAA TTGGAACAAG CGAAGTAGAA CATGTTTTGG CATCCCAGAC TTTGTGGCTA GAAAAACCAA AACCCTTTGA AATTAGAGTA GAAGGAAAGC GAAAGAACCC TCATGCTGTT ACTGCAAAAG ATATCGTACT ATCCATTATC AAAAATATTG GAACTGGCGG TGGGACTGGA ACTGTAATAG AGTACCGTGG TGAGGGAATA GAGGACCTTT CCATGGAGCA GAGAATGACC ATATGTAACA TGTCAATTGA GGCTGGTGCT CGTGCTGGAT TGATTGCCCC TGATGAGAAG ACTTATGATT ATCTTAGAGA TAGGCAATAC ACTCCAAAAA ACTATGAATC TCTTGTAGAA TACTGGCGAG AAAATCTAAA ATCAGATGAT GATGCAAAAT TTGAAAAACA ATTCACATTA CACATTGATG ATATTGCACC TCAAGTAAGT TGGGGAACAA ATCCTGGAAT GACTTGTGAT GTAACTGAAA CAGTTCCAAC ACCTGACGAG TTTTCAAAAG GCGATTCCAA TCAAAAGAAG GGTGCAGAAA AGGCACTTGA TTACATGGAC CTTAAATCTG GAACACCAAT TGAAGAAATT AAAATCGACA GAGTGTTCAT TGGCTCTTGT ACTAATGCAA GACTTGAAGA TTTGATTGAA GCCTCCAAAG TAGTCAAAGG ACAAAAAGTT TCTCCAGATG TTCGTGCAAT GGTGGTTCCT GGCTCTCAAA TGGTAAAGAA ACAAGCTGAA GAGATGGGTC TTGATAAAAT TTTCACTAAT GCTAACTTTG AATGGAGGGA AGCTGGATGT AGTATGTGTC TTGGAATGAA TCCTGATATT TTATCTCCAG GAGAAAGATG TGCAAGTACT TCTAATCGAA ACTTTGAAGG AAGACAGGGC ACTGGTGGAC GAACTCATTT GGTTAGTCCT GTAATGGCAG CTGCTGCTGC AATCAATGGA CATTTTGTTG ATGTAAGGAA GATGGATTTG AGTTAA
|
Protein sequence | MGKTLFEKIW DAHVVVGKEN GPSLIYIDRH LVHEVTSPQA FDGLRMNNRK VRRPDLTIAT MDHNVPTTDR GLPILDQTSS VQIQTLEKNC QDFGIKLFDI NSPNQGIVHV IGPQLGITLP GSTIVCGDSH TSTHGAFGAL AFGIGTSEVE HVLASQTLWL EKPKPFEIRV EGKRKNPHAV TAKDIVLSII KNIGTGGGTG TVIEYRGEGI EDLSMEQRMT ICNMSIEAGA RAGLIAPDEK TYDYLRDRQY TPKNYESLVE YWRENLKSDD DAKFEKQFTL HIDDIAPQVS WGTNPGMTCD VTETVPTPDE FSKGDSNQKK GAEKALDYMD LKSGTPIEEI KIDRVFIGSC TNARLEDLIE ASKVVKGQKV SPDVRAMVVP GSQMVKKQAE EMGLDKIFTN ANFEWREAGC SMCLGMNPDI LSPGERCAST SNRNFEGRQG TGGRTHLVSP VMAAAAAING HFVDVRKMDL S
|
| |