Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0420 |
Symbol | |
ID | 5774001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 376584 |
End bp | 377852 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641316050 |
Product | 3-isopropylmalate dehydratase |
Protein accession | YP_001581754 |
Protein GI | 161527928 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0065] 3-isopropylmalate dehydratase large subunit |
TIGRFAM ID | [TIGR01343] homoaconitate hydratase family protein [TIGR02086] 3-isopropylmalate dehydratase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0665356 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATTG TAGAGAAGAT TCTTGCTCGT GCATCAGGAA AATCGCAAGT TGCCCCTGAT GATGTAGTTT TTGCAAAGGT CGACAAGGTA ATGGTTCATG ATGTTTCTGG ACCTGGAGTT CTCAAGGTGT TTGATAAATT AAAAAACAAA GGTGTTGATG TCAGCAAACT TTGGGATCCT ACAAAGGTAT GGGTTGCTGA AGATCACTTT GTTCCTTCTG CAGAAAAAAT ATCTGCTGAA AATATCGTTA AATTATCAAA TTTCACAAAA AACTATGGAA TTGAAAAACA CTTCAAATAT GGAATGGGTC AGTATGGAAT CTGCCATACA TTATCTCATG AAGAAGCAAT GGTGATGCCT GGTGATGTCT ATGTTGGCGG TGATTCTCAC ACAAACACTA CGGGTGCACT TGGTGCATTT GCATGTGGGT TAGGTCATAC TGATATTGCA TATGTTTTGC TTAATGGACA AATCTGGTTT AAGGTGCCAG AGACTGATTA TTTCAAACTA AACGGAAAAC TCCCAGATCA TGTAATGGCT AAAGATTTGA TCTTGAAAAT CATTGGCGAT ATTGGAACTG ATGGAGGAAA TTACAGAACA ATGCAGTTTG GCGGTACTGG GATTGATGAG ATGTCTGTTG AAAGCAGATT GACACTATGT AACATGACAA CAGAAGCTGG AGCAAAGAAT GGAATTGCTG AAGCTGATCA AAAAGTCGTA GATTATCTTT CTAGTAGAGG TGCAACAAAT GCACAAGTAT TCAAAGGTGA TGATGATGCA CAGTATGCAA ATGTGTATGA GTATGAAGCC TCTGAAATGG AACCTCTTGT CGCAAAACCA TTCTCTCCTG AAAATATTGC AGTAGTAAGA GAAGCTCCTT CAGTAGAACT TGACAAATCC TACATCGGTT CTTGTACTGG GGCAAAGTAT GAAGACTTGG AAGCTGCAGC AAAGATTCTC AAAGGAAAGA CTGTAAAGAT TAGAACAGAA ATTCTTCCAG CATCTATCTC AATTTACAAG CGTGCAATGG AAAATGGATT ACTTACCATA TTCTTAGATG CAGGCGTTAC TGTAGGTCCA CCAACTTGTG GTGCATGTTG TGGAGCACAC ATGGGTGTTT TGGCTAAAAA TGAAATCTGC ATAAGCACTA CAAACAGAAA TTTCCCAGGT AGAATGGGTC ATGTAGAGTC TGAGACATAT CTTTCATCTC CAATGGTTGC TGCAGCTTCC GCAGTAACTG GAAAAATCAC TGATCCGAGG GATTTGTAA
|
Protein sequence | MNIVEKILAR ASGKSQVAPD DVVFAKVDKV MVHDVSGPGV LKVFDKLKNK GVDVSKLWDP TKVWVAEDHF VPSAEKISAE NIVKLSNFTK NYGIEKHFKY GMGQYGICHT LSHEEAMVMP GDVYVGGDSH TNTTGALGAF ACGLGHTDIA YVLLNGQIWF KVPETDYFKL NGKLPDHVMA KDLILKIIGD IGTDGGNYRT MQFGGTGIDE MSVESRLTLC NMTTEAGAKN GIAEADQKVV DYLSSRGATN AQVFKGDDDA QYANVYEYEA SEMEPLVAKP FSPENIAVVR EAPSVELDKS YIGSCTGAKY EDLEAAAKIL KGKTVKIRTE ILPASISIYK RAMENGLLTI FLDAGVTVGP PTCGACCGAH MGVLAKNEIC ISTTNRNFPG RMGHVESETY LSSPMVAAAS AVTGKITDPR DL
|
| |