Gene Nmar_0422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0422 
Symbol 
ID5773164 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp378343 
End bp379758 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content39% 
IMG OID641316052 
Product3-isopropylmalate dehydratase large subunit 
Protein accessionYP_001581756 
Protein GI161527930 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.0695199 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAAAA CACTTTTTGA AAAAATTTGG GATGCACATG TTGTAGTTGG AAAAGAAAAC 
GGCCCATCTT TGATTTACAT CGATAGACAT CTAGTTCACG AAGTAACTTC TCCTCAGGCC
TTTGATGGTC TTAGAATGAA TAACAGAAAG GTTAGGAGAC CTGATCTTAC CATTGCAACA
ATGGATCATA ATGTTCCTAC AACTGATAGG GGGCTTCCAA TTTTAGATCA AACATCATCT
GTACAAATTC AAACACTAGA AAAAAACTGC CAAGATTTCG GAATTAAACT ATTTGATATT
AACAGTCCTA ATCAAGGAAT AGTTCATGTC ATTGGGCCTC AACTTGGAAT CACTTTACCT
GGTTCCACTA TTGTTTGTGG TGACAGTCAC ACTTCTACTC ATGGTGCATT TGGTGCTCTT
GCATTTGGAA TTGGAACAAG CGAAGTAGAA CATGTTTTGG CATCCCAGAC TTTGTGGCTA
GAAAAACCAA AACCCTTTGA AATTAGAGTA GAAGGAAAGC GAAAGAACCC TCATGCTGTT
ACTGCAAAAG ATATCGTACT ATCCATTATC AAAAATATTG GAACTGGCGG TGGGACTGGA
ACTGTAATAG AGTACCGTGG TGAGGGAATA GAGGACCTTT CCATGGAGCA GAGAATGACC
ATATGTAACA TGTCAATTGA GGCTGGTGCT CGTGCTGGAT TGATTGCCCC TGATGAGAAG
ACTTATGATT ATCTTAGAGA TAGGCAATAC ACTCCAAAAA ACTATGAATC TCTTGTAGAA
TACTGGCGAG AAAATCTAAA ATCAGATGAT GATGCAAAAT TTGAAAAACA ATTCACATTA
CACATTGATG ATATTGCACC TCAAGTAAGT TGGGGAACAA ATCCTGGAAT GACTTGTGAT
GTAACTGAAA CAGTTCCAAC ACCTGACGAG TTTTCAAAAG GCGATTCCAA TCAAAAGAAG
GGTGCAGAAA AGGCACTTGA TTACATGGAC CTTAAATCTG GAACACCAAT TGAAGAAATT
AAAATCGACA GAGTGTTCAT TGGCTCTTGT ACTAATGCAA GACTTGAAGA TTTGATTGAA
GCCTCCAAAG TAGTCAAAGG ACAAAAAGTT TCTCCAGATG TTCGTGCAAT GGTGGTTCCT
GGCTCTCAAA TGGTAAAGAA ACAAGCTGAA GAGATGGGTC TTGATAAAAT TTTCACTAAT
GCTAACTTTG AATGGAGGGA AGCTGGATGT AGTATGTGTC TTGGAATGAA TCCTGATATT
TTATCTCCAG GAGAAAGATG TGCAAGTACT TCTAATCGAA ACTTTGAAGG AAGACAGGGC
ACTGGTGGAC GAACTCATTT GGTTAGTCCT GTAATGGCAG CTGCTGCTGC AATCAATGGA
CATTTTGTTG ATGTAAGGAA GATGGATTTG AGTTAA
 
Protein sequence
MGKTLFEKIW DAHVVVGKEN GPSLIYIDRH LVHEVTSPQA FDGLRMNNRK VRRPDLTIAT 
MDHNVPTTDR GLPILDQTSS VQIQTLEKNC QDFGIKLFDI NSPNQGIVHV IGPQLGITLP
GSTIVCGDSH TSTHGAFGAL AFGIGTSEVE HVLASQTLWL EKPKPFEIRV EGKRKNPHAV
TAKDIVLSII KNIGTGGGTG TVIEYRGEGI EDLSMEQRMT ICNMSIEAGA RAGLIAPDEK
TYDYLRDRQY TPKNYESLVE YWRENLKSDD DAKFEKQFTL HIDDIAPQVS WGTNPGMTCD
VTETVPTPDE FSKGDSNQKK GAEKALDYMD LKSGTPIEEI KIDRVFIGSC TNARLEDLIE
ASKVVKGQKV SPDVRAMVVP GSQMVKKQAE EMGLDKIFTN ANFEWREAGC SMCLGMNPDI
LSPGERCAST SNRNFEGRQG TGGRTHLVSP VMAAAAAING HFVDVRKMDL S