Gene Nmar_0420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0420 
Symbol 
ID5774001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp376584 
End bp377852 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content39% 
IMG OID641316050 
Product3-isopropylmalate dehydratase 
Protein accessionYP_001581754 
Protein GI161527928 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR01343] homoaconitate hydratase family protein
[TIGR02086] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.0665356 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATTG TAGAGAAGAT TCTTGCTCGT GCATCAGGAA AATCGCAAGT TGCCCCTGAT 
GATGTAGTTT TTGCAAAGGT CGACAAGGTA ATGGTTCATG ATGTTTCTGG ACCTGGAGTT
CTCAAGGTGT TTGATAAATT AAAAAACAAA GGTGTTGATG TCAGCAAACT TTGGGATCCT
ACAAAGGTAT GGGTTGCTGA AGATCACTTT GTTCCTTCTG CAGAAAAAAT ATCTGCTGAA
AATATCGTTA AATTATCAAA TTTCACAAAA AACTATGGAA TTGAAAAACA CTTCAAATAT
GGAATGGGTC AGTATGGAAT CTGCCATACA TTATCTCATG AAGAAGCAAT GGTGATGCCT
GGTGATGTCT ATGTTGGCGG TGATTCTCAC ACAAACACTA CGGGTGCACT TGGTGCATTT
GCATGTGGGT TAGGTCATAC TGATATTGCA TATGTTTTGC TTAATGGACA AATCTGGTTT
AAGGTGCCAG AGACTGATTA TTTCAAACTA AACGGAAAAC TCCCAGATCA TGTAATGGCT
AAAGATTTGA TCTTGAAAAT CATTGGCGAT ATTGGAACTG ATGGAGGAAA TTACAGAACA
ATGCAGTTTG GCGGTACTGG GATTGATGAG ATGTCTGTTG AAAGCAGATT GACACTATGT
AACATGACAA CAGAAGCTGG AGCAAAGAAT GGAATTGCTG AAGCTGATCA AAAAGTCGTA
GATTATCTTT CTAGTAGAGG TGCAACAAAT GCACAAGTAT TCAAAGGTGA TGATGATGCA
CAGTATGCAA ATGTGTATGA GTATGAAGCC TCTGAAATGG AACCTCTTGT CGCAAAACCA
TTCTCTCCTG AAAATATTGC AGTAGTAAGA GAAGCTCCTT CAGTAGAACT TGACAAATCC
TACATCGGTT CTTGTACTGG GGCAAAGTAT GAAGACTTGG AAGCTGCAGC AAAGATTCTC
AAAGGAAAGA CTGTAAAGAT TAGAACAGAA ATTCTTCCAG CATCTATCTC AATTTACAAG
CGTGCAATGG AAAATGGATT ACTTACCATA TTCTTAGATG CAGGCGTTAC TGTAGGTCCA
CCAACTTGTG GTGCATGTTG TGGAGCACAC ATGGGTGTTT TGGCTAAAAA TGAAATCTGC
ATAAGCACTA CAAACAGAAA TTTCCCAGGT AGAATGGGTC ATGTAGAGTC TGAGACATAT
CTTTCATCTC CAATGGTTGC TGCAGCTTCC GCAGTAACTG GAAAAATCAC TGATCCGAGG
GATTTGTAA
 
Protein sequence
MNIVEKILAR ASGKSQVAPD DVVFAKVDKV MVHDVSGPGV LKVFDKLKNK GVDVSKLWDP 
TKVWVAEDHF VPSAEKISAE NIVKLSNFTK NYGIEKHFKY GMGQYGICHT LSHEEAMVMP
GDVYVGGDSH TNTTGALGAF ACGLGHTDIA YVLLNGQIWF KVPETDYFKL NGKLPDHVMA
KDLILKIIGD IGTDGGNYRT MQFGGTGIDE MSVESRLTLC NMTTEAGAKN GIAEADQKVV
DYLSSRGATN AQVFKGDDDA QYANVYEYEA SEMEPLVAKP FSPENIAVVR EAPSVELDKS
YIGSCTGAKY EDLEAAAKIL KGKTVKIRTE ILPASISIYK RAMENGLLTI FLDAGVTVGP
PTCGACCGAH MGVLAKNEIC ISTTNRNFPG RMGHVESETY LSSPMVAAAS AVTGKITDPR
DL