Gene Nmar_0886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0886 
Symbol 
ID5774374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp776452 
End bp778128 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content38% 
IMG OID641316525 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001582220 
Protein GI161528394 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGAAA TCTCTAGTAG AAATGTTGTA GAAGGTACTT CTCGTGCTCC ACAAAGAGCA 
ATGTACAAAG CTATGGGATT AGATGATAGT GATTTATCAA AACAAATGAT TGGAGTTTGT
CATACAGGAA ATGAAGCAAC TCCATGTAAC ATCCATCTTC CAAGATTAGC TCTCAAAGCA
AAAGAAGGAG TTTCAGATAC GGGTGCAACA CCTAGAGAGT TTTCAACAAT TGCAGTAAGT
GACGGAATTG CAATGGGACA TGAAGGAATG AAATCATCAT TAATTTCTCG TGAAGTAATT
GCAGATTCTA TTGAATTAAT GGTAAGAGCA CATCAATACG ATGCACTAGT AGGAATTGCA
GGGTGTGATA AGAGCCTTCC AGGTACAATG ATGGCAATGG CAAGATTGAA TGTTCCATCA
GTGTTTGTGT ATGGGGGAAC TATCATGCCA GGAATGCTTA ATGGAGAAGA ATTAACAGTT
GTAGATGTTT ATGAAGCAGT TGGAGCATAT GATGCAGGAC AGTTATCATT AGAAGGTCTA
AAAAATATTG AAAATACTGC ATGTCCAAAC GAAGGATCCT GTGGAGGAAT GTTTACTGCA
AATACTATGG CATCAATTTC AGAAGCAATT GGTCTTGCAT TACCAGGCAG TGCCAGTCCA
CCTGCAGAAG ATGATAGAAG AGAAAAGATG GTGTATGATA CTGGAGTAGC ATGTGCAAAA
GCACTACAGA TGGATATCAG ACCAAGAGAA GTTCTTACTT TTGAAGCATT TGAGAATGCA
ATTACAATGC TAAACTCTGT TGGAGGTTCA ACAAATGGAA TTTTACATTT GTTAGCACTT
GCAAATGAAG TTGGTATCAA ATTAACATAT GATGATTTTG AAAGAGTTAG AAAAAAGACA
CCCCACATAG CAGACATGAA ACCAGGTGGA AACTATGTTA TGAATTCACT TGATAAAATT
GGAGGAATTC CATTTGTATT AAAGAAACTA CTTGACAAAG GACTCTTAAA TGAAAATTGT
ATTACAATTA CTGGAAAGAC AATCAAAGAA AATTTGAATG CCATGGTCAT GCCACAAACA
GAACAACAAA TAATCAGACC AATGGAGAAC CCACTTCACA ATGTTGGAAC TGCAGTAATT
CTCAAGGGTT CACTTGCACC AGATGGTGCA GTGATAAAGA CTGCGGGAGT AGAAATGACA
AAATTCACAG GAAAGGCTCG TGTCTTTGAT AGAGAAGAAT TAGCATTTGA GGCAGTAAAA
AGAGGCGATA TTGATGAAGG ACATGTTGTA GTAATTAGAT ATGAAGGACC AAAAGGAGGT
CCAGGAATGA GAGAGATGCT TGCAACTACT GCAGCACTCG TAGGTCAGGG ATTAGGCAAG
AAAGTAGCCA TGGTTACTGA TGGAAGATTC TCAGGAGGAA CAAGAGGGTT CATGGTAGGA
CATGTTGCTC CAGAAGCATA TGTCGGTGGA CCAATAGCAC TTGTAAAAGA TGATGATGAA
ATTACAATAG ATACTGAAAC TAACATAATT GATCTTCATG TGTCATCTGA AGAATTAGAA
AATAGAAGAA AACAATGGAG TCCTCCAAAA CCAAATTACA CATCAGGGGC TTTAGCAAAA
TTTGCAACAT TAGTTGGTTC TGCAGCTGAA GGTGCTATTA CAAAACCTAA TCTATAG
 
Protein sequence
MEEISSRNVV EGTSRAPQRA MYKAMGLDDS DLSKQMIGVC HTGNEATPCN IHLPRLALKA 
KEGVSDTGAT PREFSTIAVS DGIAMGHEGM KSSLISREVI ADSIELMVRA HQYDALVGIA
GCDKSLPGTM MAMARLNVPS VFVYGGTIMP GMLNGEELTV VDVYEAVGAY DAGQLSLEGL
KNIENTACPN EGSCGGMFTA NTMASISEAI GLALPGSASP PAEDDRREKM VYDTGVACAK
ALQMDIRPRE VLTFEAFENA ITMLNSVGGS TNGILHLLAL ANEVGIKLTY DDFERVRKKT
PHIADMKPGG NYVMNSLDKI GGIPFVLKKL LDKGLLNENC ITITGKTIKE NLNAMVMPQT
EQQIIRPMEN PLHNVGTAVI LKGSLAPDGA VIKTAGVEMT KFTGKARVFD REELAFEAVK
RGDIDEGHVV VIRYEGPKGG PGMREMLATT AALVGQGLGK KVAMVTDGRF SGGTRGFMVG
HVAPEAYVGG PIALVKDDDE ITIDTETNII DLHVSSEELE NRRKQWSPPK PNYTSGALAK
FATLVGSAAE GAITKPNL