Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0886 |
Symbol | |
ID | 5774374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 776452 |
End bp | 778128 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641316525 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001582220 |
Protein GI | 161528394 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGAAA TCTCTAGTAG AAATGTTGTA GAAGGTACTT CTCGTGCTCC ACAAAGAGCA ATGTACAAAG CTATGGGATT AGATGATAGT GATTTATCAA AACAAATGAT TGGAGTTTGT CATACAGGAA ATGAAGCAAC TCCATGTAAC ATCCATCTTC CAAGATTAGC TCTCAAAGCA AAAGAAGGAG TTTCAGATAC GGGTGCAACA CCTAGAGAGT TTTCAACAAT TGCAGTAAGT GACGGAATTG CAATGGGACA TGAAGGAATG AAATCATCAT TAATTTCTCG TGAAGTAATT GCAGATTCTA TTGAATTAAT GGTAAGAGCA CATCAATACG ATGCACTAGT AGGAATTGCA GGGTGTGATA AGAGCCTTCC AGGTACAATG ATGGCAATGG CAAGATTGAA TGTTCCATCA GTGTTTGTGT ATGGGGGAAC TATCATGCCA GGAATGCTTA ATGGAGAAGA ATTAACAGTT GTAGATGTTT ATGAAGCAGT TGGAGCATAT GATGCAGGAC AGTTATCATT AGAAGGTCTA AAAAATATTG AAAATACTGC ATGTCCAAAC GAAGGATCCT GTGGAGGAAT GTTTACTGCA AATACTATGG CATCAATTTC AGAAGCAATT GGTCTTGCAT TACCAGGCAG TGCCAGTCCA CCTGCAGAAG ATGATAGAAG AGAAAAGATG GTGTATGATA CTGGAGTAGC ATGTGCAAAA GCACTACAGA TGGATATCAG ACCAAGAGAA GTTCTTACTT TTGAAGCATT TGAGAATGCA ATTACAATGC TAAACTCTGT TGGAGGTTCA ACAAATGGAA TTTTACATTT GTTAGCACTT GCAAATGAAG TTGGTATCAA ATTAACATAT GATGATTTTG AAAGAGTTAG AAAAAAGACA CCCCACATAG CAGACATGAA ACCAGGTGGA AACTATGTTA TGAATTCACT TGATAAAATT GGAGGAATTC CATTTGTATT AAAGAAACTA CTTGACAAAG GACTCTTAAA TGAAAATTGT ATTACAATTA CTGGAAAGAC AATCAAAGAA AATTTGAATG CCATGGTCAT GCCACAAACA GAACAACAAA TAATCAGACC AATGGAGAAC CCACTTCACA ATGTTGGAAC TGCAGTAATT CTCAAGGGTT CACTTGCACC AGATGGTGCA GTGATAAAGA CTGCGGGAGT AGAAATGACA AAATTCACAG GAAAGGCTCG TGTCTTTGAT AGAGAAGAAT TAGCATTTGA GGCAGTAAAA AGAGGCGATA TTGATGAAGG ACATGTTGTA GTAATTAGAT ATGAAGGACC AAAAGGAGGT CCAGGAATGA GAGAGATGCT TGCAACTACT GCAGCACTCG TAGGTCAGGG ATTAGGCAAG AAAGTAGCCA TGGTTACTGA TGGAAGATTC TCAGGAGGAA CAAGAGGGTT CATGGTAGGA CATGTTGCTC CAGAAGCATA TGTCGGTGGA CCAATAGCAC TTGTAAAAGA TGATGATGAA ATTACAATAG ATACTGAAAC TAACATAATT GATCTTCATG TGTCATCTGA AGAATTAGAA AATAGAAGAA AACAATGGAG TCCTCCAAAA CCAAATTACA CATCAGGGGC TTTAGCAAAA TTTGCAACAT TAGTTGGTTC TGCAGCTGAA GGTGCTATTA CAAAACCTAA TCTATAG
|
Protein sequence | MEEISSRNVV EGTSRAPQRA MYKAMGLDDS DLSKQMIGVC HTGNEATPCN IHLPRLALKA KEGVSDTGAT PREFSTIAVS DGIAMGHEGM KSSLISREVI ADSIELMVRA HQYDALVGIA GCDKSLPGTM MAMARLNVPS VFVYGGTIMP GMLNGEELTV VDVYEAVGAY DAGQLSLEGL KNIENTACPN EGSCGGMFTA NTMASISEAI GLALPGSASP PAEDDRREKM VYDTGVACAK ALQMDIRPRE VLTFEAFENA ITMLNSVGGS TNGILHLLAL ANEVGIKLTY DDFERVRKKT PHIADMKPGG NYVMNSLDKI GGIPFVLKKL LDKGLLNENC ITITGKTIKE NLNAMVMPQT EQQIIRPMEN PLHNVGTAVI LKGSLAPDGA VIKTAGVEMT KFTGKARVFD REELAFEAVK RGDIDEGHVV VIRYEGPKGG PGMREMLATT AALVGQGLGK KVAMVTDGRF SGGTRGFMVG HVAPEAYVGG PIALVKDDDE ITIDTETNII DLHVSSEELE NRRKQWSPPK PNYTSGALAK FATLVGSAAE GAITKPNL
|
| |