Gene Information |
Locus tag | Nmar_1613 |
Symbol | |
ID | 5773987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1470803 |
End bp | 1471993 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 641317266 |
Product | amidohydrolase |
Protein accession | YP_001582947 |
Protein GI | 161529121 |
COG category | [F] Nucleotide transport and metabolism; [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
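The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal Python sketch of that check. It assumes 1-based inclusive coordinates (the convention under which the listed values agree) and assumes the CDS length includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Nmar_1613 record above.
length_bp = gene_length(1470803, 1471993)
print(length_bp)                  # 1191
print(protein_length(length_bp))  # 396
```

Both results match the Gene Length (1191 bp) and Protein Length (396 aa) listed in the record.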
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTAATCA AGAACATTAG TCTCCTTTTA GGAAAAGAAT TAGAATTTGT TTCAAAAACA AATGTACAAA TTCAAGATGG TAGATTCAAA CGAATTCAAC CTAACATTAA ACCAAGTGGT AAAGAGGATT CTATTGACTG TGAAGATCTT TTATTGATCC CAGGATTTAT CAATGCACAT ACACACATAG GTGATTCAAT TGGAAAAGAT GTCACTCTAG AAAGTTCTGT AGATAAAAAA ATACATCCTG TTTTTGGGGC AAAGTCAAAA ATTCTAAAAA ACACTCCTCC TGAAAATTTG TCTAATTTTA TGAAAAATAC ATGTCATTCT ATGATTAGAA AAGGAATAAC CACCTTTGTT GATTTTAGAG AAGGGGGTTT AGATGGTGTT ATCTTGTTGA AAAAAACATT ATCTGAAATT CCAATCCGAT CAATTATTTT GGGTAGGGTT AATTTCTACC AAAATTCAAC TGAAATCAAA AAAAATCTCC CCATTCCTAA AGAAAAAGCC AAGGAATTGC CTCTAATTCT TCAAAAATGT GATGGTATTG GAGTTAGTGG TGCAAATGAG AACAGTACTT CAACATTGAA TCATTACTCA AAGACATCAA AGATTCGAGC AATTCATTCT GCTGAAACAA AACAGAGCGT TTCAAGATCT AAAAAGATGA CTAGAAAATC TGAAGTGATT CGTGCATTGT CCATGAAACC TCATTTTCTT ATTCATATGA CTCATGCATC AAACAGTGAT CTTCATCTAG CTGCAAAAAA AACTCGAGGA ATAGTAGTTT GCCCAAGAGC AAATTCTTCT TTGGCTGAAG GAATTCCTGA CATTACTTTG ATGCAAAAGG CCGGTTGTAC GCTTGGATTA GGCACAGATA ATGTTATGAT AAACTCTCCT GACATGTTCA GAGAAATGGA TTATCTTTGG AAAGTCACAA TGGGCATTCA TAAAAAAAGA ATCAATCCTA AAGAAATTTT GAAAATGGCT ACCGTAAATG GAGGAAAAAT ACTAAAAAAA GACATTGGAG TAATTGAAAC CAAAAAAATT GCTGATTGCA TATTTCTAAA CAAACATGCA TTAGATTTAG AACCAATGCA TGAACCATAT GCATCTATTG TACATAGAGC ATCTGAATCT GCAATCCAAG CAGTAATGAT TGGAGGTAAA ATAGTTCATG GAAAAATCTA G
Protein sequence | MLIKNISLLL GKELEFVSKT NVQIQDGRFK RIQPNIKPSG KEDSIDCEDL LLIPGFINAH THIGDSIGKD VTLESSVDKK IHPVFGAKSK ILKNTPPENL SNFMKNTCHS MIRKGITTFV DFREGGLDGV ILLKKTLSEI PIRSIILGRV NFYQNSTEIK KNLPIPKEKA KELPLILQKC DGIGVSGANE NSTSTLNHYS KTSKIRAIHS AETKQSVSRS KKMTRKSEVI RALSMKPHFL IHMTHASNSD LHLAAKKTRG IVVCPRANSS LAEGIPDITL MQKAGCTLGL GTDNVMINSP DMFREMDYLW KVTMGIHKKR INPKEILKMA TVNGGKILKK DIGVIETKKI ADCIFLNKHA LDLEPMHEPY ASIVHRASES AIQAVMIGGK IVHGKI
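The gene and protein sequences above can be related with a short translation check. The sketch below is illustrative only. It builds the codon table from the NCBI amino-acid string shared by translation tables 1 and 11 (bases ordered T, C, A, G) and translates the first 30 nucleotides of the gene sequence. It also computes the GC fraction of that fragment, which will differ from the 33% reported for the record. The names `translate` and `gc_fraction` are hypothetical helpers, not functions from any library.

```python
from itertools import product

# Amino-acid assignments for NCBI translation table 11 (identical to table 1
# for sense/stop codons), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon, stopping at '*'."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (spaces ignored)."""
    dna = dna.replace(" ", "").upper()
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
fragment = "ATGTTAATCA AGAACATTAG TCTCCTTTTA"
print(translate(fragment))  # MLIKNISLLL
print(gc_fraction(fragment))
```

The translated fragment matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same functions would be the natural way to check the whole record.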