Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1562 |
Symbol | |
ID | 5774566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1430379 |
End bp | 1431731 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 641317215 |
Product | hypothetical protein |
Protein accession | YP_001582896 |
Protein GI | 161529070 |
COG category | [R] General function prediction only |
COG ID | [COG1571] Predicted DNA-binding protein containing a Zn-ribbon domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAT TAGTTACCAA AAACGTGGTA AAAAAAACTG TTCTAAACAT TGGCTTTGAT GATACTGATT CTCCAAAAGG AATGTGTACT ACTTTTTTGG CTTACAAAGT AGTAGATTTA CTCAAAAAAC AGAAAACTGA ATTTCTTGAT TTTCCAAGAT TAATTCGATT CAATCCAAAC ATTCCATGGA AAACACGTGG AAATGGAGCA GTTTCTATCA AAATTAAGAC TAGTAATCCA TCAAAAATAA AAAATCAAAT CAAAAATCTT GTTTCAAAAT ATTCTGATAC AAAAAATGGA GCAAATCCTG GTTTAGTTTT TTTTGAAAGT GATTCAATTC CTAATGAATT TACAAAATTT AGTAATTTAG CTTTATGGCA ATTAATTAAT CGAAATAACG CAAAAAAGTT TGCTAAAAAA AACAATCTTG AATATTTCTA CAAAGGAAAT GGACAAGGAT TGGTTGGTGC AATTGGTGCA ATAGGTTATG ATTTTAATGA TCATACACTA GAGCTATTGA GTTATCGTAA AAAACAAAAA TTTGGAAAAG AAAGAAAAAT TTCAACAAAA AGCGTTAAGG AAATGCAAGA AAAAACCTAT CCTGACACTT TCAATAGTTT TGATACAAAA AAAGGGCGAG TTTTAATCAC TCCTCACGGT CCAGATCCTG TATTTTTTGG TGTCCGTGGT GAAAATGTAG ATGCACTAGT TTCTGCATCT AAGATTCTAA AAAGTGAAGA AAAACTAGAT GGTTACATGA TCTTCAAATC TAATCAAGGA ACTGGTGACC ATTTGAAGAA TGAACTTACT TTTGAAACAA TGAAACCATA TGCTTCTGGA AAATTAACTG GAGTTGTTTC AAATTCCCCC AGAATTGCCA AAGGAGGACA TGTATTTTTT AAAATCCTCT CAAATGGTAA TGAGTTTTGG TGCGCAGTTT ACAAACCAAC GGGCATGTCT GTTATTGCTT CAAATTTGAT CAAAGGCGAT AAAATTTCTG TGGGTGGGGG AGTTAGAAAG GCTTCAAAAA ATTTCCCTCG AATAATTAAT CTTGAATTTA TTCAAATTAT TTCTCTTAAA AAACAAACCA AGACATCAAA TCCAATTTGT AAAAAATGTA CTAAAAAAAT GAAATCTAAA GGCAAGGGAC AAGGATTTGA ATGCATTCGC TGTGGAAAAA AATCTTCAAG AAAAATAAAT GAAATAGTTA CTAGAAACCT AGGCCCACAA CTGTATTTGC CAAAAATTTC TGCTCATAGA CACCTTACAC GTCCTCAACA AAGACAAGGC ATTCAAAACA AATCAACTAA TTTCAAAAAT TCTCTATCTT GGTTTTGTGT TTATCAAAAT TAA
|
Protein sequence | MSELVTKNVV KKTVLNIGFD DTDSPKGMCT TFLAYKVVDL LKKQKTEFLD FPRLIRFNPN IPWKTRGNGA VSIKIKTSNP SKIKNQIKNL VSKYSDTKNG ANPGLVFFES DSIPNEFTKF SNLALWQLIN RNNAKKFAKK NNLEYFYKGN GQGLVGAIGA IGYDFNDHTL ELLSYRKKQK FGKERKISTK SVKEMQEKTY PDTFNSFDTK KGRVLITPHG PDPVFFGVRG ENVDALVSAS KILKSEEKLD GYMIFKSNQG TGDHLKNELT FETMKPYASG KLTGVVSNSP RIAKGGHVFF KILSNGNEFW CAVYKPTGMS VIASNLIKGD KISVGGGVRK ASKNFPRIIN LEFIQIISLK KQTKTSNPIC KKCTKKMKSK GKGQGFECIR CGKKSSRKIN EIVTRNLGPQ LYLPKISAHR HLTRPQQRQG IQNKSTNFKN SLSWFCVYQN
|
| |