Gene Information |
Locus tag | Nmar_1628 |
Symbol | |
ID | 5774668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1484428 |
End bp | 1486599 |
Gene Length | 2172 bp |
Protein Length | 723 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641317282 |
Product | regulatory protein ArsR |
Protein accession | YP_001582962 |
Protein GI | 161529136 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1328] Oxygen-sensitive ribonucleoside-triphosphate reductase |
TIGRFAM ID | |
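The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length includes the terminal stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1484428
END_BP = 1486599


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2172
print(protein_length(length_bp))  # 723
```

Both results agree with the Gene Length (2172 bp) and Protein Length (723 aa) fields.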
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATGT CTGATGAAGA ATTAGAAATG GATCTAGAAG GTGATCTAGA TGCAGACATG GATGACGATG ATCTAGATAC AGACTTAGAT GATGATTCAG ATTCACCAAA ACGCGGTGGA ATTTTACAAT CTACATCAAA ACGTGTCAGA ATGATCTTTT CTGTCATGGC AAGTCCTAAC AGAATCGATA TCCTGAGAAT TCTAAATTCA AAAGGCCCTC TAACTTATTC AGAATTAAAA TCTCTGGCTG GATTCAAATC AAAAAAAGAG AGTGGAAAAT TTGCATACCA CTTGAGAAAA TTACTTAGAC AATCACTTGT TGCATTAAAC AAATCTGAAA GACGTTACAC AATTACAAAT CTTGGAAAAC TAGTTTTGAG CTTAGCAAGA CAAATTGAAG AAAGATCAAT TATTGAAAGT GGAAAGATGT ATGTTAGAAC ATCACACGAA TCAATTGAGG AATTTAATTC TCACAAAATC ATACAATCAT TGGTACGTGA AGGTAGCCTC CCACTTGAAT TAGCACAAAA AATTACTGAA GAAGTTGAAA ATAGAATCTA CAAATATCAA ACTACCTATC TTACAGGCTC ACTTATCAGG GAAATGGTAA ACTCTGTCTT GCTTGAGCAT GGTCATGAGG AATACCGCAA CAAGCTTGCA CGCCTAGGCT TGCCTGTGTT TGACGTTCAA GAGATGATTA CTAATCTAGA CAATGTAGAC AATGGCGCTG AAGGTCTCTT GTTTAAAACA GGACAAACAG TATTTGCTGA ACATCTTCTG ACAAACATCT TACCAAAAGA TGTAGCTGAC TCTCATCTTT CTGGTGATCT ACACATTACA AATCCTGGAA TCTGGTCAAT GATTCCTGAT ACAATGTTTG TTAACATTAA AGAATTAATT GAAGATGGAA TTGATCTTGG TGGAAAGTAT CTTGATGTTT CAAGAATTCC TGCATCAAAA CAACTTGATG AAATTACAAG TGCACTGTCT ATTGTAGTCT CTCTTCTTTC AAAAGAAGCT TCACAAGAAA TTATACTTGA TGGATTTGTT TCACTATTCT CAAAACATTC AAAATCACTT CCAGAACTAG AACAAAAATT AACTACAGCA TTTGCAACTG CATCTACAAC TTCCAAATAC AATAAATCAA GTACTAACGT TTCAATTAGA TTACAATTAG GATCTGATAC AAAAATAATT AATTCAATTA TTAATGCATA CAAAAACTAC ACAAAGCTTA CCCCAATTCC AAAAATTGGA TTAATCATAG ATAATGAAAA AGGAAAGATC ACAGATGTTT CAGCAGCAGT ATCAGAAATA ATTTCACTTG GTGGAAAAGT AATGTTTGCC AAAGGACAAA CATCAAGTCA TGGTATTACT AATGGATCAA CTAAAAGTTC TGGTCCACTT TCAATAATGC TTGAATCAGT ATCAATCAAT CTTCCAAGAT TAGCATTTGA ATCAAACAAA GATGAAACTT ACTTTAGAGC AAGATTGGCA TTACTCATGA AACCAGCTTT ATCTTCCATG GCCTTGAGAA AGAAAGACAT TTCTGATTTG ACTAGACGCG GATTGAATCC AATTTTGGCC AAAAATACTC AGTATATGCA AAAAAGTAAT GTTTCACTTG TAATCAATTT GGTAGGCCTC AAAGAGGCAG TCTTTAACAT CCTTGGATTC CAAGATAATA AAGAAGGTCG TGAAATCCTT CACAAGGTAA TTGAAACTGC AGTAGACGTT GGCGCCAAAA AAGGCAAAGA ATTAGGTGAT CCTGTGGCTA TTTGCATGAC TGAAACTGAA 
AGTGCAACCA GATTTGCTAC TCTTGATGGT GAGAAATATG GCAAAAATTC ATCCTTAAAC TCTATGGAAG GTGATTCTTA TTCTGAGGGA ATCATTATTG ATGCCTCTGA AGTCTCTGAT TATACTGCCA AGAGCGAGCC AATCTCTGAG TGTAACAAAC TCTCAAAGTT GCTAAACGGG GGATTGTTTG TAACACTAAA TATCGGCAAA GATGCAAAGC CTGCAGAAAT TAAAAAAGCA ATTGAGAAGA CATCAGAACT TACAACATCC TTCAAGCCTG TTCAAGACAT TGCAATCTGT GGTGAGTGTG GTTTTAAAGA TGAACCGTTT GAAGATAAGT GTCCAAAGTG CAAGTCTCCA TATGTTGTCT GA
|
Protein sequence | MKMSDEELEM DLEGDLDADM DDDDLDTDLD DDSDSPKRGG ILQSTSKRVR MIFSVMASPN RIDILRILNS KGPLTYSELK SLAGFKSKKE SGKFAYHLRK LLRQSLVALN KSERRYTITN LGKLVLSLAR QIEERSIIES GKMYVRTSHE SIEEFNSHKI IQSLVREGSL PLELAQKITE EVENRIYKYQ TTYLTGSLIR EMVNSVLLEH GHEEYRNKLA RLGLPVFDVQ EMITNLDNVD NGAEGLLFKT GQTVFAEHLL TNILPKDVAD SHLSGDLHIT NPGIWSMIPD TMFVNIKELI EDGIDLGGKY LDVSRIPASK QLDEITSALS IVVSLLSKEA SQEIILDGFV SLFSKHSKSL PELEQKLTTA FATASTTSKY NKSSTNVSIR LQLGSDTKII NSIINAYKNY TKLTPIPKIG LIIDNEKGKI TDVSAAVSEI ISLGGKVMFA KGQTSSHGIT NGSTKSSGPL SIMLESVSIN LPRLAFESNK DETYFRARLA LLMKPALSSM ALRKKDISDL TRRGLNPILA KNTQYMQKSN VSLVINLVGL KEAVFNILGF QDNKEGREIL HKVIETAVDV GAKKGKELGD PVAICMTETE SATRFATLDG EKYGKNSSLN SMEGDSYSEG IIIDASEVSD YTAKSEPISE CNKLSKLLNG GLFVTLNIGK DAKPAEIKKA IEKTSELTTS FKPVQDIAIC GECGFKDEPF EDKCPKCKSP YVV
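As a sanity check on the two sequences above, here is a minimal sketch of codon-by-codon translation. It assumes the displayed gene sequence is already the coding strand: it begins with ATG even though the gene lies on the minus strand. It uses the codon assignments of translation table 11, which match the standard code for elongation. The `translate` helper is hypothetical and ignores alternative start codons.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); identical for
# translation table 1 and table 11 apart from permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence, copied from the record.
first_bases = "ATGAAAATGT CTGATGAAGA ATTAGAAATG"
last_bases = "TATGTTGTCT GA"

print(translate(first_bases))  # MKMSDEELEM
print(translate(last_bases))   # YVV
```

The two outputs match the first ten and last three residues of the protein sequence listed above.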