Gene Information |
Locus tag | Nmar_1399 |
Symbol | |
ID | 5773686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 1281980 |
End bp | 1284148 |
Gene Length | 2169 bp |
Protein Length | 722 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641317045 |
Product | hypothetical protein |
Protein accession | YP_001582733 |
Protein GI | 161528907 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
| |
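The coordinate and length fields above agree with each other, and the check can be sketched in a few lines. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Nmar_1399.
length_bp = gene_length(1281980, 1284148)
print(length_bp, protein_length(length_bp))  # 2169 722
```

Both results match the `Gene Length` and `Protein Length` fields, which is consistent with the inclusive-coordinate assumption.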
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTAGTT TGAATATAAT GAATAAATCC TCGATGAGAG GAATTTTCCT TTCAATTGTA CTATTATTTT CAATAACATT AGTTTCAATT CCAGATAACG TGTATGGTCA AGAGATTAAT GCTAACAGTA TAGGATTGGA AGAAACCGTG ATTATAGAAT TTACAAATGA ATTAGATAAA GAAATCAATA CTTTTAGAAT TTGGCTTGGT TCAGGTTTTA ATTTTGAATC TTTTAAAACC GAAAAAGGAT GGTTAGGTGA AAAAACACCA CAAGGGGTTA TAATTTTTAC AACTTCTGAA CCTATCAAAA AAGGAGAAAC AGTGAAGTTC GGAATAAAAA CGGATACCAA AAATCCAGGA ATTAATTGGA AAGCAATTGA CACAAAAAGT GAACAACAAG GTATGGGCAA AGTGCTACCA GAAGAATTAC CTAAAGTTGT AGAAAATACT GAGATTAAAC CAGTAGAAAA CATTGAGGAT AAACCAGTAG AAACAGGAAT TCTTTCTGAT TCAGTTTTTA GAATAATTCC AGAAAAGCCC AACATAGGGT CATCAATCAG AGTAACTGGA GATAATTTTG GTTCTTCAGA AGAATTTGAC TTTTACATAG ATAATTCAAA AATCGGTAGT TTTGTAACAG ATGAGAATGG TCACTTTATG ACAACAATGC AGATTCCAGA AAATCAAAAT GCAGACAGAG TGGATTTTAA AGTCAAAGAT AGTGAAGGAG GAGAGAAAAA GTATAGTGTG AGAATAGAAG AGATAGACAA CAGGATTGCA GAAGAAAATG TTAAACTCAC AATCAAAGGA ATACCAAATG TAATTCATAG AGGAGACTTT TTGGAGATTT CAGGTACAGG AGATCCTAAT AGTGCAGTTA CTGCAGAAAT AACTAGTCCA GATGGAGAGA TTATTAATGC AAGAACAGCG GAAATTGATG CCAAAGGAAA TTGGGAATTA GCAGAACCAA TAGTTGTTGC ATTAGACACA CCATTTGGCA AATACAGTGC AACAATTACT GACGGACGAG AAAGTATACT AAAATCATGG ACTGTTGAGT CTGACAAAAT AATAATCGTG GCGCCAAACA AATTGAAGTT TGAACCAGGA GAAGTAATGA TATTTAATGG AACTGCTTTG CCTAACAAAC CAATTGAATT TATTTTGGAA GATCCACAAG GTAAAGAATT ATTTTCAGAC ATTATTCAAA TAGATGAATC AGGGCATGTG GAATTCGAAT ACGCTACAGT GCAATCATCA CCTAAAGGAA CATACACATT GATTGCAACA CAAGAGAAAG ACACAGAATT CATATTTGCA GGACTCGGTC AATTACCTAC AACTCCAGTG AATTTAGAAT TTGATGAACT CAATTACAAT GCAGGAGATA CAGCAATAAT TTCAATTACA GGAAAATCTT CAGAAATTGC TAGTCTGTTA ATCATAGACC CATCGGATAA GCCAAAAGGA GATACAGTGT CCATTCAATT AGCACCAGAT GGTACTGCAG TATATGAACT GGACTTGACA GGATTTGCAT CAGGAGTGTA TTCTGCAGTA GTTAGTAAAG GAACCGCACA AAGTACAGAA ATTTTCACAG TAGGATTACA AACAGGTTCA GGAGAAATCA AAATCAACAC TACAAAATTA GATTATTCCC CAGGTGATTC CATTCTAATT CTAGGAGATA CAGGAGAAAA TGTTTTGTTA ACAATATCTT TGGCAGATCC TGATGGAAAC ATAGTAAAAG AAAAGGATAC TTTTTCAGAT AAAAATGGAA AGATTTCTGA AAGTACATTT 
AGAATTCCAT CAGATGGAAA AGGTGGTATG TGGTCAATTA ATGCAAAAAG TGGTTCAAAC TTCCACACAA TAGAGATCGA TGTTATCGCC ACAGTTACGG AAGGAGTTCA GATAAGTGTC GAGGAAGGAA TTAATGTTCC GGGTGTTGGT GATTCAATGC AAATCAAGAT AGTCGGCGTT AGTCAAACTG TACAAATTGA AATAATTGCA GCAGATGGAA CAGTCATAGA TTCATTAGAA TTTGTTGCAT CATCTGGAGG AGAAATAAAC CAACCATGGA TAATTCCTAA AGACACTGCA CCAGGAACAT ATACAATCAA AGTGGAAGAT GCATTTACTT CGGCAGAAGA TACTTTTGAG ATAAGTTAA
|
Protein sequence | MTSLNIMNKS SMRGIFLSIV LLFSITLVSI PDNVYGQEIN ANSIGLEETV IIEFTNELDK EINTFRIWLG SGFNFESFKT EKGWLGEKTP QGVIIFTTSE PIKKGETVKF GIKTDTKNPG INWKAIDTKS EQQGMGKVLP EELPKVVENT EIKPVENIED KPVETGILSD SVFRIIPEKP NIGSSIRVTG DNFGSSEEFD FYIDNSKIGS FVTDENGHFM TTMQIPENQN ADRVDFKVKD SEGGEKKYSV RIEEIDNRIA EENVKLTIKG IPNVIHRGDF LEISGTGDPN SAVTAEITSP DGEIINARTA EIDAKGNWEL AEPIVVALDT PFGKYSATIT DGRESILKSW TVESDKIIIV APNKLKFEPG EVMIFNGTAL PNKPIEFILE DPQGKELFSD IIQIDESGHV EFEYATVQSS PKGTYTLIAT QEKDTEFIFA GLGQLPTTPV NLEFDELNYN AGDTAIISIT GKSSEIASLL IIDPSDKPKG DTVSIQLAPD GTAVYELDLT GFASGVYSAV VSKGTAQSTE IFTVGLQTGS GEIKINTTKL DYSPGDSILI LGDTGENVLL TISLADPDGN IVKEKDTFSD KNGKISESTF RIPSDGKGGM WSINAKSGSN FHTIEIDVIA TVTEGVQISV EEGINVPGVG DSMQIKIVGV SQTVQIEIIA ADGTVIDSLE FVASSGGEIN QPWIIPKDTA PGTYTIKVED AFTSAEDTFE IS
|
| |
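The gene and protein sequences above can be cross-checked against each other and against the `GC content` field. The following is a minimal sketch, not the tool the database used: it assumes the sequences are pasted as space-separated 10-character blocks as shown in the record, and it builds the codon table by hand rather than relying on any library. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may initiate translation; because this gene starts with ATG, the sketch does not treat alternative start codons specially. The names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order. Table 11 shares these with
# the standard code; only the set of permitted start codons differs.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last codons of the gene sequence in the record.
print(translate("ATGACTAGTT TGAATATAAT G"))  # MTSLNIM
print(translate("GAGATAAGTTAA"))             # EIS
```

Applied to the full gene sequence, `translate` should return a string equal to the protein sequence with its spaces removed, and `gc_percent` can be compared with the reported GC content.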