Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1647 |
Symbol | |
ID | 5774743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1504301 |
End bp | 1505896 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 641317301 |
Product | hypothetical protein |
Protein accession | YP_001582981 |
Protein GI | 161529155 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAGA AATTCGTAAA AAAGATCCAA ACAAGTGGAC ATAATTTGAA AAATCTATTC CTGTTACTTT TACTGCCATT GTTGTTTGTA TTTACTTTTG ATAGTGCCTT TGGTCATGGT GTAGGAAGTG AGACATTTCC TCCTGTAAAC CTTGATGGAA AACAAGTAAC TGTAGAGGTT TCTTCATCAC AAAGTGATCC TGAAGCAGAT GACGATCAAC AAATCTCTAT TTCTCTGATT GACTTTGATT CAAAAATTAC ACTACGTGAT GTTACATTTC ACATAAAATC TGAGAAAGGA AACCAATTTC TCTTTGAGCA AGAATTCAAA ACAGATAATG GTTTTCTAGT ATTCAATTTT GTTTCAGAAG AAACTGATTC TATTGTAATA GAAGAAGAAA CTGGTGCTAA TTTCTTTGGT TCCATATTGG GATTAGATAG TAGACTAATT CATGTCAAAG GCCCAAAACT CAGTGAGGGT GGACTGTACA AATTTGATAT CAGTATACTT ACTGCTGATG GATATTCAAA AACACTTGAA AAACCACTAG TGTTTAATGC AGGAATTTCT ATTGCCCAAA CCTCTAGCCA TGATTTTGTT GATCCAAACT TTGGAGAACA AAGCATTGAC GTAATTACTT ACTATGATGA AATATCTGAC TTTGAATATG ATACTGATTC AAAAGAGATT AGATTTTCAA TGCCTTTTGA ATGGAGTCAT ACAAACATCA ATCAAACATC AGTTGTCCAT GAAGAGTTAG ATATTCCAAA AACCTATGGT GATTTGCTTG TATCTGGATT CACCATGTAC ATTAATGGAG TAGAACTCTC TGATGACATA TCTACAATAG ATGACTTTTT TTCTGACGGT CGTGTGGTGC ATTTTATTAT TTATCAACAA GAATTACTCC GAGTTCTTGA AAATGGCTCA AATGAAAATG GCATGAATTT CCTAATTACT CCTGATAGAG ATTATCCGCA TATGAGCTCA GTTACTGAAA ATGGACAATT TCGAATTTTT GCATCTTGGG AACCTGAAAA CTTGCAATCT GGTTCTGATG CAAAAATATT ATTTGATGTA ACTGATGTTT TCTTGAAAAA CAAACCTATA GCAACAAATT ATGATTTCTC TATTACACAA AATAACAAAG TCATTTACCA ACAAAGTGGA ACAAGTACTG ATTCAAGAGA AGAACATAAT GTAGTAGAGT TTACAATTCC ACAAGATGTT ACAGGAATTG TTAATCTAAA TTTTAATAAT TTAGATAATA ATGATCTTGC AAGAACAACT ATTCCAATTG TAATTGATAG AGTTACATCT CAAAAAGAAA TTACAATTCC TGATTGGATT AGAAACAATG CATTGTGGTG GTCTGAAGAA CAAATTGATG ATAATACATT TGTTCAAGGA ATTGAATATC TCATCAAAAA CAAAATAATT GTAATTCCAT CAACACAACA ACAAGATTCT TCATCCCAAG AAATTCCATC ATGGATTAGA AACAATGCTG CATGGTGGGC TGCAAAACAA ATAGACGATC AGACATTTGT CCAAGGACTG GAATATTTGA TTCAAAAGGG AATCATTCGT GTCTGA
|
Protein sequence | MEKKFVKKIQ TSGHNLKNLF LLLLLPLLFV FTFDSAFGHG VGSETFPPVN LDGKQVTVEV SSSQSDPEAD DDQQISISLI DFDSKITLRD VTFHIKSEKG NQFLFEQEFK TDNGFLVFNF VSEETDSIVI EEETGANFFG SILGLDSRLI HVKGPKLSEG GLYKFDISIL TADGYSKTLE KPLVFNAGIS IAQTSSHDFV DPNFGEQSID VITYYDEISD FEYDTDSKEI RFSMPFEWSH TNINQTSVVH EELDIPKTYG DLLVSGFTMY INGVELSDDI STIDDFFSDG RVVHFIIYQQ ELLRVLENGS NENGMNFLIT PDRDYPHMSS VTENGQFRIF ASWEPENLQS GSDAKILFDV TDVFLKNKPI ATNYDFSITQ NNKVIYQQSG TSTDSREEHN VVEFTIPQDV TGIVNLNFNN LDNNDLARTT IPIVIDRVTS QKEITIPDWI RNNALWWSEE QIDDNTFVQG IEYLIKNKII VIPSTQQQDS SSQEIPSWIR NNAAWWAAKQ IDDQTFVQGL EYLIQKGIIR V
|
| |