Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0477 |
Symbol | |
ID | 5774452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 429251 |
End bp | 430828 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641316109 |
Product | NHL repeat-containing protein |
Protein accession | YP_001581811 |
Protein GI | 161527985 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.176502 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAAACAA TTTCACTTGC TGTAGCCGTA ATTACTGCAA TTTTGCTATC TGGAACTTTT GCTCCATCTT CATATGCATT AGGTGATTAT GATTTTCTTG CTGGATGGGG TGAGTTTGGA ATATCTACTC CTGGACACCT TTCACACCCA CAATTTATCG CAGTTGATGA TGAAGGGAAT GCATACATTA GTGATTTAGG AAATAAACGT GTACAAAAAT TCTCCAGCTC TGGTGAGTTT ATTCTTAACT TTGGAGAAAG TGGCAAATCC TCTGGCCAAT TCCATCACCC TTCCGGTGTT GCAGTTGATT CTGATTTTGT CTATGTGGCA GATCAAAATT TGCATAAAAT TCAAAAATTT ACCCTTGATG GAGTGTTTGT AGATGAATGG GGAAAATACG GTAACCAAGA TGGTCAATTC AAGTCCCCAA AAGATATTGC AGTAGACTCT GATTTTCTTT ATGTTGTAGA TGCTGATAAC TATCGAATCC AGAAATTTAC TACTGATGGA GAATTTGTAT TATCTTTTGG TTCTGGTGGT ATGAATCATG ATCAATTTCT TATTTTATCT GGAATAGCAG TTGATGATGA TGGAAATATC TACATCACAG ACAAAGGAAA CCGTAAAATT GAAAAATTTA CTTCTGATGG AATTTTGATT AAATCTTATC CACTATTTGG TACAAACTAT GTATTTGCTC CTACGGGAAT CACTGTTGAT TCTGATGGGA AAATATTTGT CATAAACTCT GCAGAAAACA GGATTTTGTA TCTTGAACTT GATGATAATT TACGTCTAAG TGTATTTGAG CAACTTGGAC CATTTGGTAA CTCCTTTATT GCTCCAACTG ATCTTACCTT TGGATTTCAA GGTAACTTGT TGATAGTAGA TTCTGCTGCT CATAAAGTAA AATCCTTTGA AACCCCATTT TATGATGAAA CAAAAGTTTT TCAAACAACT GAAATTATTG CACCAGAAAT AACTGAAGGT TATGAATCTG ATGATATTGA TCCAACAATT ATGGCTCCAA GTGACATAAA ATTAGAAGCA ACTGATCTAT TTACTCCTGT ACCTATTGGT GATGCTGTTG CAAATGACTT GCAGAGTGGA ATCAAAACTA TTTTGAATAA TGCTCCTGAG GCATTCTCTT TGGGTGTAAA CAAGGTAACT TGGGTAGCAT TTGATAATGC TGGAAACACT GCTGAAGACT ATCAAACAGT TACAGTCTTT GCATGTGGTC ATGTCTATTC TGATTACAAC ATGATAGTTG GAACTGATGA GAATGATGTT CTTCTGGGAA CCTCTGGTGA TGATCTAATC TTTGGGTTAG AAGGAAATGA TATCATTAGT GGGTTAGAAG GAAATGACTG TATCTTTGGC GGTGATGGTG ATGATATTGT CTATGGCGAT GATGGATATG ACACCATTAG TGGAAACGGT GGACATGATG TCCTCAAAGG AGATTCTGGA TCTGATGTAA TCTATGGTGG ATCAGGCTCT GATGTACTTG ATGGCGGTTC TGAGAATGAC AACTGCTATG ATTCACTAGA AAATGTTGTT CTTAATTGCA ACGAATAG
|
Protein sequence | MKTISLAVAV ITAILLSGTF APSSYALGDY DFLAGWGEFG ISTPGHLSHP QFIAVDDEGN AYISDLGNKR VQKFSSSGEF ILNFGESGKS SGQFHHPSGV AVDSDFVYVA DQNLHKIQKF TLDGVFVDEW GKYGNQDGQF KSPKDIAVDS DFLYVVDADN YRIQKFTTDG EFVLSFGSGG MNHDQFLILS GIAVDDDGNI YITDKGNRKI EKFTSDGILI KSYPLFGTNY VFAPTGITVD SDGKIFVINS AENRILYLEL DDNLRLSVFE QLGPFGNSFI APTDLTFGFQ GNLLIVDSAA HKVKSFETPF YDETKVFQTT EIIAPEITEG YESDDIDPTI MAPSDIKLEA TDLFTPVPIG DAVANDLQSG IKTILNNAPE AFSLGVNKVT WVAFDNAGNT AEDYQTVTVF ACGHVYSDYN MIVGTDENDV LLGTSGDDLI FGLEGNDIIS GLEGNDCIFG GDGDDIVYGD DGYDTISGNG GHDVLKGDSG SDVIYGGSGS DVLDGGSEND NCYDSLENVV LNCNE
|
| |