Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0832 |
Symbol | |
ID | 5774144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 735149 |
End bp | 736402 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641316470 |
Product | hypothetical protein |
Protein accession | YP_001582166 |
Protein GI | 161528340 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.000822238 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTAGATT TATGGGTTGA AATTGGAGCG TTAGCTGTAC TTATTGGATT ATCTGGATTC TTCAGTGGGT TAGAGGTTGC ACTAGTAGGA GTAAGAAAAT CCAAAGTTGT TCAATTATTC AATGAAGGTA AGAAAGGTGC TAAGGCACTT TACAAATTAA AAACAAATCC TGGATGGATG ATGTCTAGTG TCAATCTGGG CAATAATTTG GTGAATGTTG GGGCATCTGC ATTGGCAACA AGTGTTGCAA TCAGAATGTT TGGTGATGAA GGGTTAGGAA TTGCCGTGGG CATCATGACA TTTTTGATTC TAATTTTTGG AGAGATTACC CCAAAAACAT ACTGTAATGC AAACTCTACA AAAATTGCAT TAAGATACGC TCCAATATTA TTAGGATTTA GTTACGCATT ATATCCAGTT GTAAAATTCT TTGAAACCAT TACAAAAGGT GTAGTAAAGA TGACAGGTAG CAGTTATGCT CCTCCACCGA TTACTGAAGA GGAGATTAAA GGAGTTATTG ATCAAGGCTT GGAAGAAAAA GCACTTGAGA GAGATGAAAT GGAATTAGTT CACGGGGCAT TGAAATTTGA TGACACTGTA ATTCGTTCAG TGATGACCCC AAGAACAAAA ATGTTTACAC TAAATTCAAA AATGTTACTT TTTGAAGCAT TACCACAAAT TAACCAAAGT GGTCATTCTA GAATTCCAAT TTATGGGGAC ACACAAGATG ACATTGTAGG TTTTATTCAT GCCAGAGATG TCCTAAAAGA ATTAGAAAAA GACAACGAAG TTGTTAGTTT AGAGCAAATT GCAAGAAAAC CAGTATTTGC ATCTCAAGAA AAGATGGTTA GTTCTTTGCT AAAAGAGATG AAGGGAAGAA AAACACACAT GGCAATTGTT GTAGATGAGC ATGGAGGAGT TGAAGGGTTA GTCACACTAG AAGATTTGTT AGAAGAGATT GTCGGAGAGA TTGAAGATGA AACGGATCTA ACTAGAACTA TAGGCTATGA AAGAATTGAT CAAGACACAA TTGTCACAAA CGGAGATATT GAAATTGATA TTGTAAATGA AATTTTCAAA ACTAACGTAC CTGAAGGTGA TGATTATGCA TCATTGAGTG GTTTGTTACA TGAAAGACTA CAAGACATTC CACAAGAAGG GGACAAAGTA GAGGTAGAAG ATCTTAGAAT AGTTGTAGAA AAAGTATCAA AGAATATTCC TCAAAAAATC AGAATAGAGA AAATTAGAAC CTAG
|
Protein sequence | MVDLWVEIGA LAVLIGLSGF FSGLEVALVG VRKSKVVQLF NEGKKGAKAL YKLKTNPGWM MSSVNLGNNL VNVGASALAT SVAIRMFGDE GLGIAVGIMT FLILIFGEIT PKTYCNANST KIALRYAPIL LGFSYALYPV VKFFETITKG VVKMTGSSYA PPPITEEEIK GVIDQGLEEK ALERDEMELV HGALKFDDTV IRSVMTPRTK MFTLNSKMLL FEALPQINQS GHSRIPIYGD TQDDIVGFIH ARDVLKELEK DNEVVSLEQI ARKPVFASQE KMVSSLLKEM KGRKTHMAIV VDEHGGVEGL VTLEDLLEEI VGEIEDETDL TRTIGYERID QDTIVTNGDI EIDIVNEIFK TNVPEGDDYA SLSGLLHERL QDIPQEGDKV EVEDLRIVVE KVSKNIPQKI RIEKIRT
|
| |