Gene Nmar_0616 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0616 
Symbol 
ID5773001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp554316 
End bp557606 
Gene Length3291 bp 
Protein Length1096 aa 
Translation table11 
GC content35% 
IMG OID641316251 
Producthypothetical protein 
Protein accessionYP_001581950 
Protein GI161528124 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAGA CTCTGCATAT TGTATTTTTC ATATTCTTAC TTGTAGGGGT AATTGTTCCT 
GCATATGCTC AAACCGCAGA AAATGTTGTG ATTAACGAAG TTGATATTAA TCCTCCTGGC
GATGATTCAA AATCCATTTC TGAGTGGGTT GAACTTTACA ATCCTACTGA TTCTGAAATT
GATTTGAGTG GGTGGCAAAT TGCGTCAACA ACTGTTCTAA AGAAAACTAT GACAATTGGC
TCTGGAACTA CTATTGAACC TGGACAATTT TTAACTTTTT CTTATCAAAG TGTTTGGTTT
ACTGACATCA ATGAATCTGT TGAATTACGA GATGAAAATG GAATTGTAAT TGATAAGACT
CCAATACTCT CAGATATTAA AAATGACTTT ACATCTTGGC AAAGGATCTA TGATGGTTAT
GATTTGGACA ATCCTGATGA TTGGAAATTT GTAACATCTA CCTCTGGTTC TACTAATGGA
AAACTAGTAC AAGAACAAAA ACAAGATGAA ATTTCTCTTT CTCTTTCTTC AGATAAATCA
TTTTACTTGT TTGGAGAAAC TGCTGTAATT GAAGGAAGTG TTTCTAAAGC AGTATTTGTT
GAGAAACCAT TCTTTCAACC TGAAGTAATT ACTGTAAAAA TTAGTGGTCC TAATTTTGAC
AAAACATTAA CTTTGTACCC TGATCTAAAC AAAAATTTTA AAACAACTTT GGGGCTGCAA
AAAGTTTTAG GAATCAATGA AGGCGATTAC ACAATAAATG CATCTTATGC AGGATCAACT
GTTAATACGT CATTCTCAGT GGGATATGAA ATTACTGAAG AACAAGTACA ACAAGACAGT
TTTCTCAATT TGAACACTGA CAAAACTCAA TACATTCCCG GTCAAATGGT TTCAATTACC
GGTACTGCTA CCGATATTGT TGAATTTCAG GGGATGAAAT TCACTGTTAC TAATTCTGAG
GGTACAATTG TGTATAATGG AAATTTGTTT CCTGTAAATG GTCAATTTAA AACCAGTATT
TTCTTGTCAA CTGTAAATCC TGTATATGGT ACTTATGAAA TAATTGGTGA ATATGTTGAC
AAATCTGTAA TCACAACATT TGAAGTAATA GAAGATGCAA AAGAAACTGT TCCAATATCT
CTGTGGACTA ATAAGGATGT TTATGGAACT GGTGAAGTAG TAACCATAAC TGGAAGACTA
AATGATGTGT GGGTTGCTTC CCTTGATTTA GAAATAGTAC AAACAAAGAA TCTTGCTTTG
GGCACTGGCA GCCAACTTGG TGGTGGTAAT GTTCTAAAAA TTCTTGATGT TGTCCGAATT
GACGGTGATG GTAAATTCAA GTATTCTTTT ACAATTCCTG ATGTGGATAC TCGATTAGGT
GATTATAAAA TCAAAGTTTC TAAAGATATT GGCTCTGCAA AAAAGACTGT CATGGTAGTA
AAAGAACCTG AAAACTTTGT CCCAATCACT GATCCACTAA TTGTTACAAC AAACAAGCTT
GTTTATGATT TCACTTTGGA TAAAGAACTC GTAATCCGTG GTCAAATAAA GAACCCTGTA
GATCGAACAA GTTTTGAAAC TCCTACCGTG TTAATTTCAT TTAAAGATGA AAACGGTAAA
CCCCTTTCTA TTATTGGTGT TCCTGAGGGT GTTAATCAGG GAGCTGCTGG CGGCACAGGT
AGTGTGACTG CAAAATATCA GTTCACTGCA ATACCTGAAT CTGGTGGAAC ATTTTCTGTA
ACTGCTGACA TTAGTAGAGG TATATTTTCT GAAGGTACGT ACACTATAAC TGCACAATAT
CTTGATCTTA CTTCAACAAC TTCGTTTGAT ATTGTTGATG ACTTAGCAGG CGGAGGTGTG
GTATCTCTTG ATAAAGATGT TTATGGTTTA GGTGAACAGG TGGTTGTTAG TGGTATTATT
CCGACAAGTG ATCGTTCTGT AACTATTTCT GTTACAAGAC CTGATGGTAC AAAAACCACA
TATGGCGAAG CTGTTGATAA CCAAAGATTC TCTTGGTCTT GGACTACCCC TGTTTCAGAA
CGATATCAAA CGCTGAAATC TGACGGTGAA CGTGGTGTTA CATTTTCAAA TTTCGGTATT
TATAAAATCA AAGTAGCAGG TGATACATAC AGCAAAGATT TACTATTCAA AGTATCTACT
GATCCTGAAA ATGATTCTTT ATCTGCCACC CCTCTTTTTA TAACTACAGA CAAATCTCTG
TACCAAGCAG GAGATAAACT CAAAGTAATT GGAAATGTTA TTCCTCGTGA TCAAGGTGAT
GAGGGATTAG TAGTTCCCGA CCGTGTAACT ATCAAAGTTT TGGATGGAAC ATTCCCGTAC
AAGCAAATTC ATGAAGCATC TGTTTACCCA AAACAAGGTG GTGAATTTTC AAGCTTGTTT
GAGTTACCTG CAACTATCTT TAGTGAGGGC ATGTATACTG TAAAAGCAAT TTATTCCACA
AAGCAAGTAA CATCAACATT TAGTGTTGCT AATGATTTTA CATTTGGTAT TGATGAACCT
GTATCGTTAT TAGCATCAAC TGATAAATCT GAATATTATC CTGGTGATAC CGTAATTATT
TCTGGAAAAC CAAACAAATT GATTTATCTT GAAGCCTATG ATGTAAGTAT CATTAAAAAG
TCTGATACTG AAATTACTTG TGGTTCTTTT ATTTGTGGAA CTCATGTTGG ACAAGTAAAA
TCCATTCGTC CTAGCCCATC TGGTTCTTTT ACTCATGAAT TCCCAATCAA GAATACATTA
TCTTCAATTG GAACATATGA AGTAACTATT GATGCTGACT TTGAAACAAA ACATATTAGA
TTTAACGTTG TAGAAGAACC TCTTGCTCCT AAATTAGAAA CAGTGATTGA AAAAGAAAAC
AGAATTCCTG ACAAAACAAT TTCTGTATCT ACTCCAGAAA AAACTGTAGA TGATGCAACC
TTTGCTCCTA GAGTTGTTTC TGGCTCCTTA CTTACTCCAA TTAAAGATGA AGCCTCTAAT
GTAAATCTCA AAGTATCTTC TGAGAGTGGT GTTTGTATAA TTGGTCCTGA TGCTGATTGT
CTTGTTAGTG AATCCACTAG AAAACCTGGA CAAATTTATG ATGTCGTTGA AGTAGATGGA
ATGAGTCTAA ATGTAAGGTA TAGTGGCCCT GATGTACGCC TAGAAAAATT CAGCATACTA
CCTGAATCTT CTGAATCATT CTTGCCTGAT TCTGATTGGA ATGTAGAAAT ACTCAAAGAT
GAACAAGTAT CCAGGTTCTA TTACAAAGTA ACTTACAAAA CAATAGAGTA A
 
Protein sequence
MNQTLHIVFF IFLLVGVIVP AYAQTAENVV INEVDINPPG DDSKSISEWV ELYNPTDSEI 
DLSGWQIAST TVLKKTMTIG SGTTIEPGQF LTFSYQSVWF TDINESVELR DENGIVIDKT
PILSDIKNDF TSWQRIYDGY DLDNPDDWKF VTSTSGSTNG KLVQEQKQDE ISLSLSSDKS
FYLFGETAVI EGSVSKAVFV EKPFFQPEVI TVKISGPNFD KTLTLYPDLN KNFKTTLGLQ
KVLGINEGDY TINASYAGST VNTSFSVGYE ITEEQVQQDS FLNLNTDKTQ YIPGQMVSIT
GTATDIVEFQ GMKFTVTNSE GTIVYNGNLF PVNGQFKTSI FLSTVNPVYG TYEIIGEYVD
KSVITTFEVI EDAKETVPIS LWTNKDVYGT GEVVTITGRL NDVWVASLDL EIVQTKNLAL
GTGSQLGGGN VLKILDVVRI DGDGKFKYSF TIPDVDTRLG DYKIKVSKDI GSAKKTVMVV
KEPENFVPIT DPLIVTTNKL VYDFTLDKEL VIRGQIKNPV DRTSFETPTV LISFKDENGK
PLSIIGVPEG VNQGAAGGTG SVTAKYQFTA IPESGGTFSV TADISRGIFS EGTYTITAQY
LDLTSTTSFD IVDDLAGGGV VSLDKDVYGL GEQVVVSGII PTSDRSVTIS VTRPDGTKTT
YGEAVDNQRF SWSWTTPVSE RYQTLKSDGE RGVTFSNFGI YKIKVAGDTY SKDLLFKVST
DPENDSLSAT PLFITTDKSL YQAGDKLKVI GNVIPRDQGD EGLVVPDRVT IKVLDGTFPY
KQIHEASVYP KQGGEFSSLF ELPATIFSEG MYTVKAIYST KQVTSTFSVA NDFTFGIDEP
VSLLASTDKS EYYPGDTVII SGKPNKLIYL EAYDVSIIKK SDTEITCGSF ICGTHVGQVK
SIRPSPSGSF THEFPIKNTL SSIGTYEVTI DADFETKHIR FNVVEEPLAP KLETVIEKEN
RIPDKTISVS TPEKTVDDAT FAPRVVSGSL LTPIKDEASN VNLKVSSESG VCIIGPDADC
LVSESTRKPG QIYDVVEVDG MSLNVRYSGP DVRLEKFSIL PESSESFLPD SDWNVEILKD
EQVSRFYYKV TYKTIE