Gene Nmar_0832 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0832 
Symbol 
ID5774144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp735149 
End bp736402 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content35% 
IMG OID641316470 
Producthypothetical protein 
Protein accessionYP_001582166 
Protein GI161528340 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.000822238 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTAGATT TATGGGTTGA AATTGGAGCG TTAGCTGTAC TTATTGGATT ATCTGGATTC 
TTCAGTGGGT TAGAGGTTGC ACTAGTAGGA GTAAGAAAAT CCAAAGTTGT TCAATTATTC
AATGAAGGTA AGAAAGGTGC TAAGGCACTT TACAAATTAA AAACAAATCC TGGATGGATG
ATGTCTAGTG TCAATCTGGG CAATAATTTG GTGAATGTTG GGGCATCTGC ATTGGCAACA
AGTGTTGCAA TCAGAATGTT TGGTGATGAA GGGTTAGGAA TTGCCGTGGG CATCATGACA
TTTTTGATTC TAATTTTTGG AGAGATTACC CCAAAAACAT ACTGTAATGC AAACTCTACA
AAAATTGCAT TAAGATACGC TCCAATATTA TTAGGATTTA GTTACGCATT ATATCCAGTT
GTAAAATTCT TTGAAACCAT TACAAAAGGT GTAGTAAAGA TGACAGGTAG CAGTTATGCT
CCTCCACCGA TTACTGAAGA GGAGATTAAA GGAGTTATTG ATCAAGGCTT GGAAGAAAAA
GCACTTGAGA GAGATGAAAT GGAATTAGTT CACGGGGCAT TGAAATTTGA TGACACTGTA
ATTCGTTCAG TGATGACCCC AAGAACAAAA ATGTTTACAC TAAATTCAAA AATGTTACTT
TTTGAAGCAT TACCACAAAT TAACCAAAGT GGTCATTCTA GAATTCCAAT TTATGGGGAC
ACACAAGATG ACATTGTAGG TTTTATTCAT GCCAGAGATG TCCTAAAAGA ATTAGAAAAA
GACAACGAAG TTGTTAGTTT AGAGCAAATT GCAAGAAAAC CAGTATTTGC ATCTCAAGAA
AAGATGGTTA GTTCTTTGCT AAAAGAGATG AAGGGAAGAA AAACACACAT GGCAATTGTT
GTAGATGAGC ATGGAGGAGT TGAAGGGTTA GTCACACTAG AAGATTTGTT AGAAGAGATT
GTCGGAGAGA TTGAAGATGA AACGGATCTA ACTAGAACTA TAGGCTATGA AAGAATTGAT
CAAGACACAA TTGTCACAAA CGGAGATATT GAAATTGATA TTGTAAATGA AATTTTCAAA
ACTAACGTAC CTGAAGGTGA TGATTATGCA TCATTGAGTG GTTTGTTACA TGAAAGACTA
CAAGACATTC CACAAGAAGG GGACAAAGTA GAGGTAGAAG ATCTTAGAAT AGTTGTAGAA
AAAGTATCAA AGAATATTCC TCAAAAAATC AGAATAGAGA AAATTAGAAC CTAG
 
Protein sequence
MVDLWVEIGA LAVLIGLSGF FSGLEVALVG VRKSKVVQLF NEGKKGAKAL YKLKTNPGWM 
MSSVNLGNNL VNVGASALAT SVAIRMFGDE GLGIAVGIMT FLILIFGEIT PKTYCNANST
KIALRYAPIL LGFSYALYPV VKFFETITKG VVKMTGSSYA PPPITEEEIK GVIDQGLEEK
ALERDEMELV HGALKFDDTV IRSVMTPRTK MFTLNSKMLL FEALPQINQS GHSRIPIYGD
TQDDIVGFIH ARDVLKELEK DNEVVSLEQI ARKPVFASQE KMVSSLLKEM KGRKTHMAIV
VDEHGGVEGL VTLEDLLEEI VGEIEDETDL TRTIGYERID QDTIVTNGDI EIDIVNEIFK
TNVPEGDDYA SLSGLLHERL QDIPQEGDKV EVEDLRIVVE KVSKNIPQKI RIEKIRT