Gene Nmar_1648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1648 
Symbol 
ID5773844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1506093 
End bp1507811 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content36% 
IMG OID641317302 
Producthypothetical protein 
Protein accessionYP_001582982 
Protein GI161529156 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGTATTGC ATAACAAATT TGCGTTATTC ATTACGATAA TTGGAATTTT CACAATACCT 
GCATTAGTAC CTGATGTCTT TGGACATGGT CTAGGTGGAG ATCAAGCTCC TGCTATCTCT
TTTGGTGATA TGGCAGTGAC TGTTAGCACA CAACTAACTC CATCTGATAT TACAGTAGGT
GAAGTTGATA CCGCAAACAT GCAAGTTCGT TTCTTTGATA CTGTGACTGA TACCAATCTT
GACAAAGTAA CCTACAGAAT AGAAGTTTGG CAAAGTGGAG AACTTTTAGC TAGAAACTTG
TTTTATGATT TGGATGGACG ACTAGATGTA AAAATAACAC CAAAAACTGA TTGTAATGAG
GAAAATCTAC ATCAATGTTC TGTTTATTAT GGTTCAGAAC ATGTCAGTGC ACCTGGTGCT
CTATTTGTTT ATGGTACGGA TTGTAATGAT GAGAATTTAG ATGTGTGTGC AAGACCTGAA
ATTACGGGTC CAATATTCAT AGAAGGTGGA TTGTATCAAA TCCGTGTTGA TATTGAAGCT
GCAACCAGTC CAAAATCTAT CTTGGCAAAT TTGTTAAGCT ATGATACTTT TGTTAGTATT
GCACAAGATC AAAACTTTAT GATACAAACT GCAAATGCAG ACATTCCAGT TGTTGTCAAA
ACATACTATG ATCAAGTTGA TAATTTCAAC TTTGATCAAT CAGATAATTC AATTTCATTT
GATATGCCAT TTGACTGGAC TCCAGAATAT GTTGATTTGG TCCAAGTAGT TCATGAAGAA
GTTAGAGTCC CAAAAACATT TGCACCATAT GCAGAAGGTA AACAATTCAA AGGATATGTT
AATGGTATCG AGGTTGATCA AAGAGCATTA CTAAATGATC CATACTCATA TGATGATACT
AACGTTGTTC ACTTTCTAAT TACTAATCAA GAATTACAAA GAATTAACGA GACATTAGGT
GAAGCAAACT ATGAAAATCC AAAAATGGAT TTGAAACTCG TTCCACTTGA TGAAATAGAA
AAACAATCAA CTGAATTTTA TCTAGTAGAC ACTACAAACT ATGAACCTGT TCCAACAACT
GTTAACATTT CATGGGATGG AAGTTATGGC GCAGGAGATG AAATTCCATT TGAGATTACA
TTCTTTGATG AAAATAGAGA ACTTATTCGA GATATGAGAT ATGTTGTATC ATTTATTGAC
GAAAACGATG AAGTCCTAGA GACCTTTTTG GGAGATGATC CACAAATGCC AGGAATAGTG
GCCACTGAAG GAATTGACAT AAAGAAAATC TATGTCCCAT CACAAGGAGT CTATAGGATT
GACATTAGAG CCTTAGGTAC TGGATTAGCA TATGATGAGA CTTATGCAGG AATTGGTTCA
GGAATAATTG AGTTAGGTCC TAGTACTGGT AAAACTGTGC CTACACCTGA AGAACAAACA
CCGGCCGCAA TTCCTGCATG GATTAAGAAC AATGCAGAAT GGTGGGCTGC TGGACAAATA
GATGATGGTT CTTTTGTTCA GGGAATTCAA TATTTAGTTA AAGAAAATAT TTTGCAAATC
CCCCCAACCT CTGCAGGTGA AGGTACAGGT TCTAATGAAA TTCCAGCATG GATTAAAAAC
AATGCAGGTT GGTGGGCAGA AGGCGCAATT GATGATGATG CCTTTATCCA AGGAATTCAA
TTCTTGATTA AAGAAGGAAT CATGAAAGTT CAATCATAA
 
Protein sequence
MVLHNKFALF ITIIGIFTIP ALVPDVFGHG LGGDQAPAIS FGDMAVTVST QLTPSDITVG 
EVDTANMQVR FFDTVTDTNL DKVTYRIEVW QSGELLARNL FYDLDGRLDV KITPKTDCNE
ENLHQCSVYY GSEHVSAPGA LFVYGTDCND ENLDVCARPE ITGPIFIEGG LYQIRVDIEA
ATSPKSILAN LLSYDTFVSI AQDQNFMIQT ANADIPVVVK TYYDQVDNFN FDQSDNSISF
DMPFDWTPEY VDLVQVVHEE VRVPKTFAPY AEGKQFKGYV NGIEVDQRAL LNDPYSYDDT
NVVHFLITNQ ELQRINETLG EANYENPKMD LKLVPLDEIE KQSTEFYLVD TTNYEPVPTT
VNISWDGSYG AGDEIPFEIT FFDENRELIR DMRYVVSFID ENDEVLETFL GDDPQMPGIV
ATEGIDIKKI YVPSQGVYRI DIRALGTGLA YDETYAGIGS GIIELGPSTG KTVPTPEEQT
PAAIPAWIKN NAEWWAAGQI DDGSFVQGIQ YLVKENILQI PPTSAGEGTG SNEIPAWIKN
NAGWWAEGAI DDDAFIQGIQ FLIKEGIMKV QS