Gene Nmar_0139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0139 
Symbol 
ID5774399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp127631 
End bp129448 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content23% 
IMG OID641315759 
Producthypothetical protein 
Protein accessionYP_001581477 
Protein GI161527651 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0381] UDP-N-acetylglucosamine 2-epimerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000000000152944 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
TTGAGCATTA ATCAAATCAT CATAATAGAT AATTCAATAG AAATTTCCAA AATAAAAAAT 
TTGGAATTGT CTAATTCCAA AATCATTACA GTTGATTACG AAATTCATAA AAAATTGAAT
GATGAAAAAA TCGAACATTT TATCTCAGAT GATTTTTTAG ATCAAAAATT ATTGATAAAA
ATTCAAAAAG AGTCATATAA AATTGCAAAA TGGTACAATA CACCAGAAAT TTCAGAAAAA
ATTACATACA ATAGTATTAA CTTAGGGGGT TTATTCTACA TAGAATTTCA TGCATTTATG
ATTCCTTTCT TAAAAAAATT GTTTGAGATA TATTACATAT GTAAAGAATA TAAAAACTAT
AAAATTTTTA CGTCTAAAAA TCTTATCAGT ATAATTAAAA AACACACAAG TGATTTTGAA
TGTATAAATC CAGAATCATT AGAAAATTCA GATTTTTTAT ACGACTCACT GCCCATAACA
AAAAAAATAT GGAATCAGAA TCTAAAAATT AATTTAAGTA GAAAAAATAT TAAACAAATC
AAAAATTTTT CAGAAAATTT TATGAAAAAG ATTTTAGATG TAGATAAAAG GATTTCAAAA
AATAATAAGT CAGTACTATT TGTTGAATTT GATTTAAACA AATTTCAAAA TTTATTTTTA
CATCCACAAA AAAAATTTAA TTTTATTTTA TACAATCGAA GAAGACCAAT TTTTATCAAT
AAAAAATCAA TTAATATTTT AAAAGATTCA AAATGCAGTT ACATATCAGA AAATAATTTC
AAAAAATATA GAAAAAATGA TTTTGAAGTT TCACAAATAA TTAATTTAAT TGAAAATTCA
GGTGAATTAA AAAAATTATT TGAAATAGAT TCTATAAATT TTTGGGATGA ATTAAAACCC
AAACTCATCA ACTTGAGTAA AAAACGAATT CCTGAAGCAA TTACTGAAAT TGATTATGCA
AAACAAATAT TTGAAAAATA TCAGTTTTCT TCAATGGTTA TTTTATCTGA ATCAGGATTT
AACGAACAAA TTGTAATAGA ATTGGCCAAA AAACAAGGAA TTCCAATAAT TTTGTTACAA
CACGGAATGG GACCAGAAAG TAGAGAGACA TTAAATTTAA AAAAATTTGA TGGCGTATGG
CCCATTAAAT CAGATTACAT ATGTGTATGG GGTAATATTG TTAGAAAGAT TTCATCTGAA
AACGGAATTA ATGAAAATAA AATTATCGAA GTTGGAAATC CAGTATTTGA TAACCTTTCT
AAGTGTAAAA ATAAGGAATA TGTTTTATTA ACAACTTCTT GGTTAACTAA TAATCAAATT
TTTGACTATT CAGCAAAAAA GATATTAAAT TATGAAAAAA CAATAGAAGA AATTGCAAAA
ATAATCAAAA AATTAAATTT AGAGTTAGTA GTAAAACTGC ATCCTGCTCC GTCAGAAAGA
GATGTAGGAA AGAGTATTTC CAAGACTATG CCAAAAGCAC GTATTGTAAA AGATGGCAAT
ATTCACGAAT TGATTAATAA TTGTAGATTA ATGATAGTTA CAGATTTTTC TACAACTATT
TTAGAAGGAA CATTATTTGA AAAACCAGTA ATTTCAGTTA GAATTAAAGA AGAATATTAT
GGAACTCCAA TTGTTTTTGA AAAAAATGGA TGTCATGTTT CAACTATTAA TGAATTAGAT
GAAAGTATTG AAAAAATTCT AAAGGATAAA GATTTTGAAA AAAGTTTACT TGCAAAACAA
AAAAAATTTT TAAAACAATA TGTTTCACAC ATAAATAATT CTACAGACAA ATTTATTGAA
TTTATTTCCA AGCTCTAG
 
Protein sequence
MSINQIIIID NSIEISKIKN LELSNSKIIT VDYEIHKKLN DEKIEHFISD DFLDQKLLIK 
IQKESYKIAK WYNTPEISEK ITYNSINLGG LFYIEFHAFM IPFLKKLFEI YYICKEYKNY
KIFTSKNLIS IIKKHTSDFE CINPESLENS DFLYDSLPIT KKIWNQNLKI NLSRKNIKQI
KNFSENFMKK ILDVDKRISK NNKSVLFVEF DLNKFQNLFL HPQKKFNFIL YNRRRPIFIN
KKSINILKDS KCSYISENNF KKYRKNDFEV SQIINLIENS GELKKLFEID SINFWDELKP
KLINLSKKRI PEAITEIDYA KQIFEKYQFS SMVILSESGF NEQIVIELAK KQGIPIILLQ
HGMGPESRET LNLKKFDGVW PIKSDYICVW GNIVRKISSE NGINENKIIE VGNPVFDNLS
KCKNKEYVLL TTSWLTNNQI FDYSAKKILN YEKTIEEIAK IIKKLNLELV VKLHPAPSER
DVGKSISKTM PKARIVKDGN IHELINNCRL MIVTDFSTTI LEGTLFEKPV ISVRIKEEYY
GTPIVFEKNG CHVSTINELD ESIEKILKDK DFEKSLLAKQ KKFLKQYVSH INNSTDKFIE
FISKL