Gene Nmar_0595 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0595 
Symbol 
ID5774192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp531812 
End bp533020 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content33% 
IMG OID641316229 
Productglycosyl transferase family protein 
Protein accessionYP_001581929 
Protein GI161528103 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAATAG CATTTGATGT TTTGAATTAT TCATTATCAG CAATCCTAAT TGGAATATGT 
GGAGCGTGGT TATTTTTGAT AAAATCAATG GTTGATTCAT TTAGATTAAC ACCTTACTTG
GATAGATTTG AAAATACATC AAAGGGATTT CCCAAAGTTT CAATAATTTT ACCTGCAAGA
AACGAAGAAG AGTTTCTTGG AAAATGTTTG GATTCATTAA TTGATCAGGA TTACAAAGAT
TATGAAATTA TTGTAATTGA TGATTCATCA GAAGATTCTA CGGGAAAAAT AATTTCAGAA
TATGCAAAGA AAAACTCCAA AGTCATTCAT GTTTCTGCAA GAGAAAAACC TGAAGGATGG
ATGGGAAAAA ACTGGGCATG TATGGAAGGA TATAGAAAAG CAACAGGAGA ACTATTGTTA
TTTACAGATG CAGACACTAC ACATAAAAAA AATGTCATAT CACTTGCGGT CTCACATCTT
TTATCATTTG AACTAGATGC ATTATCAACC ATACCAAAAA TGCTCACATT TGATTTTTGG
ACAAACATTA CCCTTCCAAT GATTTCTACG TTTTTGCATA CAAGATTCTC TGCACTAAAT
GTGAACAATC CATCAAAAAA GACAGGTTAT TTTTTTGGTA GTTTTTTCAT TTTGAAGAAA
AGTACGTATG AACAAGTTGG TATGCATGAG GGAGTCAAAC ACGAAATAAT TGAAGATGGG
GCACTTGGAA AAAAAGTAAA GGAAGCAGGA TACAAAATGA AGATGGTAAG AGGAGAACAT
CTAGTAGAAG CAGTTTGGGC AAGAGACAAA AGTACCCTTT GGAATGCACT AAAAAGATTG
ATGATACCTT TGTATCTTCA AAGTGGGAAA ATCGCAATAG GAATTTTCTT TGCAGTATTG
TTTTTGCTTT TTGTACCATT TCCAATTTTT GCAACATCTA TTTTGTTACC TGCAGAAACA
TTATCATCAA AAATTCTTTG TGCAACGGCG TTTGCAGCAT CATTGTTAAT TTACATTGGA
GCAGTGATTG AAGCCAAAAT AGGATTAGAA TTAAAATTTA GATATGCAAT ATTTGCTCCA
CTTGGAAGCC TTGTAGTTGT GTTAGGATTT TTGAGTGGAT TATTGCAAGC TAAAAAAACA
TCATCAGTTA CTTGGAGGGG AAGGAGTTAC TCTATGAAAG ATCACTCTCA AAGTTCGATT
AGCGTATAG
 
Protein sequence
MEIAFDVLNY SLSAILIGIC GAWLFLIKSM VDSFRLTPYL DRFENTSKGF PKVSIILPAR 
NEEEFLGKCL DSLIDQDYKD YEIIVIDDSS EDSTGKIISE YAKKNSKVIH VSAREKPEGW
MGKNWACMEG YRKATGELLL FTDADTTHKK NVISLAVSHL LSFELDALST IPKMLTFDFW
TNITLPMIST FLHTRFSALN VNNPSKKTGY FFGSFFILKK STYEQVGMHE GVKHEIIEDG
ALGKKVKEAG YKMKMVRGEH LVEAVWARDK STLWNALKRL MIPLYLQSGK IAIGIFFAVL
FLLFVPFPIF ATSILLPAET LSSKILCATA FAASLLIYIG AVIEAKIGLE LKFRYAIFAP
LGSLVVVLGF LSGLLQAKKT SSVTWRGRSY SMKDHSQSSI SV