Gene Nmar_0841 details


Gene Information

Locus tag: Nmar_0841
Symbol:
ID: 5774175
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 742126
End bp: 743241
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 34%
IMG OID: 641316479
Product: propanoyl-CoA C-acyltransferase
Protein accession: YP_001582175
Protein GI: 161528349
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID:
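
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of residues, assuming one terminal stop codon is excluded."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(742126, 743241)
print(length)                     # 1116
print(protein_length_aa(length))  # 371
```

Under these assumptions the computed values agree with the Gene Length (1116 bp) and Protein Length (371 aa) fields.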


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.000470716
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGTATGA ACAAAGTAGG TATTGCAGCA TATGGAATTA CACCATTTAC CAAAGATGAT 
AAAAAAATAG AATCTGCACT TTTTGAATCA ACAAAAAATT TGATTGAAAA TAATCCCAAA
ATCAAAAAAA ATGATATTGA TGCAGTTTTG GTTTCAACTA ACAACAACTC AAAATATCTT
GCACCTATAT TGTCAGAGGC CATAGGAATT CAACCAAAAA CGGCACATTC TATAGAAAAT
CTGTGTAATT CAGGCACAAA TTCGATTGTT TCAGCCTATT CGTACATAGC AGCAGGTCTG
GCAGATATGG TGCTAGTAAG TGGTGCTGAA AGATATGATA GTCCAGGCCA AATTTTAGAA
TGGGACAATT CACGTGGAGA GTTTAAACAT CCAATATTTT GGGCATCAAT TTTTACGAGT
TCATACAAAC GAAAATTCTC AGTTACTGAT GAACAGTTAG CAATAGTTTC AGTAAAGAAT
CACAGTCAAG CTCAAAACAA TCCAAATGCT CTCTCTAAAA AAACATATTC AGTTCAAGAT
GTAATAGATT CTAAAAAAAT TACTGATGAT CTTAGATTAC TTGATTGTTC GAGATCATGT
ACGGGAAGTG CATCAATTAT TTTAGCATCA GAAAACATGA TGAAAAAAAT TACTGATCAA
CCAATATGGA TTACCGGAGT AGGTCAAAAA ACAATTTCAG CAGGATTTAC AAAAAATGAA
TCATTCAATT CAATGGAATC TACAAAACAT GCAACACAAA CAGCACTAAA CATGGCAAAG
AGAAAAATTA AAGACATTGA TGTGGCAGAA GTTCATGATG CATTTTCAGT ATGTGAACCA
ATGGCATTAG AAGCAATTGG TATGGCAAAT CCTGGAAAAG GTACAACCAT TGTAAAAGAA
CTTTATGAAA CAAAAAACCT CAAGATTAAT CCAAGAGGAG GACTGATTGG TTGTGGACAT
CCTTTAGGTG CAACAGGTAT TGCACAAACT ATCGAGATTA TGCAACAACT GCAAAATAAT
GCAGAAAAAA GACAAGTTGA AAATGCAAAT GTTGGTCTAG TTCACAACAT GTCTGCAGCA
GCAACATCTT CAACTATTTT GGTGTTAGAA AAGTGA
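
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before its length or GC content can be measured. The following is a minimal sketch of that cleanup. The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of the source database.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of 10).
first_line = "ATGAGTATGA ACAAAGTAGG TATTGCAGCA TATGGAATTA CACCATTTAC CAAAGATGAT"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_fraction(first_line), 2))
```

Passing the whole gene sequence to `gc_fraction` should give a value comparable to the GC content field, assuming that field is computed over the same bases.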
 
Protein sequence
MSMNKVGIAA YGITPFTKDD KKIESALFES TKNLIENNPK IKKNDIDAVL VSTNNNSKYL 
APILSEAIGI QPKTAHSIEN LCNSGTNSIV SAYSYIAAGL ADMVLVSGAE RYDSPGQILE
WDNSRGEFKH PIFWASIFTS SYKRKFSVTD EQLAIVSVKN HSQAQNNPNA LSKKTYSVQD
VIDSKKITDD LRLLDCSRSC TGSASIILAS ENMMKKITDQ PIWITGVGQK TISAGFTKNE
SFNSMESTKH ATQTALNMAK RKIKDIDVAE VHDAFSVCEP MALEAIGMAN PGKGTTIVKE
LYETKNLKIN PRGGLIGCGH PLGATGIAQT IEIMQQLQNN AEKRQVENAN VGLVHNMSAA
ATSSTILVLE K
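
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are allowed. The function name `translate` is illustrative, and translation stops at the first stop codon.

```python
from itertools import product

# Amino acids for all 64 codons with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 36 bases of the gene sequence shown above.
print(translate("ATGAGTATGA ACAAAGTAGG TATTGCAGCA"))             # MSMNKVGIAA
print(translate("GCAACATCTT CAACTATTTT GGTGTTAGAA AAGTGA"))      # ATSSTILVLEK
```

The two fragments match the beginning and end of the protein sequence listed in this record.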