Gene Nmar_0653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0653 
Symbol 
ID5774463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp592793 
End bp593812 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content31% 
IMG OID641316289 
Producthypothetical protein 
Protein accessionYP_001581987 
Protein GI161528161 
COG category[S] Function unknown 
COG ID[COG4301] Uncharacterized conserved protein 
TIGRFAM ID[TIGR03438] probable methyltransferase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.671427 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAACAATA CCTTACAAAA AAACAAAGAA TACAAGAAAT TTGTTGTTGA TTCCAGATTA 
CAATATTTCA AACCTCATGC CACAAAAATT GAAAAAACGT TTGCTGAAGA AATCTCATCT
GGCCTTGGAA CTAACTCTAA ATCTATTCAT CCCAAATTTT TCTATGACAA GAAAGGTTCT
GAGTTGTTTG AAAAAATATG TTCTGTTCCA GAATACTATC CTACTAGAAC TGAAATTTCT
ATTCTGAAAA AACTCCAGAG TGAACTGTCT TCCTACTTGG ATGAAGACTT TAGATTGGTA
GAATTAGGCA GTGGTTCCTC AACAAAAACT CGGTTAATCC TGGACTTTTT GACATCTCAA
AAAACTCTCG AGTACTTTCC AATAGATATC TCTGAAATTC TTACAGAAAG TTCTGAAGAA
TTACTAAATG ATTATCAAAA TCTTACAATT ACTGGCATTA TCGATACTTA TGAGGGTGGT
TTAGAATTTT TAAAAACATA TGATGATAAA AGCAATCTCA TCATTTTCCT GGGTTCCAGT
TTTGGTAATT TCTCTCCAAT TGACGGGTAC AAATTTTTAG AAAAAGTTTA TGCTACTATG
AAACCTGGTG ATTTGTTTTT GATTGGACTT GATCTTGTAA AAGACAAAAC CATTCTTGAA
TCTGCTTATA ATGACTCTGA AGGCGTAACT GCAAAGTTCA ATCTTAATGT TTTATCTAGA
ATTAATGACG AGCTTGATGC TGATTTTAAT TTACAAAACT TTTCACATCA TGCTATTTAC
AATGAAAAAG ATCAGAGAAT TGAAATGTAT TTGAAATCTC TGGTTGATCA ATCAATAATC
ATATCAAAAT CTGATTTGGA ATTAAAATTA CAAAAAGATG AATTGATTCA CACTGAATAC
TCTCACAAAT ATAGATTATC TCAAATTCAT GATCTTCTTG ATGATGTTGG ATTTGAGTTA
AAACACACCT GGCTTGACGA TAAAAAATAT TTTTCATTAA CTTTGGTCTC AAAAACTTGA
 
Protein sequence
MNNTLQKNKE YKKFVVDSRL QYFKPHATKI EKTFAEEISS GLGTNSKSIH PKFFYDKKGS 
ELFEKICSVP EYYPTRTEIS ILKKLQSELS SYLDEDFRLV ELGSGSSTKT RLILDFLTSQ
KTLEYFPIDI SEILTESSEE LLNDYQNLTI TGIIDTYEGG LEFLKTYDDK SNLIIFLGSS
FGNFSPIDGY KFLEKVYATM KPGDLFLIGL DLVKDKTILE SAYNDSEGVT AKFNLNVLSR
INDELDADFN LQNFSHHAIY NEKDQRIEMY LKSLVDQSII ISKSDLELKL QKDELIHTEY
SHKYRLSQIH DLLDDVGFEL KHTWLDDKKY FSLTLVSKT