Gene Nmar_0217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0217 
Symbol 
ID5774598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp193616 
End bp194686 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content34% 
IMG OID641315837 
Productradical SAM domain-containing protein 
Protein accessionYP_001581551 
Protein GI161527725 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1180] Pyruvate-formate lyase-activating enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGCAA TTCTCGGCAA AGAAGCTGAA TTATACGAAA AACTTGCAGA TGATAAAGTC 
AAGTGTACTG CATGTGCACG ATATTGCGAA ATTGGTAAAG GACAAATTGG TTTATGTGGA
ATTCGTGGAA ATGAAGATGG AAAATTACAA CTTTATGCCT ATGGAAAAGT AATTTCTGGT
CATGTTGATC CAATTGAAAA AAAACCATTA ATCCACTATT ATCCTGGAAG TAAAGTCTAT
TCAATTGCTA CAACTGGATG TAATTGGCTT TGCAGATATT GTCAAAATTC TGACATCAGT
CAGAGACGAG AAATTCAAGG AATTGATATG ACTCCTGATC AAGTTGTTGA TACTGCAATC
AAGTATGGTG CTCATGGAAT TGCATATACA TACAATGAAC CTTCCATCTT TATTGAATTT
GCAAAAGATT GTGGTGTTGC TGCAAGAAAA AAAGGATTGT TTAATGTCTT TGTTTCAAAT
GGATATGATA CTACTGAATC AGTTTCAATG ATGAATCAAT TTCTTGATGG GATAACCGTT
GATTTCAAAG GAAGTGCAGA AAAAGAATTT ACTCGAAAGT TTATTGGAGT TCCAGATCCT
CAACCTATCT TTGATACTTT GTTAGAAATT CGAGATAAAA CCAATATTCA TATCGAAATT
ACTGATCTGA TTGTCCCTAA AGTTGGTGAT GATCTAGAAC ATGCAAAAAA ACTTTCAAAC
TTTATTCTCG ATGAGTTTGG ACCAGAGATG CCAATTCATT TTCTACGATT TCATCCCGAT
TACAAAATGA TGGAATATCC AAGTACTCCT GTAGAAACAT TGGAAAAACA TTATCAAATT
GCAAAAGAAG TTGGACTAAA GTATGTGTAT TTAGGAAATG TTCCTGGACA CAAATGGGAG
CACACGTATT GTTCTGAATG CAATAATGTT GTTGTAAATC GCTATGGCTT TAGTATTAGA
GAATGGAATC TTGATAAAAA CAACTGTTGC AAGTTTTGTG GAAATAAAAT TCCAATAAAA
GGAAAATTGC AGGAAGGATA CAAAGAAGAT AGATTTCAGT TTGTATCGTA G
 
Protein sequence
MTAILGKEAE LYEKLADDKV KCTACARYCE IGKGQIGLCG IRGNEDGKLQ LYAYGKVISG 
HVDPIEKKPL IHYYPGSKVY SIATTGCNWL CRYCQNSDIS QRREIQGIDM TPDQVVDTAI
KYGAHGIAYT YNEPSIFIEF AKDCGVAARK KGLFNVFVSN GYDTTESVSM MNQFLDGITV
DFKGSAEKEF TRKFIGVPDP QPIFDTLLEI RDKTNIHIEI TDLIVPKVGD DLEHAKKLSN
FILDEFGPEM PIHFLRFHPD YKMMEYPSTP VETLEKHYQI AKEVGLKYVY LGNVPGHKWE
HTYCSECNNV VVNRYGFSIR EWNLDKNNCC KFCGNKIPIK GKLQEGYKED RFQFVS