Gene Nmar_1328 details

Gene Information

Locus tag: Nmar_1328
Symbol:
ID: 5774745
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1218185
End bp: 1219645
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 38%
IMG OID: 641316973
Product: hypothetical protein
Protein accession: YP_001582662
Protein GI: 161528836
COG category: [R] General function prediction only
COG ID: [COG2373] Large extracellular alpha-helical protein
TIGRFAM ID:
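
The coordinates and lengths listed above agree with each other if the start and end positions are read as inclusive, 1-based coordinates and the final codon is a stop codon. Both readings are assumptions, not something the record states. A minimal Python check under those assumptions (the variable names are illustrative):

```python
# Values copied from the record above.
start_bp = 1218185
end_bp = 1219645

# Assumption: coordinates are inclusive, so both end positions count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1461
print(remainder)       # 0
print(protein_length)  # 486
```

The results match the listed Gene Length (1461 bp) and Protein Length (486 aa).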


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00000298022
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATAGCA TTAGATTGTT TTCAATATCT AGTGTAACAG CAATTTTTTT GTTGCTTTCT 
TTGTCCGTGT TTGTTCCCGT GTTAGGAGAA ATTGCACCGC CAAACAAACA GATGAAATTA
GGAGTACTTG CCGAGGATGT AATTTGCAAG GAAGGATTAC AGCTAATGAT TAGAAGCAGT
GGAAATGCCA TATGTGTAAA AGATTCTTCT ACAACTCGCC TAGCTGACTC TGGTTCTGCA
ATAGTTGTCC CAAACAATCC AATTCCTCTC ATAGAAAAAC AAACAGAGGA ATCTGAAAAT
ACAGTGCAAG ATGAAATTAC TCAAAAGGCT GAAAAAACCA TAGTAATTGA GCCTACAAAC
GTAGGTTACT TTCCTGGCGA TACAATTGAT TTTTCAGGCA AGGCAAGAGG TAACCAGAAT
CTTGAGATAA CCTTGATTGA TCCTAACGAA AAAGAGGTTT TCACGGAGGT TTTTGAATTG
GACTCTTCTG GTAATGTGAG CTTCCAAATA GTAACTGAGA CTTTTTTTGT AGAGGGACGT
TACTTTCTGA TTGCAGAACA AAGTGATGAC TCTGAAATAA CACCTGTAAC TATTGGACAT
AATTCTAATG ACATTCAATT GGTAGTGGAT GATTTTTACA ACAAATTAGA TTCCGAATTA
TTAATTGAGA TGTATAGTGA TCCTCTGTCC ACAGTAGAGT TGGCAGTGTA TGATTATGCA
GAACATGAAA AATTTAGATC TGACATCACC GTCAATTCAA ATGGCTATGC GGAAACTACC
ATAGATCTGA GCGGATACAA GTCAGGAGTT TACTCTGCAG TGCTAACACA TGGAACCGAA
GATGTTGATG TAGATTTTGC AGTGGGTCTA AAAACAGGTT CCACCTTACC AATAACACTC
AGTGTCACCA AAGAACACTA TGTTCCTAAT GAACGGGTTC TAATTTTTGG AACAGCAAGT
AGTAATTCTG CAGTTACATT AGATGTGATA AATCCTGATG GTGATCTTGT TCACGAGATT
GAATCATACT CTGATGCATC TGGAAAAATT TCAACCATGT TTAATCTTCC TACAAGTGCA
GCTACAGGAG AATGGAAGAT AGCAACTACA GGAAGATCCA TAGTTAACGA AGCCACTTTT
CATGTAACCG GAAATGATCC CCCATTGTCT TTTGAGATAG ACAAGGTAGA GCCATACGAG
ACAGGAGATG TTTTAACATT GGTGGGAAGT GGCGTAGACA CCGTATCTAA AATTGCAATT
AGCATCACAT CAGGTGAAGT AATAGAAAAA TTCGAGGTGT TTGCTACAGA AGACGGGGAT
TTCTCTTTAA GTTGGAGCAT TCCTGAGGAT TTAGAGTCAG GAACACATAC CATTACTGTT
GATGACAACA TTACAACAGT TACCAAAAAC TTTGAGGTCA TTAACCCTCT CTATGAGAAA
AGTTATTTTA CATCTCCATG A
 
Protein sequence
MNSIRLFSIS SVTAIFLLLS LSVFVPVLGE IAPPNKQMKL GVLAEDVICK EGLQLMIRSS 
GNAICVKDSS TTRLADSGSA IVVPNNPIPL IEKQTEESEN TVQDEITQKA EKTIVIEPTN
VGYFPGDTID FSGKARGNQN LEITLIDPNE KEVFTEVFEL DSSGNVSFQI VTETFFVEGR
YFLIAEQSDD SEITPVTIGH NSNDIQLVVD DFYNKLDSEL LIEMYSDPLS TVELAVYDYA
EHEKFRSDIT VNSNGYAETT IDLSGYKSGV YSAVLTHGTE DVDVDFAVGL KTGSTLPITL
SVTKEHYVPN ERVLIFGTAS SNSAVTLDVI NPDGDLVHEI ESYSDASGKI STMFNLPTSA
ATGEWKIATT GRSIVNEATF HVTGNDPPLS FEIDKVEPYE TGDVLTLVGS GVDTVSKIAI
SITSGEVIEK FEVFATEDGD FSLSWSIPED LESGTHTITV DDNITTVTKN FEVINPLYEK
SYFTSP
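
The sequences above are printed in blocks of 10 characters, so the spaces and line breaks must be removed before any analysis. The Python sketch below is one possible way to do this; it is not the database's own tooling. The helper names `clean_sequence`, `gc_fraction` and `translate` are hypothetical. The codon table is the standard one in NCBI ordering (bases T, C, A, G), on the understanding that translation table 11 assigns the same amino acids as the standard code and differs only in the start codons it permits.

```python
# Standard codon table in NCBI order (first, second, third base over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(block: str) -> str:
    """Remove the spaces and newlines used for display and upper-case the result."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon.

    Raises ValueError on characters other than T, C, A, G.
    """
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence, copied from the record above.
first_row = "ATGAATAGCA TTAGATTGTT TTCAATATCT AGTGTAACAG CAATTTTTTT GTTGCTTTCT"
dna = clean_sequence(first_row)
print(len(dna))        # 60
print(translate(dna))  # MNSIRLFSISSVTAIFLLLS
```

The translated fragment matches the first 20 residues of the protein sequence above. Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value to compare with the listed GC content.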