Gene Nmar_0331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0331 
Symbol 
ID5774596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp291621 
End bp292727 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content35% 
IMG OID641315959 
ProductATP-binding domain-containing protein 
Protein accessionYP_001581665 
Protein GI161527839 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1108] ABC-type Mn2+/Zn2+ transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATAGAGA TATCTAATGA TCTAATTATT ATTGGAACTG CTGTACTAGT TGCAGCAAAT 
TGTGCATCAA TTGGAACTTT CTTAGTTTTA AGACAAATGG CCATGATGTC TGATGCCATT
AGTCATGCTG TATTATTAGG AATAGTAATT GCCGTGTTTA TGGTTGGAGG TCGAGAAACT
ATTCCCGTAA TAGTTGGAGG TGTACTGTCT GGTATCTTGA CTGTTTCAAT TGTAGAGATG
CTTTATCGTA CAGGTAAACT AAAACAAGAC AGTTCCATTG GCATCGTCTT TCCATTTCTT
TTTGCAATTG GTGTGATACT TGTCACACAA GCTGGTAATG TACACATTGA TGCACAACAT
GTACTGTATG GTTCAATTGA ATTTGTACCA TTTGACACAC TATATCTTGA TGAAATTAAC
ATAGGTTCAA AATCTCTTTG GGTTCTTGGC GTTTTAGCAA TTGCAAATAT TTCTTTTATT
GCAATTCTAT ACAAGGAACT AAAAATTAGC ACTTTTGATG CTTCAGTTGC AGTAAGTGTT
GGATTGATGC CCATGTTGAT TCATTATTTG TTAATGATAA TGGTAGCTAC AACTGCTGTT
GTTGCTTTTG AATCAGTTGG TGCAATACTT GTCATTGCAT TTTTCATAGT TCCTGCGTCT
GGAGCTTATC TGTTAACCGA TAGATTATCT CATATGATAG CACTATCTGT AACACTGGGA
ACTATTAGTG CTATTGCAGG TTATCTGTTT GCAGTCTTAC TTGATGTGTC GATAGCAGGC
TCAATGGCCA CAATTGCTGG TGCCATATTT GGATTAATCT GGGTATTTGC ACCAAATCGA
GGCTTGATTA GCAGATGGAG ACGTATTACA AAACAAAGAT TCGAAATAGA TTTGGGAATA
GTTCTAACTT TCATTCAAGA TGAGCTTAGC GCTAATCGTC AAGTATCCAC TTCAACTTTA
TCGAATGCAT TAGGGTGGAC AATAGATTAC TCGAAAAAAC TAAGTGAATT TATTCAAGAA
AAAAAATTTA TAGAAAAAGA TGTTAGTGGA AATCTAATTC TAACAGTACT AGGACAAAAA
AAGGCAAAGA ACTACTCAAC AACTTGA
 
Protein sequence
MIEISNDLII IGTAVLVAAN CASIGTFLVL RQMAMMSDAI SHAVLLGIVI AVFMVGGRET 
IPVIVGGVLS GILTVSIVEM LYRTGKLKQD SSIGIVFPFL FAIGVILVTQ AGNVHIDAQH
VLYGSIEFVP FDTLYLDEIN IGSKSLWVLG VLAIANISFI AILYKELKIS TFDASVAVSV
GLMPMLIHYL LMIMVATTAV VAFESVGAIL VIAFFIVPAS GAYLLTDRLS HMIALSVTLG
TISAIAGYLF AVLLDVSIAG SMATIAGAIF GLIWVFAPNR GLISRWRRIT KQRFEIDLGI
VLTFIQDELS ANRQVSTSTL SNALGWTIDY SKKLSEFIQE KKFIEKDVSG NLILTVLGQK
KAKNYSTT