Gene Nmar_0101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0101 
Symbol 
ID5773598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp91543 
End bp93711 
Gene Length2169 bp 
Protein Length722 aa 
Translation table11 
GC content36% 
IMG OID641315721 
ProductAAA family ATPase 
Protein accessionYP_001581439 
Protein GI161527613 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0464] ATPases of the AAA+ class 
TIGRFAM ID[TIGR01243] AAA family ATPase, CDC48 subfamily 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.00292078 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTCAAA ACGCACTTTC TCTCAAAGTT TTAGAGGCAT ACACTAGAGA TGTCGGAAGA 
GGAGTAGCAA GAATAGATTA TGATTCTATG GATACATTAA ATGCATCCAC AGGCGATGTT
ATTGAAATTA AAGGTAAAAG AAGAACAGTT GCAAAATGTC TTCCATTATA CCCATCTGAT
GAAGGAAAGG GAATTATCAG AATTGATGGC CTTGGAAGGA ATAATTCAGG AATTGCAATT
GGAGATACAA TTTCAGTTAG GAAAATCAAA GCCGTAGCTG CAGAAAAGGT CGTAGTTGCT
CCATTAGAAG CAATTCCACC AATTGATGAA AGATATCTTG CGGATGCTTT AGAAAGTGTT
CCTTTGATTA AAGGAGACAA TGTAATGGTG CCATACTTTG GTGGACGTTT AACTTTTCAA
GTAATTGGAG TCACACCAGC TGCTGATGCA GTTTTGATCA CCCAAAAGAC AGTTTTCCAT
ATTGCAGAAA AAGGTGAAAC ATTACGTGGA GTTCCACAAG TAACCTATGA AGACATTGGA
GGTTTGACTG ATGAGATAAA GAAAGTAAGA GAAATGATTG AACTCCCATT AAGACATCCT
GAAATTTTTG AAAAACTAGG AATTGAAGCT CCAAAAGGTG TTTTACTTTA TGGTCCACCA
GGAACAGGTA AAACATTACT AGCAAAAGCT GTTGCAAACG AAAGTAATGC ACACTTTATC
AGTATTTCAG GTCCAGAAAT TATGAGTAAG TTTTATGGTG AAAGTGAAGC AAGATTAAGA
GAGATTTTCA AAGAAGCAAG AGAAAAGGCC CCTTCAATAA TCTTTGTTGA TGAAATAGAT
TCTATTGCAC CAAAAAGAGA AGAAGTTACC GGAGAAGTTG AAAGAAGAGT AGTATCTCAG
ATGTTATCAT TAATGGATGG ATTAGAAGCA AGAGGTAAAG TCATTGTAAT TTCTGCAACA
AATAGACCAA ATGCAATTGA TCCTGCACTT AGAAGACCAG GAAGATTTGA TAGAGAGATT
GAGATCAAAG TACCAGATAA AAAAGGAAGA AAAGACATTC TTGCAATTCA CAGCAGAAAC
ATGCCATTAT CTGACGATGT AAACGTGGAT AAAATTTCAG CAATTAGCCA CGGATATGTT
GGTGCAGACT TGGAATACCT CTGTAAAGAG GCTGCAATGA AATGTTTGAG AAGATTATTA
CCAATTCTAA ATCTCGAAGA AGAAAAAATC CCACCTGAGA CTTTGGATAA ATTAATTGTA
AATCATGAAG ATTTCCAAAA AGCCCTAATT GAGGTAACTC CATCTGGAAT GAGAGAAGTT
TTCATTGAAA ATCCAGATGT AAAATGGGAT GAAGTTGGCG GTTTAGAGGA TGTTAAACGT
GAATTACAAG AAGCAGTTGA ATGGCCAATG AAATATCCAG CACTATATGA CAAACTTGGT
CACAGCATGC CAAGAGGAAT ATTGCTTCAC GGTCCTAGTG GTACTGGTAA AACATTACTA
GCAAAAGCTG TTGCAACCCA AAGTGAAGCA AACTTTGTTT CAGTTAGAGG TCCTGAGTTA
TTATCAAAAT GGGTAGGCGA ATCTGAAAGA GGAATTAGAG AAATTTTCAA AAGAGCAAGA
CAATCTGCCC CATGTGTAGT TTTCTTTGAT GAAATTGATT CTATTGCACC AATTAGAGGA
GCAGGTGGAG AAACTGCAGT TACTGAAAGA GTTGTCAGCC AATTACTTAC AGAGTTAGAT
GGAATGGAAA ACATGCATGG AGTTGTTGTT CTAGCTGCAA CAAACAGAGC AGACATGATT
GATCCAGCAT TGTTAAGACC AGGAAGATTT GATAAAATTA TTCAAGTACC AAATCCAGAT
AAAGATAGTA GAAAACGTAT TCTTGAAATT AATGCTGAAA AAATTCCTAT GGGTGATGAT
GTAGATATGG AAAAGATTGC AGAAATTACA GATGGAATGA GTGGTGCAGA TACTTCATCT
ATTGCAAATA CAGCAGTTTC ATTAGTAATT CATGAATTCC TAGACAAACA TCCAGATGTA
AAAGATGTTG AAAAGAGCAG CATAGAAGCC AAAGTAACAA TGAAACACTT TGAAGAAGCA
GTAAAGAAAG TAAGAGAACA AAAAGACCTC AAGATGGGTG AAAAGCTAGT TGCTTCCTAT
TACAGGTAG
 
Protein sequence
MSQNALSLKV LEAYTRDVGR GVARIDYDSM DTLNASTGDV IEIKGKRRTV AKCLPLYPSD 
EGKGIIRIDG LGRNNSGIAI GDTISVRKIK AVAAEKVVVA PLEAIPPIDE RYLADALESV
PLIKGDNVMV PYFGGRLTFQ VIGVTPAADA VLITQKTVFH IAEKGETLRG VPQVTYEDIG
GLTDEIKKVR EMIELPLRHP EIFEKLGIEA PKGVLLYGPP GTGKTLLAKA VANESNAHFI
SISGPEIMSK FYGESEARLR EIFKEAREKA PSIIFVDEID SIAPKREEVT GEVERRVVSQ
MLSLMDGLEA RGKVIVISAT NRPNAIDPAL RRPGRFDREI EIKVPDKKGR KDILAIHSRN
MPLSDDVNVD KISAISHGYV GADLEYLCKE AAMKCLRRLL PILNLEEEKI PPETLDKLIV
NHEDFQKALI EVTPSGMREV FIENPDVKWD EVGGLEDVKR ELQEAVEWPM KYPALYDKLG
HSMPRGILLH GPSGTGKTLL AKAVATQSEA NFVSVRGPEL LSKWVGESER GIREIFKRAR
QSAPCVVFFD EIDSIAPIRG AGGETAVTER VVSQLLTELD GMENMHGVVV LAATNRADMI
DPALLRPGRF DKIIQVPNPD KDSRKRILEI NAEKIPMGDD VDMEKIAEIT DGMSGADTSS
IANTAVSLVI HEFLDKHPDV KDVEKSSIEA KVTMKHFEEA VKKVREQKDL KMGEKLVASY
YR