Gene Nmar_1700 details


Gene Information

Locus tag: Nmar_1700
Symbol:
ID: 5774600
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1560670
End bp: 1561704
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 35%
IMG OID: 641317354
Product: H(+)-transporting two-sector ATPase
Protein accession: YP_001583034
Protein GI: 161529208
COG category: [C] Energy production and conversion
COG ID: [COG1527] Archaeal/vacuolar-type H+-ATPase subunit C
TIGRFAM ID:
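
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS is complete, unspliced, and ends in a single stop codon.

```python
# Values copied from the record above; function names are illustrative.
START_BP = 1560670
END_BP = 1561704
GENE_LENGTH_BP = 1035
PROTEIN_LENGTH_AA = 344


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one for the stop codon (complete CDS assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1035
print(expected_protein_length(GENE_LENGTH_BP))  # 344
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields of this record.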


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.122412
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAAGA ACGTCTATGC ATCAGTAAAG TCATACAGCC AAAGAGGAAA ATTACTCAGT 
AGGGCTGATT TTCAGACACT GGCAGAATCA AGAGATCTTG ATGAATTTAT GACCAGAATA
AAGAACACCA TTTATGGTGA TTCAATTAAT GATGTTCAAA AACCATATAC TTCACAAGGT
ATTGAATCAG CATTTAGAGG ACATTTGGCT GATGTCCATT ACTCCATTGC AAAAACTGCT
GGCGATTCTG ATATTCTTGA TGCATATTAT ATGAAGTTCA TAATTTCAAA TCTAAAATTA
ATACTAAAAG GCAAGGTTTT AGGTAAATCA CAAGAAGAGA TTGAGAATCA CATCAATCTA
CGTGCAGAAG AATTAGTTAA ACAACGAGAT ATCATAATCA AATCCCTTGT TGCAAAAGAT
CTTGAAGAGG CAGTTGCAAG TCTAAATTCA GTTCAATTTG GAGATGAGAT TGCAAAGGCT
GCAACACTTT ACAACGAAAC AAAAAACATC CAAGTCTTTG ACACGTATTT TGATAAAATT
TTGTACCAAC AACTAGGACG AGCTTTGAAG AATACAAGAG ATAGAGATGT CATAAAGATT
GTCGGAATGG ATGTTGACTT TTACAATCTT CTTAGTGTGA TTAGAGGAAA ATTCTGGGGA
TTAGAAGAAT CACAAATTCA AGATTTGATT GTGACTCAAA CTCCAACTGT CCCAAGAGAA
CTTCTTGGAA GAATGATGGC AGCAGGTTCA GTCAGAGATG CACTAAATGA GCTTGCCACA
ACCAAATACA AAGACATGAT TCCACAGATG GAAAATGAGT TAGATGCAGT TGCCGAATTT
GAAAGAGCAT TTGAGATGAG CATTTATCAT TCATCTGCCA GAGCATTTAC CAAGATGTTT
AGTTTTGCAA CAATCATAGG AATCACAAAA CTAACGGGCT TTGAAGTAAG GAATTTGGCT
GCAATTGCAT ATGCAGTAGA GCAAAAAATT CCTACAGAAA CAACAATGTC AAAATTGATT
CTTGAAGAAG AATAG
 
Protein sequence
MGKNVYASVK SYSQRGKLLS RADFQTLAES RDLDEFMTRI KNTIYGDSIN DVQKPYTSQG 
IESAFRGHLA DVHYSIAKTA GDSDILDAYY MKFIISNLKL ILKGKVLGKS QEEIENHINL
RAEELVKQRD IIIKSLVAKD LEEAVASLNS VQFGDEIAKA ATLYNETKNI QVFDTYFDKI
LYQQLGRALK NTRDRDVIKI VGMDVDFYNL LSVIRGKFWG LEESQIQDLI VTQTPTVPRE
LLGRMMAAGS VRDALNELAT TKYKDMIPQM ENELDAVAEF ERAFEMSIYH SSARAFTKMF
SFATIIGITK LTGFEVRNLA AIAYAVEQKI PTETTMSKLI LEEE
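
The two sequences above are printed in blocks of ten characters, so they need whitespace removed before any processing. The following is a minimal sketch of how the gene sequence could be translated and its GC fraction computed without external libraries. The helper names are hypothetical. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in their permitted start codons, which does not matter here because this gene begins with ATG), and it handles only unambiguous A/C/G/T bases.

```python
# Standard codon assignments in TCAG order; assumed to match table 11
# for amino acid identity.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "ATGGGCAAGA ACGTCTATGC ATCAGTAAAG TCATACAGCC AAAGAGGAAA ATTACTCAGT"
last_line = "CTTGAAGAAG AATAG"
print(translate(first_line))  # MGKNVYASVKSYSQRGKLLS
print(translate(last_line))   # LEEE
```

The two printed fragments match the first twenty and the last four residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole protein and the GC content field to be checked in the same way.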