Gene Nmar_1054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1054 
Symbol 
ID5773538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp928142 
End bp929959 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content35% 
IMG OID641316696 
ProductATPase 
Protein accessionYP_001582388 
Protein GI161528562 
COG category[R] General function prediction only 
COG ID[COG1855] ATPase (PilT family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAAAA TTGTAGTTGA TACCAGTGTT ATAATTAATG GTCAATTAAT ATCTCAAATA 
GAAAAAGGAT CTGTTAAAAA CTCCCAAATC ATCATTCCTC AAGCAGTATT AGACGAATTA
CAATCACAAG CATCAAATAA AAAAGAGCAG GGATTTGTGG GGTTAGAAAA AATTCGTAAA
CTAAAAGATC TTTCTGGCAG TTTTGGTTTA GAAGTAATCC AAAAAGGCAC CCATCTGTCT
TCTGATGAAA TTAAACTTGC CGGAGGTGGA AGAATTGATG CTTTAATTGC GGATATGGCA
AAACAAAACA ATGCTACCCT TTACACATCT GATCATGTTC AGCATTTAGT TGCACAAGCA
GAAGGAATTC AAACTGTATT TTTGAAAACA GAGATCCCTA AAGAAACTTT GGAATTTCTA
AAGTTTTTTG ATGCTGAAAC AATGAGTGTC CATCTAAAAG AAAACCAACA TCCACTTGCT
AAACGCGGTA AACCTGGTGC CTTTTTACTT ACAAAAATCA GCGATGATAT TCTTTCACGA
GAGTATCTGG AAATGATTTC TTCTCAAATT CTAGATATTG CAAATGCAAA TGATTCTGGC
ACTATTGAAA TATCAAAAAC AGGTGCTTCA GTAGTACAGC ATGAAGACTA TCGAATCGCA
ATTACTCATC CTCCATTCTC TGAATCTTTT GAGATAACAA TAGTTCATCC AATAATCCAA
ATGTCTCTTG AAGATTATGA TATCTCTGAA AAACTAATGG AGCGATTCTC TGATAGGGCA
GAAGGAATTG TAATTTCAGG TGCTCCTGGT TCTGGAAAAA GTACTTTGGC TTCAGGACTA
GCAAATTTTT ATCATAACAA GGGAAAGATT GTAAAAACAT TCGAATCTCC TCGTGACTTG
CAAGTTGATC CTGGTATTAC TCAATACAGT AAGCTTGATG GGAGTTTTGA TAATACTGCT
GATATCTTGC TCTTAGTTCG TCCAGATTAT ACTGTCTTTG ATGAGGTAAG GAGACGCGAA
GATTTCAGAA CCTTTGCTGA TTTGAGATTG ACTGGAGTTG GGATGGTGGG AGTAGTTCAT
GCAAATTCTC CTTTAGATGC AATTCAAAGA TTCATTGGAA AGATTGAGCT TGGAATTATT
CCAAATGTTT TGGATACTGT CGTGTTTGTT AAAGATGGAC AAATCAAAAA AGTCTATGAT
TTAGAATTAA AGGTAAAGGT ACCTTCTGGA ATGACTGAGT CTGATCTTGC TAGACCTGTT
ATTGAAATTC GTGATTTTGC TGATAATACG TTGGAGCATG AAATCTATAC ATTTGGAGAG
GAAAATGTGA TAGTTCCTGT AGGGAAAAAA ACCAAAGTAG GAATTGAAAA ACTAGCTGAA
GAGAAAATTC GTGAGACATT CAAAAAGTAT GATCCTAGAG CACAAGTTGA GATTCTATCT
GATAACAGAG TTAAGGTGAT GGTTGATGAA CAATACATAC CATCAATCAT TGGTAGAGGA
GGTTCTAACA TTAATGAAAT CGAAAAATCC CTTCAAGTAC ATGTTGATGT GGTTCAAAAA
GACTCTGAAC ACTATAATTT AGACTCCAAC GATTTGCCTT TTACTTTTTC AGAATCAAAA
ACAGCTCTAA TCCTCACTGT TAGTAAAGAG TACACTTCAA TGCATGCAGA CGTTTATGTT
CGTGATGAAT ACATTACATC AACTAGGATT GGTAAAAAGG GACAAATCAA AATTCCAAAA
CGCTCTGATG TTGCAAGGAC CTTGATGAAA CTAGCTTCAT CCCAAAACGA TATTCAATTA
TTTCTCAAAG ATTTTTGA
 
Protein sequence
MSKIVVDTSV IINGQLISQI EKGSVKNSQI IIPQAVLDEL QSQASNKKEQ GFVGLEKIRK 
LKDLSGSFGL EVIQKGTHLS SDEIKLAGGG RIDALIADMA KQNNATLYTS DHVQHLVAQA
EGIQTVFLKT EIPKETLEFL KFFDAETMSV HLKENQHPLA KRGKPGAFLL TKISDDILSR
EYLEMISSQI LDIANANDSG TIEISKTGAS VVQHEDYRIA ITHPPFSESF EITIVHPIIQ
MSLEDYDISE KLMERFSDRA EGIVISGAPG SGKSTLASGL ANFYHNKGKI VKTFESPRDL
QVDPGITQYS KLDGSFDNTA DILLLVRPDY TVFDEVRRRE DFRTFADLRL TGVGMVGVVH
ANSPLDAIQR FIGKIELGII PNVLDTVVFV KDGQIKKVYD LELKVKVPSG MTESDLARPV
IEIRDFADNT LEHEIYTFGE ENVIVPVGKK TKVGIEKLAE EKIRETFKKY DPRAQVEILS
DNRVKVMVDE QYIPSIIGRG GSNINEIEKS LQVHVDVVQK DSEHYNLDSN DLPFTFSESK
TALILTVSKE YTSMHADVYV RDEYITSTRI GKKGQIKIPK RSDVARTLMK LASSQNDIQL
FLKDF