Gene Nmar_1528 details


Gene Information

Locus tag: Nmar_1528
Symbol:
ID: 5774413
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1391235
End bp: 1393613
Gene Length: 2379 bp
Protein Length: 792 aa
Translation table: 11
GC content: 38%
IMG OID: 641317179
Product: ATPase
Protein accession: YP_001582862
Protein GI: 161529036
COG category:
COG ID:
TIGRFAM ID:
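
The coordinate and length fields in this record are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the record does not state either convention, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields.
START_BP = 1391235
END_BP = 1393613
GENE_LENGTH_BP = 2379
PROTEIN_LENGTH_AA = 792


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                  # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under those assumptions, 1393613 - 1391235 + 1 gives 2379 bp, and 2379 / 3 - 1 gives 792 aa, matching the listed values.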


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000453148
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCCTATGA ATCCTGCATT TGCAGCAACT GACTTAGATA ATGACGGTGT TGATGATACT 
GTTGATGCAT GTCCTAATTT ACGTGAAGAT AATGAAGGTG CCATAGATGG TTGTCCATCA
AATTTTGTTC CATGGTATGA TGAAGATTAT GATGGAATAG AAGATCATAT TGATCAATGT
CCAAATCTTA GAGAAAAATA CAATAAATTC CAAGATGAAG ATGGTTGTCC AGATATTTCT
CCTCAAGGAG GACCTGGTGG ATTGCCTGAC TCTGACGGTG ATGGATTTGT AGATACTGTA
GATAAATGTC CTAATCAACC AGAAACATTC AATGGCATTT TAGATAGAGA TGGTTGTCCA
GATGACTTTG GTTCTGGTGA CCGTGATAGA GATGGAGTTC CTGATGCAAT TGATGCATGT
CCACTAAGTC CAGAAACTTA CAATAGATTC CAAGATGCAG ATGGTTGCCC TGATGCACAA
AATGATTTAG CTTTTATTGA TACCGATAAT GATGGCATAA AAGATTCACT AGATTTGTGT
CCAACAGAAC CTGAAGTGTA TAATGATTAT TTAGACACTG ACGGTTGTCC AGATGTTACA
TTAGATTCTG ATTTCATTGA TACTGATGGT GATGGAATTG CAAATCAATT TGATATTTGT
CCTACTGATC CTGAAACTTT TAATCGCTTT GCTGATTATG ACGGTTGCCC AGATTCTATT
CCATCCTTTA TTGGTTCTTT AAATGATGCA GATGGTGATA GAATAATGGA TACTGATGAT
GCATGTCCAC TAGAACCTGA AAGATACAAT GGATTCCAAG ATGATGACGG TTGCCCAGAT
ATTCCACCTT ATACAAGTGA TAGTGATTCA GACATGGATG GAATTCCAAA TAGTATTGAT
CAATGTCCTA CTGTGAAAGA AACTTACAAT AAATTCCAAG ACCTTGATGG CTGTCCAGAC
TTTGTTGCAG ACAACAGAGG TGCAATTGAT TCAGATGGTG ATGGCATAAT TGACAACCGA
GACTTGTGCC CAAATCAACC AGAAATTCAT AATGGATTCC AAGACCGAGA TGGTTGTCCT
GACTCTTATG GTACTGGTGA CCGTGATAGA GATGGTATTA TTGATGGATT AGATCAGTGT
CCTACTGCAA AAGAAACTTA CAACAAGTTC CAAGATTCTG ATGGTTGTCC TGATTCTGTG
TCTGGACTTT CAGCATTGGA CTTTGATGGT GATGGAATTT CTGACTTGAA TGATCAATGT
CCACTAAGAC CTGAAACATT CAACGGATAT CAAGATAAAG ATGGATGTCC AGATGAATCT
GTTATGGATT CAGATGGTGA TGGAATTAGA GATAGTTTAG ATCAATGTCC ATCAGAACCT
GAAACATGGA ATAGATACAA CGACGATGAC GGATGTCCAG ATACTCCTGG TGAAGGAGAT
TCAGACTTTG ATGGTATTTT AGACTCTGTA GATGAATGTC CACTAGATAG AGAAAGATAC
AATGGATTCC AAGATGAAGA TGGCTGTCCT GACTATCCTG ACTTCCTTAC TAGCATGGAT
TCAGACTTTG ATGGAATAGT TGACTTGGAA GATCAATGTC CTCTTGTTCC AGAAACTTAC
AACAGATTCC AAGACTTGGA TGGTTGTCCT GATTCTGTTG CAGATAACAA AGGTTCTCCA
GATTCAGATA ATGATGGAAT AAATGATTAC AATGATCATT GCCCAAATCA ACCAGAAACC
TATAATGGAA TTCTTGATCT TGATGGCTGT CCTGATGATT ACATCCTAGG TTATGACAGA
GATCAAGATG GTATTCCTGA TGCAATTGAT GCATGTCCAA CTGCAAAAGA AACTTGGAAT
AAATTCCAAG ACGATGACGG ATGTCCAGAT ACAATATCTG AATCATTTGT AGTTGATTCA
GACAATGATG GAATTCTTGA TGTTGATGAT GATTGTCCAC TAGTTGCAGA AAATTACAAC
GGCTTCCAAG ATGAAGATGG TTGCCCTGAT GTTGATAATG TCCAACCAGA TACCGATGGT
GATGGTGTAC CTGATGTAAT GGATATGTGC CCAACTCAAA ATGAAACCTG GAATAATTAT
GTTGACTATG ACGGATGTCC AGATACTCTT CCAATTGGAA ATGTTGTAAT TGATAGTGAT
GGCGATGGAA TTAATGATAA TGTTGATTTG TGTCCAAATC TTAAAGAATC TTGGAACAAA
TACAACGATG CTGATGGTTG CCCTGATATT GCTCCAGAAC AATCAAGATA CAAACACGAT
GCTGATTTAG ATGATATCAT CAATGAACAT GATCTCTGCC CACTTGAACC AGAGGACTAT
GACGGTGACA ATGACACTGA CGGTTGTCCA GACTCTTAG
 
Protein sequence
MPMNPAFAAT DLDNDGVDDT VDACPNLRED NEGAIDGCPS NFVPWYDEDY DGIEDHIDQC 
PNLREKYNKF QDEDGCPDIS PQGGPGGLPD SDGDGFVDTV DKCPNQPETF NGILDRDGCP
DDFGSGDRDR DGVPDAIDAC PLSPETYNRF QDADGCPDAQ NDLAFIDTDN DGIKDSLDLC
PTEPEVYNDY LDTDGCPDVT LDSDFIDTDG DGIANQFDIC PTDPETFNRF ADYDGCPDSI
PSFIGSLNDA DGDRIMDTDD ACPLEPERYN GFQDDDGCPD IPPYTSDSDS DMDGIPNSID
QCPTVKETYN KFQDLDGCPD FVADNRGAID SDGDGIIDNR DLCPNQPEIH NGFQDRDGCP
DSYGTGDRDR DGIIDGLDQC PTAKETYNKF QDSDGCPDSV SGLSALDFDG DGISDLNDQC
PLRPETFNGY QDKDGCPDES VMDSDGDGIR DSLDQCPSEP ETWNRYNDDD GCPDTPGEGD
SDFDGILDSV DECPLDRERY NGFQDEDGCP DYPDFLTSMD SDFDGIVDLE DQCPLVPETY
NRFQDLDGCP DSVADNKGSP DSDNDGINDY NDHCPNQPET YNGILDLDGC PDDYILGYDR
DQDGIPDAID ACPTAKETWN KFQDDDGCPD TISESFVVDS DNDGILDVDD DCPLVAENYN
GFQDEDGCPD VDNVQPDTDG DGVPDVMDMC PTQNETWNNY VDYDGCPDTL PIGNVVIDSD
GDGINDNVDL CPNLKESWNK YNDADGCPDI APEQSRYKHD ADLDDIINEH DLCPLEPEDY
DGDNDTDGCP DS
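
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It is not the tool the database used. It builds a codon table whose amino acid assignments are those of the standard genetic code, which translation table 11 shares for non-initiating codons; alternative start codons permitted by table 11 are not handled, which does not matter here because the gene begins with ATG. All names (`CODON_TABLE`, `clean`, `translate`, `gc_fraction`) are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments of the standard code,
# which table 11 shares for elongation codons. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 9 bases of the gene sequence listed above.
print(translate("ATGCCTATGA ATCCTGCATT TGCAGCAACT"))  # MPMNPAFAAT
print(translate("GACTCTTAG"))                          # DS
```

The two printed fragments match the first ten residues and the last two residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein Length and GC content fields to be checked in the same way.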