Gene Information |
Locus tag | Nmar_1528 |
Symbol | |
ID | 5774413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1391235 |
End bp | 1393613 |
Gene Length | 2379 bp |
Protein Length | 792 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641317179 |
Product | ATPase |
Protein accession | YP_001582862 |
Protein GI | 161529036 |
COG category | |
COG ID | |
TIGRFAM ID | |
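
The coordinate fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive (which matches the listed Gene Length), and that the unspliced CDS ends in a single stop codon that is not counted in the Protein Length. The function names `gene_length` and `protein_length` are illustrative only and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Nmar_1528.
length_bp = gene_length(1391235, 1393613)
print(length_bp, protein_length(length_bp))  # prints "2379 792"
```

Because the gene lies on the minus strand, the coding sequence corresponds to the reverse complement of the replicon between these coordinates; the length arithmetic is unaffected by strand.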
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00000000453148 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
Sequence |
Gene sequence | ATGCCTATGA ATCCTGCATT TGCAGCAACT GACTTAGATA ATGACGGTGT TGATGATACT GTTGATGCAT GTCCTAATTT ACGTGAAGAT AATGAAGGTG CCATAGATGG TTGTCCATCA AATTTTGTTC CATGGTATGA TGAAGATTAT GATGGAATAG AAGATCATAT TGATCAATGT CCAAATCTTA GAGAAAAATA CAATAAATTC CAAGATGAAG ATGGTTGTCC AGATATTTCT CCTCAAGGAG GACCTGGTGG ATTGCCTGAC TCTGACGGTG ATGGATTTGT AGATACTGTA GATAAATGTC CTAATCAACC AGAAACATTC AATGGCATTT TAGATAGAGA TGGTTGTCCA GATGACTTTG GTTCTGGTGA CCGTGATAGA GATGGAGTTC CTGATGCAAT TGATGCATGT CCACTAAGTC CAGAAACTTA CAATAGATTC CAAGATGCAG ATGGTTGCCC TGATGCACAA AATGATTTAG CTTTTATTGA TACCGATAAT GATGGCATAA AAGATTCACT AGATTTGTGT CCAACAGAAC CTGAAGTGTA TAATGATTAT TTAGACACTG ACGGTTGTCC AGATGTTACA TTAGATTCTG ATTTCATTGA TACTGATGGT GATGGAATTG CAAATCAATT TGATATTTGT CCTACTGATC CTGAAACTTT TAATCGCTTT GCTGATTATG ACGGTTGCCC AGATTCTATT CCATCCTTTA TTGGTTCTTT AAATGATGCA GATGGTGATA GAATAATGGA TACTGATGAT GCATGTCCAC TAGAACCTGA AAGATACAAT GGATTCCAAG ATGATGACGG TTGCCCAGAT ATTCCACCTT ATACAAGTGA TAGTGATTCA GACATGGATG GAATTCCAAA TAGTATTGAT CAATGTCCTA CTGTGAAAGA AACTTACAAT AAATTCCAAG ACCTTGATGG CTGTCCAGAC TTTGTTGCAG ACAACAGAGG TGCAATTGAT TCAGATGGTG ATGGCATAAT TGACAACCGA GACTTGTGCC CAAATCAACC AGAAATTCAT AATGGATTCC AAGACCGAGA TGGTTGTCCT GACTCTTATG GTACTGGTGA CCGTGATAGA GATGGTATTA TTGATGGATT AGATCAGTGT CCTACTGCAA AAGAAACTTA CAACAAGTTC CAAGATTCTG ATGGTTGTCC TGATTCTGTG TCTGGACTTT CAGCATTGGA CTTTGATGGT GATGGAATTT CTGACTTGAA TGATCAATGT CCACTAAGAC CTGAAACATT CAACGGATAT CAAGATAAAG ATGGATGTCC AGATGAATCT GTTATGGATT CAGATGGTGA TGGAATTAGA GATAGTTTAG ATCAATGTCC ATCAGAACCT GAAACATGGA ATAGATACAA CGACGATGAC GGATGTCCAG ATACTCCTGG TGAAGGAGAT TCAGACTTTG ATGGTATTTT AGACTCTGTA GATGAATGTC CACTAGATAG AGAAAGATAC AATGGATTCC AAGATGAAGA TGGCTGTCCT GACTATCCTG ACTTCCTTAC TAGCATGGAT TCAGACTTTG ATGGAATAGT TGACTTGGAA GATCAATGTC CTCTTGTTCC AGAAACTTAC AACAGATTCC AAGACTTGGA TGGTTGTCCT GATTCTGTTG CAGATAACAA AGGTTCTCCA GATTCAGATA ATGATGGAAT AAATGATTAC AATGATCATT GCCCAAATCA ACCAGAAACC TATAATGGAA TTCTTGATCT TGATGGCTGT CCTGATGATT ACATCCTAGG TTATGACAGA 
GATCAAGATG GTATTCCTGA TGCAATTGAT GCATGTCCAA CTGCAAAAGA AACTTGGAAT AAATTCCAAG ACGATGACGG ATGTCCAGAT ACAATATCTG AATCATTTGT AGTTGATTCA GACAATGATG GAATTCTTGA TGTTGATGAT GATTGTCCAC TAGTTGCAGA AAATTACAAC GGCTTCCAAG ATGAAGATGG TTGCCCTGAT GTTGATAATG TCCAACCAGA TACCGATGGT GATGGTGTAC CTGATGTAAT GGATATGTGC CCAACTCAAA ATGAAACCTG GAATAATTAT GTTGACTATG ACGGATGTCC AGATACTCTT CCAATTGGAA ATGTTGTAAT TGATAGTGAT GGCGATGGAA TTAATGATAA TGTTGATTTG TGTCCAAATC TTAAAGAATC TTGGAACAAA TACAACGATG CTGATGGTTG CCCTGATATT GCTCCAGAAC AATCAAGATA CAAACACGAT GCTGATTTAG ATGATATCAT CAATGAACAT GATCTCTGCC CACTTGAACC AGAGGACTAT GACGGTGACA ATGACACTGA CGGTTGTCCA GACTCTTAG
Protein sequence | MPMNPAFAAT DLDNDGVDDT VDACPNLRED NEGAIDGCPS NFVPWYDEDY DGIEDHIDQC PNLREKYNKF QDEDGCPDIS PQGGPGGLPD SDGDGFVDTV DKCPNQPETF NGILDRDGCP DDFGSGDRDR DGVPDAIDAC PLSPETYNRF QDADGCPDAQ NDLAFIDTDN DGIKDSLDLC PTEPEVYNDY LDTDGCPDVT LDSDFIDTDG DGIANQFDIC PTDPETFNRF ADYDGCPDSI PSFIGSLNDA DGDRIMDTDD ACPLEPERYN GFQDDDGCPD IPPYTSDSDS DMDGIPNSID QCPTVKETYN KFQDLDGCPD FVADNRGAID SDGDGIIDNR DLCPNQPEIH NGFQDRDGCP DSYGTGDRDR DGIIDGLDQC PTAKETYNKF QDSDGCPDSV SGLSALDFDG DGISDLNDQC PLRPETFNGY QDKDGCPDES VMDSDGDGIR DSLDQCPSEP ETWNRYNDDD GCPDTPGEGD SDFDGILDSV DECPLDRERY NGFQDEDGCP DYPDFLTSMD SDFDGIVDLE DQCPLVPETY NRFQDLDGCP DSVADNKGSP DSDNDGINDY NDHCPNQPET YNGILDLDGC PDDYILGYDR DQDGIPDAID ACPTAKETWN KFQDDDGCPD TISESFVVDS DNDGILDVDD DCPLVAENYN GFQDEDGCPD VDNVQPDTDG DGVPDVMDMC PTQNETWNNY VDYDGCPDTL PIGNVVIDSD GDGINDNVDL CPNLKESWNK YNDADGCPDI APEQSRYKHD ADLDDIINEH DLCPLEPEDY DGDNDTDGCP DS
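
Both sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not the whole-gene GC content listed in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown above.
first_blocks = "ATGCCTATGA ATCCTGCATT"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))  # prints "20 0.4"
```

Applying `clean_sequence` to the full Gene sequence field (including the part that wraps onto a second line) should give a string whose length equals the listed Gene Length.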