Gene Nmar_1201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1201 
Symbol 
ID5774584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1096377 
End bp1101503 
Gene Length5127 bp 
Protein Length1708 aa 
Translation table11 
GC content42% 
IMG OID641316845 
Producthypothetical protein 
Protein accessionYP_001582535 
Protein GI161528709 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACAACG AAATAGGACG TAAAATAACT AGTCTTACAT TAATGACAAT TATGGTTGCC 
GGTGGACTGA CCTTTGCAAT TCCAGGTGTA ATGCCTGAAG CAATGGCCGC CAACGCCAAC
CTATTTGTTT CCGCAGAAAA CTCACAGTTT GATAACTACA TGTCAGGACC TCAAGTAATT
GAGGTCGTCG TAATTGATAG TGACATCAAT GATACAGATG AGGCAAAAGG TGAGCCAGAC
GTAACTGTCA ACGGCAAAAA TCTGAGAATG GTTCAAGCAG TTGATGGTAA CTGGTATGGT
TACTTTGCAG ACAGAGACCA AGCCCAAATT GCAGACTCTA CTGCTACAAC AATAGGTTCT
GGATTGGACT TTGGTGTATT CTGTGCAGCA TCCTCTGGTA CAGCATTCTT GGGATTCTCA
ACAACTGAAA CAGATGGTAT CGCAGTTCCA ATTACTGTTG CAAATGCAAC AGCAACTGGT
AACGGTACAC AAACCGGTAG CAACAGTGGT GGTGCAATCA CTACTACTTG TGCAGCAAAT
ACACTTGATG CATCTACTGC TAATGGTACC ATCAATGTTG TAAGAGAAGC CAAAGATCCT
GTAGCAGTAT CTGGTAGTGT CAGTGCAGGC CAAATCGGTC TAAAGAACGG TACCAACAAT
GGTGCTAACT GGCCTTTCAT TCAGCTCTAT GAATTAAACC CAACAGGTAA CGTTGTCGTA
CAGTACAACA AAGGTGGTGG TGTTCAATCA ACAACACTTA CTTTTGACAC TGTTGATCAA
TTTGCCGAAT TAAGTTTGGA TAGAGCTGTA TACCCAAGAG TATCACAAGT TCACGCAACA
ATTACTGACT TATGGTTAAA CATTGATCCA ACTGATGAAG ACTCTTGGAC ATTTGCTACC
AACACGAAGA ATACAACATC ATCCTTCAAT GTTGATACAT TTTACCAAGT GTTCGATGAA
AACGGTGCTT CAGGTGGAAG CGCATTAACC CTGAGAACAA CATTACCAAA CTTGATGTGT
GAGGACAATT GTGTATTGAA ATTAGATGTC GATGCACAAA GCTCTGGCAC ACCAGTTGTG
ACAATTCAGG ATAACGGCGA TTCAATCCTT ACCCAACTTA ACTCAACTGC CAACACCAAC
AATGCATCTG CATTTGGTAT TTCCGCAGAG ACAGGTGAGT TAGGTGTAGG TTCCATCCCA
GTAACCATCA CCGAACAAGG TCCAAACAGT GGTGTCTTTG GTACTTATGA TGAATCTGAT
ACATCAGTAC TAAAGATTAC AGATAATGCA AAGAGGGGAA CCTCTGCATC ACTCGATTAC
AATGAAACAC CTCAAACTAT CCTTGTAGGA TTCTCCTTTG CTAGTATTGA TATCCAGCCA
GTTGATGATG AATGGACCTC TGGTGAAGAG ATTCCAGTAG TCATTGTTGA TGCTGATCAA
AACAAAAACA GCAGAGTAGA TGAAGACTTA GACCTAAACA ACCCAGACGT AACCCTGATT
CCAGCACTCG CTACTGGTGA TCCATTCACT ATTGATGAGG GTGGAACTCC TAGCTTGATC
TTTACCAATG GTACAAACGG TGATGATAGT ATCTTTGATA CAGGTGCAAT AAACAACACT
GCAGCAGGTC AAGTAGGTAA CTTTACACTC AACATCAACG TAACTGCATT CACCGGTGCA
ACCAACATCA CTTCAACTGA ATCTGTTGAT ACATTCAGTA AGAGATTAAT CTCTGCACAG
ACTGCCAACA GTTCAGCAAA CTTTGATGTT GACTTTGCAA TCATTGATTT AGGAAGTGCA
ACTATGAAAA CCCTAAAAGA AACTCTAGTT GATGAAGATA ACACCACTAT TGGTTTTAAC
TTCTTTAACT ATGATGTGAA ATCATTAGGG GCAGATACAG TAAGTATCGC ATTGCTTAAC
ACCACAGGAA ATATTCTCCC ATGGGTTAAC AACGATACAA GAAATATTGA CAAAAACAAT
GCAATATTGT TAGTCAGCAA CTCAACTGAG TCACAGGATT ACATTGATTT GACCAATGCA
GTATCTGATG CTATTTACGG ATCAACTAAT GATGATGATA ACGTAAACGT CGGACTTGCA
ATGTACTTCA CAGGTGTCGA TGACATCGCC GCCAAAGAAG TAATCGTCAT GGACTTCTTC
TCATTTGGTT TCACTGATGA TGGTGCACAA TCTGGTGAAA GATTTGCAAA CCAAATAATC
AGAATTGAAG CTGAAGAGAC AGGTGACAAC ACAAGTATCT TTGAAGGTTC ACTTGAGTAT
GTCATGGTTA ACCAAATTAA CATACAGGAT GCTGGTACCT TCGCTGGTAT CACACCAATC
GCAGATGATC CAACATTCAT CGTAATTGAG GATCTTACTG ACGAGGATGC ACCAAGAGTC
AACTATAATG ACTTGGGTGC AGATGGTGTA ACAACTCCTG TATCTGACCA AGAAGATGCT
CCAAGCCACT CTGGTGTTGT ATCTCTAGAT GCTGATTCTT ACAAGATTGC TGACACTGTA
GTAATAACTG TAGAAGACTT GGATCTTAAC GTAGACTCTG ATCTTATTGA CATCTTTACT
GTTGTCTCTG ATACTGGAGC AGATACCTTT GATGTTGTTG GTTCAGCAAC TACACAGGAT
CTCAGCTTTG GTGAACTCGG TAGATTACTA GATGTTAGCT TTAATGATGT AGTATGGAAG
ACTGCCCAAG GTACCTGTGA TGACAACCAA GCAAGTAGTG ATACTGGTCT TGGTGCAACT
GGATTCACTC TAGTTGAAAC AGGTAAAGCA AGTGGTGTCT TTGTTGGTGA TTTCCAAATC
CCAGTTGATT GGTGTAGATC TGACAATGCT ACTCCTGAAA CCATAACCGG TCTTGATATC
GAAGTTAACT ATGTTGACTT CAGAGATGCA TCTGGTGAAA TCATTCAGGT TAGTGATTCC
GCAGGTATCA GAGCACACAC TGGTTCAGTT AGTCTTGACA GAACTGTCTA TCCAGTACCA
TTTGGTACGG TAGGAGAATC TAATGCAGCA GCAAACGCAT CTCCAAATGG AAGATCAGTA
TTCCCAATTC ACGCAACAGG AGTCACTAGT ACTATTGATT CCACTGAAGA ATTGGCAAAT
GGTGATCTAA CTATCCACGT CAGAATTAAC GATCCAGACT TTGACGAAAA CCCATCTGGT
GAAGATACCA TGGACCAAGA TGATGCACTC AAAATCTCTG TTATCAGAGG TTCTGATAGT
GTAGTTCTCG GCTATGCAGG CGCTTCTGAA AGAACCGGAA AGATTGATGT TGGTGGTAAC
AATGGAACCA TCTCAAACAT CAGAAGTTTC GGTGAAATGG TTGAAATCGC ACCAGATGCT
GGTGTTTTCG AGTTAGAGCT AGACATTAAA TTCACTGACG GTCCAGCATC AGCAAAATGT
AACAGTCATG ACACCATCTA TACTGCTACC GACGGTACTA CTGGTAAGGC TGATACCAAC
AGATTTGATG ACGGTGCACC ATCTGGCCAA GAATACTGTA TCTTACAAGG AGATATTCTC
CAAGTAGAAT ACACTGATCC AGCTGACGCA TCCGGCGATG CAAATACTGT TACTGATTCT
GCAACATTTG ACCTAAGAAA CGGTGTATTA CAATCTGACA AATCCGTATA CATTATCGGT
TCAGACATGA TCTTAACACT CATTGAGCCA GACTTTGATC TGGACAATGA CAGTGCTGAG
ACCTATGACT TGGACTTGAT CGAATGGGAC TCTGATGCCG CCACCACTAC CATGGGTAAC
AAAGGTGTAA CCGGCGCAGC AGCTGCATTT GACCCAGAAC CAACTGACTT TAGAGAAACA
GGTGACTCTA CTGGTATCTT CCAGATTGTC ATCGAAATTC CTGAAGAACT TGATGGTGAC
AGATTAGAAA GAGGTGAGGA AATCATCCTA GAGTACACTG ACTGGGGTCC ATCCGGATCT
GATTATGTAG GAGATGAAGA TGAAGATGTC AACTTGACAA TCTACACTTC AAACTTCGGA
GCAACTGTAG AACTTGACCA AAAAGTATAC ACTTGGACTG ACAAAGTATA CATTACTATC
GTCGCACCAG ATCACAACTT TGACAGTTTC CTAGTTGATG AAATCGGCGA ATCTGACAGA
GATCCAATTA AGGTCTCTAC CAGAGGATTT GATCTTGACA ACTATAAACT CGTCGAGACT
GGTACTGACA CCGGCATCTT TACTGGTGAA GTAATCCTCA CAGGATTTAC TTCCCATGAT
GCTGATGGTG ATGGAACTAC TGGCGATGCA AAAGGTACCA CTTCTGGTAC TGGTCCAACA
GATGGTCTCT TGGCCACTGA CGATGATGAC GGACTTACTA TCTCCTTCGA ATTCTCTGAA
GATGAGACAA TTGTAGGCTC TGCCCTTATC AGATGGAACA TCGGTGAAGT CCAATGGCTT
GAGGCAAGCT ACCCAGCTAG CGGAACAGGT GTTGTAAGAG TAATTGATCC AGACATGAAC
TTAGATCCAG AAGCAGTCGA CAACTTTAAT GTTGACGTGT GGTCTGACTC CGATGCCGGA
GGTATTGACC TAACTTTAAC TGAGACTAAT GAGGCAACCG GAATCTTTGA GGGAACTGTG
TTCTTCACAG TTCATAATGA GTCATCTGGT CACAGACTCA GAGTTTCAGA AGGTGACACA
GTCACTGCAG AATATGAGGA CAATACACTA CCTGATCCAT ACACAACTGC AGATGAACTT
GATATTACTG CCACTTCACT AATTGGCACT GTAGTACCAC CTCTCGAGAG AGCACCAGCT
GCTAACTTGA GAGCCGTTGA CGCATTCGGT AACAGCTTAG ATTCTGTTTC CGTTGACCAA
CAGGTACAAA TCAGCGCTGA CTTAGCAAAT GGTCAGGATA GAGAGCAATC ATTTGCATAC
TTGGTACAGA TTCAGGATGC AAACGGTGTT ACCGTCTCAC TAGCATGGAT TACAGGTTCA
CTATCTAGCG GTCAATCATT CAGCCCAGCT TTATCATGGA TTCCAACTGA AGCAGGAACA
TACACTGCTA CTGCATTCGT CTGGGAGTCT GTTGATAATC CTACGGCATT ATCACCACCA
GTTAGTACAA CTGTCAACGT AAGTTAG
 
Protein sequence
MYNEIGRKIT SLTLMTIMVA GGLTFAIPGV MPEAMAANAN LFVSAENSQF DNYMSGPQVI 
EVVVIDSDIN DTDEAKGEPD VTVNGKNLRM VQAVDGNWYG YFADRDQAQI ADSTATTIGS
GLDFGVFCAA SSGTAFLGFS TTETDGIAVP ITVANATATG NGTQTGSNSG GAITTTCAAN
TLDASTANGT INVVREAKDP VAVSGSVSAG QIGLKNGTNN GANWPFIQLY ELNPTGNVVV
QYNKGGGVQS TTLTFDTVDQ FAELSLDRAV YPRVSQVHAT ITDLWLNIDP TDEDSWTFAT
NTKNTTSSFN VDTFYQVFDE NGASGGSALT LRTTLPNLMC EDNCVLKLDV DAQSSGTPVV
TIQDNGDSIL TQLNSTANTN NASAFGISAE TGELGVGSIP VTITEQGPNS GVFGTYDESD
TSVLKITDNA KRGTSASLDY NETPQTILVG FSFASIDIQP VDDEWTSGEE IPVVIVDADQ
NKNSRVDEDL DLNNPDVTLI PALATGDPFT IDEGGTPSLI FTNGTNGDDS IFDTGAINNT
AAGQVGNFTL NINVTAFTGA TNITSTESVD TFSKRLISAQ TANSSANFDV DFAIIDLGSA
TMKTLKETLV DEDNTTIGFN FFNYDVKSLG ADTVSIALLN TTGNILPWVN NDTRNIDKNN
AILLVSNSTE SQDYIDLTNA VSDAIYGSTN DDDNVNVGLA MYFTGVDDIA AKEVIVMDFF
SFGFTDDGAQ SGERFANQII RIEAEETGDN TSIFEGSLEY VMVNQINIQD AGTFAGITPI
ADDPTFIVIE DLTDEDAPRV NYNDLGADGV TTPVSDQEDA PSHSGVVSLD ADSYKIADTV
VITVEDLDLN VDSDLIDIFT VVSDTGADTF DVVGSATTQD LSFGELGRLL DVSFNDVVWK
TAQGTCDDNQ ASSDTGLGAT GFTLVETGKA SGVFVGDFQI PVDWCRSDNA TPETITGLDI
EVNYVDFRDA SGEIIQVSDS AGIRAHTGSV SLDRTVYPVP FGTVGESNAA ANASPNGRSV
FPIHATGVTS TIDSTEELAN GDLTIHVRIN DPDFDENPSG EDTMDQDDAL KISVIRGSDS
VVLGYAGASE RTGKIDVGGN NGTISNIRSF GEMVEIAPDA GVFELELDIK FTDGPASAKC
NSHDTIYTAT DGTTGKADTN RFDDGAPSGQ EYCILQGDIL QVEYTDPADA SGDANTVTDS
ATFDLRNGVL QSDKSVYIIG SDMILTLIEP DFDLDNDSAE TYDLDLIEWD SDAATTTMGN
KGVTGAAAAF DPEPTDFRET GDSTGIFQIV IEIPEELDGD RLERGEEIIL EYTDWGPSGS
DYVGDEDEDV NLTIYTSNFG ATVELDQKVY TWTDKVYITI VAPDHNFDSF LVDEIGESDR
DPIKVSTRGF DLDNYKLVET GTDTGIFTGE VILTGFTSHD ADGDGTTGDA KGTTSGTGPT
DGLLATDDDD GLTISFEFSE DETIVGSALI RWNIGEVQWL EASYPASGTG VVRVIDPDMN
LDPEAVDNFN VDVWSDSDAG GIDLTLTETN EATGIFEGTV FFTVHNESSG HRLRVSEGDT
VTAEYEDNTL PDPYTTADEL DITATSLIGT VVPPLERAPA ANLRAVDAFG NSLDSVSVDQ
QVQISADLAN GQDREQSFAY LVQIQDANGV TVSLAWITGS LSSGQSFSPA LSWIPTEAGT
YTATAFVWES VDNPTALSPP VSTTVNVS