Gene Information |
Locus tag | Nmar_1201 |
Symbol | |
ID | 5774584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1096377 |
End bp | 1101503 |
Gene Length | 5127 bp |
Protein Length | 1708 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641316845 |
Product | hypothetical protein |
Protein accession | YP_001582535 |
Protein GI | 161528709 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACAACG AAATAGGACG TAAAATAACT AGTCTTACAT TAATGACAAT TATGGTTGCC GGTGGACTGA CCTTTGCAAT TCCAGGTGTA ATGCCTGAAG CAATGGCCGC CAACGCCAAC CTATTTGTTT CCGCAGAAAA CTCACAGTTT GATAACTACA TGTCAGGACC TCAAGTAATT GAGGTCGTCG TAATTGATAG TGACATCAAT GATACAGATG AGGCAAAAGG TGAGCCAGAC GTAACTGTCA ACGGCAAAAA TCTGAGAATG GTTCAAGCAG TTGATGGTAA CTGGTATGGT TACTTTGCAG ACAGAGACCA AGCCCAAATT GCAGACTCTA CTGCTACAAC AATAGGTTCT GGATTGGACT TTGGTGTATT CTGTGCAGCA TCCTCTGGTA CAGCATTCTT GGGATTCTCA ACAACTGAAA CAGATGGTAT CGCAGTTCCA ATTACTGTTG CAAATGCAAC AGCAACTGGT AACGGTACAC AAACCGGTAG CAACAGTGGT GGTGCAATCA CTACTACTTG TGCAGCAAAT ACACTTGATG CATCTACTGC TAATGGTACC ATCAATGTTG TAAGAGAAGC CAAAGATCCT GTAGCAGTAT CTGGTAGTGT CAGTGCAGGC CAAATCGGTC TAAAGAACGG TACCAACAAT GGTGCTAACT GGCCTTTCAT TCAGCTCTAT GAATTAAACC CAACAGGTAA CGTTGTCGTA CAGTACAACA AAGGTGGTGG TGTTCAATCA ACAACACTTA CTTTTGACAC TGTTGATCAA TTTGCCGAAT TAAGTTTGGA TAGAGCTGTA TACCCAAGAG TATCACAAGT TCACGCAACA ATTACTGACT TATGGTTAAA CATTGATCCA ACTGATGAAG ACTCTTGGAC ATTTGCTACC AACACGAAGA ATACAACATC ATCCTTCAAT GTTGATACAT TTTACCAAGT GTTCGATGAA AACGGTGCTT CAGGTGGAAG CGCATTAACC CTGAGAACAA CATTACCAAA CTTGATGTGT GAGGACAATT GTGTATTGAA ATTAGATGTC GATGCACAAA GCTCTGGCAC ACCAGTTGTG ACAATTCAGG ATAACGGCGA TTCAATCCTT ACCCAACTTA ACTCAACTGC CAACACCAAC AATGCATCTG CATTTGGTAT TTCCGCAGAG ACAGGTGAGT TAGGTGTAGG TTCCATCCCA GTAACCATCA CCGAACAAGG TCCAAACAGT GGTGTCTTTG GTACTTATGA TGAATCTGAT ACATCAGTAC TAAAGATTAC AGATAATGCA AAGAGGGGAA CCTCTGCATC ACTCGATTAC AATGAAACAC CTCAAACTAT CCTTGTAGGA TTCTCCTTTG CTAGTATTGA TATCCAGCCA GTTGATGATG AATGGACCTC TGGTGAAGAG ATTCCAGTAG TCATTGTTGA TGCTGATCAA AACAAAAACA GCAGAGTAGA TGAAGACTTA GACCTAAACA ACCCAGACGT AACCCTGATT CCAGCACTCG CTACTGGTGA TCCATTCACT ATTGATGAGG GTGGAACTCC TAGCTTGATC TTTACCAATG GTACAAACGG TGATGATAGT ATCTTTGATA CAGGTGCAAT AAACAACACT GCAGCAGGTC AAGTAGGTAA CTTTACACTC AACATCAACG TAACTGCATT CACCGGTGCA ACCAACATCA CTTCAACTGA ATCTGTTGAT ACATTCAGTA AGAGATTAAT CTCTGCACAG ACTGCCAACA GTTCAGCAAA CTTTGATGTT GACTTTGCAA TCATTGATTT AGGAAGTGCA 
ACTATGAAAA CCCTAAAAGA AACTCTAGTT GATGAAGATA ACACCACTAT TGGTTTTAAC TTCTTTAACT ATGATGTGAA ATCATTAGGG GCAGATACAG TAAGTATCGC ATTGCTTAAC ACCACAGGAA ATATTCTCCC ATGGGTTAAC AACGATACAA GAAATATTGA CAAAAACAAT GCAATATTGT TAGTCAGCAA CTCAACTGAG TCACAGGATT ACATTGATTT GACCAATGCA GTATCTGATG CTATTTACGG ATCAACTAAT GATGATGATA ACGTAAACGT CGGACTTGCA ATGTACTTCA CAGGTGTCGA TGACATCGCC GCCAAAGAAG TAATCGTCAT GGACTTCTTC TCATTTGGTT TCACTGATGA TGGTGCACAA TCTGGTGAAA GATTTGCAAA CCAAATAATC AGAATTGAAG CTGAAGAGAC AGGTGACAAC ACAAGTATCT TTGAAGGTTC ACTTGAGTAT GTCATGGTTA ACCAAATTAA CATACAGGAT GCTGGTACCT TCGCTGGTAT CACACCAATC GCAGATGATC CAACATTCAT CGTAATTGAG GATCTTACTG ACGAGGATGC ACCAAGAGTC AACTATAATG ACTTGGGTGC AGATGGTGTA ACAACTCCTG TATCTGACCA AGAAGATGCT CCAAGCCACT CTGGTGTTGT ATCTCTAGAT GCTGATTCTT ACAAGATTGC TGACACTGTA GTAATAACTG TAGAAGACTT GGATCTTAAC GTAGACTCTG ATCTTATTGA CATCTTTACT GTTGTCTCTG ATACTGGAGC AGATACCTTT GATGTTGTTG GTTCAGCAAC TACACAGGAT CTCAGCTTTG GTGAACTCGG TAGATTACTA GATGTTAGCT TTAATGATGT AGTATGGAAG ACTGCCCAAG GTACCTGTGA TGACAACCAA GCAAGTAGTG ATACTGGTCT TGGTGCAACT GGATTCACTC TAGTTGAAAC AGGTAAAGCA AGTGGTGTCT TTGTTGGTGA TTTCCAAATC CCAGTTGATT GGTGTAGATC TGACAATGCT ACTCCTGAAA CCATAACCGG TCTTGATATC GAAGTTAACT ATGTTGACTT CAGAGATGCA TCTGGTGAAA TCATTCAGGT TAGTGATTCC GCAGGTATCA GAGCACACAC TGGTTCAGTT AGTCTTGACA GAACTGTCTA TCCAGTACCA TTTGGTACGG TAGGAGAATC TAATGCAGCA GCAAACGCAT CTCCAAATGG AAGATCAGTA TTCCCAATTC ACGCAACAGG AGTCACTAGT ACTATTGATT CCACTGAAGA ATTGGCAAAT GGTGATCTAA CTATCCACGT CAGAATTAAC GATCCAGACT TTGACGAAAA CCCATCTGGT GAAGATACCA TGGACCAAGA TGATGCACTC AAAATCTCTG TTATCAGAGG TTCTGATAGT GTAGTTCTCG GCTATGCAGG CGCTTCTGAA AGAACCGGAA AGATTGATGT TGGTGGTAAC AATGGAACCA TCTCAAACAT CAGAAGTTTC GGTGAAATGG TTGAAATCGC ACCAGATGCT GGTGTTTTCG AGTTAGAGCT AGACATTAAA TTCACTGACG GTCCAGCATC AGCAAAATGT AACAGTCATG ACACCATCTA TACTGCTACC GACGGTACTA CTGGTAAGGC TGATACCAAC AGATTTGATG ACGGTGCACC ATCTGGCCAA GAATACTGTA TCTTACAAGG AGATATTCTC CAAGTAGAAT ACACTGATCC AGCTGACGCA TCCGGCGATG CAAATACTGT TACTGATTCT GCAACATTTG 
ACCTAAGAAA CGGTGTATTA CAATCTGACA AATCCGTATA CATTATCGGT TCAGACATGA TCTTAACACT CATTGAGCCA GACTTTGATC TGGACAATGA CAGTGCTGAG ACCTATGACT TGGACTTGAT CGAATGGGAC TCTGATGCCG CCACCACTAC CATGGGTAAC AAAGGTGTAA CCGGCGCAGC AGCTGCATTT GACCCAGAAC CAACTGACTT TAGAGAAACA GGTGACTCTA CTGGTATCTT CCAGATTGTC ATCGAAATTC CTGAAGAACT TGATGGTGAC AGATTAGAAA GAGGTGAGGA AATCATCCTA GAGTACACTG ACTGGGGTCC ATCCGGATCT GATTATGTAG GAGATGAAGA TGAAGATGTC AACTTGACAA TCTACACTTC AAACTTCGGA GCAACTGTAG AACTTGACCA AAAAGTATAC ACTTGGACTG ACAAAGTATA CATTACTATC GTCGCACCAG ATCACAACTT TGACAGTTTC CTAGTTGATG AAATCGGCGA ATCTGACAGA GATCCAATTA AGGTCTCTAC CAGAGGATTT GATCTTGACA ACTATAAACT CGTCGAGACT GGTACTGACA CCGGCATCTT TACTGGTGAA GTAATCCTCA CAGGATTTAC TTCCCATGAT GCTGATGGTG ATGGAACTAC TGGCGATGCA AAAGGTACCA CTTCTGGTAC TGGTCCAACA GATGGTCTCT TGGCCACTGA CGATGATGAC GGACTTACTA TCTCCTTCGA ATTCTCTGAA GATGAGACAA TTGTAGGCTC TGCCCTTATC AGATGGAACA TCGGTGAAGT CCAATGGCTT GAGGCAAGCT ACCCAGCTAG CGGAACAGGT GTTGTAAGAG TAATTGATCC AGACATGAAC TTAGATCCAG AAGCAGTCGA CAACTTTAAT GTTGACGTGT GGTCTGACTC CGATGCCGGA GGTATTGACC TAACTTTAAC TGAGACTAAT GAGGCAACCG GAATCTTTGA GGGAACTGTG TTCTTCACAG TTCATAATGA GTCATCTGGT CACAGACTCA GAGTTTCAGA AGGTGACACA GTCACTGCAG AATATGAGGA CAATACACTA CCTGATCCAT ACACAACTGC AGATGAACTT GATATTACTG CCACTTCACT AATTGGCACT GTAGTACCAC CTCTCGAGAG AGCACCAGCT GCTAACTTGA GAGCCGTTGA CGCATTCGGT AACAGCTTAG ATTCTGTTTC CGTTGACCAA CAGGTACAAA TCAGCGCTGA CTTAGCAAAT GGTCAGGATA GAGAGCAATC ATTTGCATAC TTGGTACAGA TTCAGGATGC AAACGGTGTT ACCGTCTCAC TAGCATGGAT TACAGGTTCA CTATCTAGCG GTCAATCATT CAGCCCAGCT TTATCATGGA TTCCAACTGA AGCAGGAACA TACACTGCTA CTGCATTCGT CTGGGAGTCT GTTGATAATC CTACGGCATT ATCACCACCA GTTAGTACAA CTGTCAACGT AAGTTAG
|
Protein sequence | MYNEIGRKIT SLTLMTIMVA GGLTFAIPGV MPEAMAANAN LFVSAENSQF DNYMSGPQVI EVVVIDSDIN DTDEAKGEPD VTVNGKNLRM VQAVDGNWYG YFADRDQAQI ADSTATTIGS GLDFGVFCAA SSGTAFLGFS TTETDGIAVP ITVANATATG NGTQTGSNSG GAITTTCAAN TLDASTANGT INVVREAKDP VAVSGSVSAG QIGLKNGTNN GANWPFIQLY ELNPTGNVVV QYNKGGGVQS TTLTFDTVDQ FAELSLDRAV YPRVSQVHAT ITDLWLNIDP TDEDSWTFAT NTKNTTSSFN VDTFYQVFDE NGASGGSALT LRTTLPNLMC EDNCVLKLDV DAQSSGTPVV TIQDNGDSIL TQLNSTANTN NASAFGISAE TGELGVGSIP VTITEQGPNS GVFGTYDESD TSVLKITDNA KRGTSASLDY NETPQTILVG FSFASIDIQP VDDEWTSGEE IPVVIVDADQ NKNSRVDEDL DLNNPDVTLI PALATGDPFT IDEGGTPSLI FTNGTNGDDS IFDTGAINNT AAGQVGNFTL NINVTAFTGA TNITSTESVD TFSKRLISAQ TANSSANFDV DFAIIDLGSA TMKTLKETLV DEDNTTIGFN FFNYDVKSLG ADTVSIALLN TTGNILPWVN NDTRNIDKNN AILLVSNSTE SQDYIDLTNA VSDAIYGSTN DDDNVNVGLA MYFTGVDDIA AKEVIVMDFF SFGFTDDGAQ SGERFANQII RIEAEETGDN TSIFEGSLEY VMVNQINIQD AGTFAGITPI ADDPTFIVIE DLTDEDAPRV NYNDLGADGV TTPVSDQEDA PSHSGVVSLD ADSYKIADTV VITVEDLDLN VDSDLIDIFT VVSDTGADTF DVVGSATTQD LSFGELGRLL DVSFNDVVWK TAQGTCDDNQ ASSDTGLGAT GFTLVETGKA SGVFVGDFQI PVDWCRSDNA TPETITGLDI EVNYVDFRDA SGEIIQVSDS AGIRAHTGSV SLDRTVYPVP FGTVGESNAA ANASPNGRSV FPIHATGVTS TIDSTEELAN GDLTIHVRIN DPDFDENPSG EDTMDQDDAL KISVIRGSDS VVLGYAGASE RTGKIDVGGN NGTISNIRSF GEMVEIAPDA GVFELELDIK FTDGPASAKC NSHDTIYTAT DGTTGKADTN RFDDGAPSGQ EYCILQGDIL QVEYTDPADA SGDANTVTDS ATFDLRNGVL QSDKSVYIIG SDMILTLIEP DFDLDNDSAE TYDLDLIEWD SDAATTTMGN KGVTGAAAAF DPEPTDFRET GDSTGIFQIV IEIPEELDGD RLERGEEIIL EYTDWGPSGS DYVGDEDEDV NLTIYTSNFG ATVELDQKVY TWTDKVYITI VAPDHNFDSF LVDEIGESDR DPIKVSTRGF DLDNYKLVET GTDTGIFTGE VILTGFTSHD ADGDGTTGDA KGTTSGTGPT DGLLATDDDD GLTISFEFSE DETIVGSALI RWNIGEVQWL EASYPASGTG VVRVIDPDMN LDPEAVDNFN VDVWSDSDAG GIDLTLTETN EATGIFEGTV FFTVHNESSG HRLRVSEGDT VTAEYEDNTL PDPYTTADEL DITATSLIGT VVPPLERAPA ANLRAVDAFG NSLDSVSVDQ QVQISADLAN GQDREQSFAY LVQIQDANGV TVSLAWITGS LSSGQSFSPA LSWIPTEAGT YTATAFVWES VDNPTALSPP VSTTVNVS
|
| |
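The coordinates and lengths listed under Gene Information can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete and ends in a single stop codon (the record reports the gene as unspliced and not a pseudogene). The function names `gene_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table for Nmar_1201.
START_BP = 1096377
END_BP = 1101503


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len: int) -> int:
    """Number of residues, assuming a complete CDS with one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 5127
print(expected_protein_length(length_bp))  # 1708
```

Under these assumptions the results agree with the listed Gene Length (5127 bp) and Protein Length (1708 aa).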