Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0067 |
Symbol | |
ID | 5773199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 55201 |
End bp | 58062 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641315684 |
Product | hypothetical protein |
Protein accession | YP_001581405 |
Protein GI | 161527579 |
COG category | [S] Function unknown |
COG ID | [COG1615] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATTTTGT ATAGCGCCTC TACTGATAAG CAGACTCCTC CTCCTGATGC TGGAAAATAC ATCAGATTAG GTATTATTGC AATCATTGGT ATTGTTATTT TTGCAATTGT GGGAAATCAG GCAGTTATTT TATCGATGAA TTTTACAGAA TTTGGCGATC AATTTACCAA GCCTCTCTAT TACACACTTG TATCTACTAT TATTTTAGCT GCCATTGCAT TAGTTCGTGT TAATATCGCT GGACGATCTT CTATTTTCTG GTATGCAATT AATACTGCAA TTGGATTTTT AGGTCGAAGT GGACAACAAC CAATAACTAA TGCTGTACCT AGTTTTAGAG ACTACAAATT AAGTTCAGGC CAATTTATTC TCTGGCAAAT TACCAAAATT TTCCTCTTTG GTGCATTCTT TGCAAATATC ATGTTTGGAT TTGCTGCAAT GTCGTTTATT GATGGCAATT ATCTTGGAAT AGAAAATCTA CCAAAATTAT TTTCATTACC ATTTGTAACT CCCGAAACTG ATCCAAATTA TGCTGCAGAA AACGTAGTTC CAATGATACC TGCATTAGTA ATTTTGATAC CTCCAATACT TGCTGCAATT GGATTACGTT TAGTTTTGTA TGTTGGCTTG CACAGAATAA TTAATGTAAT CACTTCATTT CTTCAAGACT CTAATGAAGG CAAACCCAGA TACCTAAATT ATGTTTCTAC CATTGAAGGA ATAATTGGTA TTGGAATACT CTGGGCAGGA TTCAATCTCT TTTTCACTGA TCAAATCGAT TATAACACAA GATATGTCAT CGGTGGTACA CTTGTTATTG GTTCTGCATT AATTGCATTT TCAGTAGTTG ATAGGATTAG AGCACGTGTT CTAACTCATA TGTTTAAGAG AGATGTATAC ATTAGAATTC TTACAATAAT TGCAATTGCA ATAATTATTG CGGGTGTTGT TTCTGTAAAT AATAGCATTG CCGATGCAAA AAAGATTGAA TTTTTGGGAC CATATACTGC CCAACAAATT GGTGTGAACA GATATCTTGG TGAACTAAAT AATATACAAG AAAACACACA TGATGTCCAA CTCACATCTG TTTCTCCAAA CAATATCAAG AATTATGTCT CTCAAAACAG CGATGTTTTA GATGTAATTC GAGTTTGGGA CTGGGAAGCA GCTTTTGCCA AATTAAAACC TGAAATTGGT TTGATCCCTT ATGTTGACTT TGAAGATAAT GATATTTTGA GATTTAACAA TACTTTGTAT TGGACAGCCT CCATGAAACC AATTCTTCCA ACCTCTGTTA GCCTTGAAAA TAGATGGTAC AATGAGCATT TTGTATATAC ACATGTTCCA GAAGGATTCC TTACTTTAGA AGCAACAGAC GGACAAATTG TTGATAGTGA GCAGTTTTTC AAACAAAGAG AAATTTACTA TGGTGAAGGC GGTTTATTTG AACAAACCTG GTCAGGTTAT CCTAATTCTA GGGGATCTGA AAGCGCAGAA CTTGGTGGCG TTTCATATTC TGGATTAGGA GGATTAGATG TTTCACCTCC ATTGAGTTGG ATATTTGAGC CAAACTTTTT GCTTTCATTT CCAGGAGAAT CTGTTCATAT CATGAGATAC AAAGATGTTC ATGATAGAAT GCAAACACTG TATCCTTATT TCCTCTATGA CTTGTTTGGT AAAGAACTAG ATTCACTTCC TGTTACTGAT GGTGAAAACT CCTATTGGTT AATTCCATTA ATCATTGGAT TTGATACACG AGATGTTCCA TGGTCTGTTG GAAATCCATA TTTGCGTTTA GTTGGATATG CTCTTGTTGA TTCATACGAT GGTGACATTC AATTACTAAA GACAGGAGAT GATTTCTTTA CAGAAATGTT TGCAAGTCAA TATGAAGAAC AATTCCAACC AATGCCTGCT TGGTTAGAGG AACAGATCAG ATATCCTGTT GAGTTATTCA ACTGGAAGAC TGAGATGTAC AATGTTTACC ATGTCACAAA TGTTGAAACA TTCATCCAAG CAAATGAATT TTATGAGATT CCTCGTGGTC TTGATACCTA TTATGTTGAG GCAAAACCAC CAGGATTTGA TCAAACTGAA TTCTTAGGCT TATTATCATT AGAATTACGT GGATCTGAAG GAAGAAACCT TGCTGGATAT ATGGTAGTTG AAAACGATCT TACTAATCTA GGAGACTTGC AGTTCTATGA AGTTCCATTA GATTCTGAAA CAAAACTAAT TGGTCCAACC GCAGTTAGAG AAGCACTCGA TAGAGATCCT GAATTTGCTC AATTAAAGAC ATTATTGAGA AATCCAAGAA TTGGTGATAA TATTCTCTAC AGGGTAGGTG CTCATGATGT TTACTTTATT CCTGTTTATA CTGCTGGTGC TGGTGGTGTA GTTGCACAAT TAGGAACAAT TGCAGCTGTA GGTGCTGCAT TTAATGGAGA ATACTTTGTT GGATTGGGAG ATACTCAAGA ACAAGCATTT GAAGCTTATC TGAAGAAAGT TTCAGGCGTA GCAGGTTCTA TTACAGTAGC TGATGAGAAT TATGTTGAGC TAGACAGAAA TGATAGAATC GAAATCATCA AGAAAGTATT CGAAGAAAAT GAAATTACTA TATCTGAACC TACATCAATA CAAATTCCAT TATCATTCAA TGAAGGAGAA TTGTTCTTCT TTACTGAAAA TGAACGTGAA GAAACTGTAG AATTCTTGTC CCAGTTTATT GATGACTTTG TAAAACCACG AAGTGATAGA GTATTCATGT GGCAAGAAGA AAATAATCTC AATATCGGAA CAATATATGT CAAGGATGGT ATATCTGAGA TTCATTATGT TTCAATTGAG GTAGGCAGCT AA
|
Protein sequence | MILYSASTDK QTPPPDAGKY IRLGIIAIIG IVIFAIVGNQ AVILSMNFTE FGDQFTKPLY YTLVSTIILA AIALVRVNIA GRSSIFWYAI NTAIGFLGRS GQQPITNAVP SFRDYKLSSG QFILWQITKI FLFGAFFANI MFGFAAMSFI DGNYLGIENL PKLFSLPFVT PETDPNYAAE NVVPMIPALV ILIPPILAAI GLRLVLYVGL HRIINVITSF LQDSNEGKPR YLNYVSTIEG IIGIGILWAG FNLFFTDQID YNTRYVIGGT LVIGSALIAF SVVDRIRARV LTHMFKRDVY IRILTIIAIA IIIAGVVSVN NSIADAKKIE FLGPYTAQQI GVNRYLGELN NIQENTHDVQ LTSVSPNNIK NYVSQNSDVL DVIRVWDWEA AFAKLKPEIG LIPYVDFEDN DILRFNNTLY WTASMKPILP TSVSLENRWY NEHFVYTHVP EGFLTLEATD GQIVDSEQFF KQREIYYGEG GLFEQTWSGY PNSRGSESAE LGGVSYSGLG GLDVSPPLSW IFEPNFLLSF PGESVHIMRY KDVHDRMQTL YPYFLYDLFG KELDSLPVTD GENSYWLIPL IIGFDTRDVP WSVGNPYLRL VGYALVDSYD GDIQLLKTGD DFFTEMFASQ YEEQFQPMPA WLEEQIRYPV ELFNWKTEMY NVYHVTNVET FIQANEFYEI PRGLDTYYVE AKPPGFDQTE FLGLLSLELR GSEGRNLAGY MVVENDLTNL GDLQFYEVPL DSETKLIGPT AVREALDRDP EFAQLKTLLR NPRIGDNILY RVGAHDVYFI PVYTAGAGGV VAQLGTIAAV GAAFNGEYFV GLGDTQEQAF EAYLKKVSGV AGSITVADEN YVELDRNDRI EIIKKVFEEN EITISEPTSI QIPLSFNEGE LFFFTENERE ETVEFLSQFI DDFVKPRSDR VFMWQEENNL NIGTIYVKDG ISEIHYVSIE VGS
|
| |