Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1047 |
Symbol | |
ID | 5774406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 921429 |
End bp | 924140 |
Gene Length | 2712 bp |
Protein Length | 903 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641316689 |
Product | fibronectin type III domain-containing protein |
Protein accession | YP_001582381 |
Protein GI | 161528555 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGCAA TAATCCCGAC TGCATATGCT GATGTGGAAT TTTCATTCAA ATTTGGAACT CTTGGTTCTG ATGACGATGA ACTGGATAAT CCTACTGATG TTATAGTGAA AAGTAATGGG CGGGAAATTT ATGTTGTAGA TAACAACAAT AATCGAATAA ACGTATTTGA TGATGATGGT GATGCTGATT TTCTATATGG TACTTTCTGT AATGTAGCAC AAATTCAAGA TTGTAATGAT AATGCTGATG GCGCTGAAGA AGATGGAGAT GGACAGTTTA ACACTCCTCT TTACATTGCT ATGGATGCAT TAGGTAAATT TTTTGTAGTA GATTCTGAAA ACGAGCGAGT ACAAGTATTT GACGATGATG GAGAATTCCA ATTCAAACTT GGTTCATCTG ATAGTGGTGA CGATGAATAT CTTGGCGGTG CACAAGGTGT GACAATTCAA GATTCTTCAA GAAAAATATT TGTTTCAAAC ACTGAAAATG ACTCAATCTC TGTTTTTGGC TCTACTGGAA ATTTTCTATT TGATTTTGAC TCTTTTAACG GAAATGATGA TTTTACAAAT CCAAGTGAAA TGATCATTGA CAATTCTAAT GATTTGTTAT ATGTTGCAGA TTCTGGAAAT GATAGAATAG TCATTTTTGA GATTGTTGAT GGAACTACGT GTCCTGATGG TACAATAGAA TCAGTAGATG GAATATGTTA TGTCAAAGAA TTCGGCTCTT CGGGAGATGA TGAAGGTGAA TTTGATGATC CTTCTGGCTT GGCATTAAAT TCTGAAAATG ATTTGTTATA TGTTTCAGAT TCTGACAATG ACCGAATCCA AATTTTTGAG ATTGTTGATG GAACTACGTG TCCTGATGGT ACCGATGAAA TTATTGATGG CGTGTGTTTT GTAGATGAGT TTGGCTCTAC TGGAACAGCC GATGGACAGT TTGATTCTCC TCTTGGAATT GCTTTAGACA ATTCTAATGA TTTGTTATAT GTTGCAGATT CTAAAAATGA CCGAATTCAA GTGTTTGATC TAAACTCTGA ACCTGCCGTG CAAACCCCTG AAAAACCAGT AAATGTTGAT GCATCTCCTG TTTCCCCTAC ATCTATTATT CTTACTTGGG ATGCTCCTGA ACAACATGAA ACCATTCCAG AAATTACTGG ATACAAGATT GAGTATAGGA TAGGTTCTGA AAACTATATT GCGATAACTC CTGATGCATC TAGCAATGTA TTTTCATTTG TTCATGATGG ATTATCTGAA AGTGAAACCT ACAGTTACCG TGTATATTCT ATCAACTCTG TGGGAACTAG TAGTGCATCA TCAATTGCTA CAGTTAAACC AGAATCCACA ACCACTCCTG TAGCATTAAC TGCCTCTGCA ATTTCTCCTA GTCAGATAAA ACTTTCATGG ATGGCACCTT CTGAAACATT CCAACAATCA ATTAGCGGAT ACAACATAAA ACGCGTACTT ACTCCTGGCG TTTATGATGA TGTTGGAAGT ACTAATGGAC AGACTTTGAC ATATGTTGTT TCTAATTTGG CAACTGACAA AACTTACACG TATGCAGTTA CTGCAAATAT TGGATTTGGT CAAACAGGGG AATCAAACAC TGCCTCTGCA ACCCCTAGAT CCGATTCTAC TGATACTACT GAAGATCCAC TAGTTTCAAC ATCTGTAGAT ATGACTGTTC CTTCATCACC TATCAAATTA ACCGCATCTA CCAAAACTTC CACTTCTATA ACTCTCACAT GGGTCTCTCC TACTGATGAT GGCAATTCTG AAATTACCGG ATACAAGATT GAATCTAAAA AAGATAATGG TTCTTTTAGT ACTGTAGTTG AAGACACCCA AAACTCTTCT ACAACATATG TTCACTCTGA GCTTGTAGAG AATTCAAAAT ATGCGTATAG AGTTTCAGCA ATAAACTCTG TAGGTGTTAG TGAACCTTCA AATGAATCAT CTGCAACTGC CAAGATTACT GGTCTTGCAC TCAGTCCTAT GGGCAAATTG ACAGTAAATG AAGGCCAATT GCTGTCATTT GCAGTTAAAC TAACTGATAA TACAATCAAA GATCCTGTGT TTAGTCTAAA GAATGCTCCT TCTGGTGCAA AAATAATCTC AAACACTGGT GCATTTGCAT GGACACCTTC ATCTTCTGAT GGTGGTCAGA CATACAATAT TGTAGTTGAA GTTAGGAAAA ATGAATTATT TGATTCCCAA ACAATAGAGA TTAAAGTAAA TGACTCATCT GTTTCAGAAC CCATATCTGA ACCAACCTCC GAGCCAACAT CTGAACCTGT AAAGACAGAA CCTGGTGAAT TGGGACTGGC TTCATTTGTG GATGAATCTG TAGACCCTCA AAACTATGTT GACCGATACA ACAATGAACC AAATTACAAG AAGTGGTTTG ATGACAATTA TTCAGAATAC GATTCAATTT ATCAGGCAGT CGGATTAGAG AAACCTCCAC AAATTCCTGC TGATTTTGTA GATGAATCAA TGGATCCATA CTATTATGTT GCACGTTACA ACATTGATCA AAAATTCCAG AAGTGGTTTG ATGATAATTA TTCTCAATAC TCTTCAATAG GTCAAGCAGT TGATTTTCAT GATTCTGGAG AGCCTCAAAA GGTGTATGGT TTCTGTGGTA CTGGCACTAA ACTAATTGAT GGCGTGTGCA CTGTTATCAG AACTACTGAA TCTACTCCTT AA
|
Protein sequence | MIAIIPTAYA DVEFSFKFGT LGSDDDELDN PTDVIVKSNG REIYVVDNNN NRINVFDDDG DADFLYGTFC NVAQIQDCND NADGAEEDGD GQFNTPLYIA MDALGKFFVV DSENERVQVF DDDGEFQFKL GSSDSGDDEY LGGAQGVTIQ DSSRKIFVSN TENDSISVFG STGNFLFDFD SFNGNDDFTN PSEMIIDNSN DLLYVADSGN DRIVIFEIVD GTTCPDGTIE SVDGICYVKE FGSSGDDEGE FDDPSGLALN SENDLLYVSD SDNDRIQIFE IVDGTTCPDG TDEIIDGVCF VDEFGSTGTA DGQFDSPLGI ALDNSNDLLY VADSKNDRIQ VFDLNSEPAV QTPEKPVNVD ASPVSPTSII LTWDAPEQHE TIPEITGYKI EYRIGSENYI AITPDASSNV FSFVHDGLSE SETYSYRVYS INSVGTSSAS SIATVKPEST TTPVALTASA ISPSQIKLSW MAPSETFQQS ISGYNIKRVL TPGVYDDVGS TNGQTLTYVV SNLATDKTYT YAVTANIGFG QTGESNTASA TPRSDSTDTT EDPLVSTSVD MTVPSSPIKL TASTKTSTSI TLTWVSPTDD GNSEITGYKI ESKKDNGSFS TVVEDTQNSS TTYVHSELVE NSKYAYRVSA INSVGVSEPS NESSATAKIT GLALSPMGKL TVNEGQLLSF AVKLTDNTIK DPVFSLKNAP SGAKIISNTG AFAWTPSSSD GGQTYNIVVE VRKNELFDSQ TIEIKVNDSS VSEPISEPTS EPTSEPVKTE PGELGLASFV DESVDPQNYV DRYNNEPNYK KWFDDNYSEY DSIYQAVGLE KPPQIPADFV DESMDPYYYV ARYNIDQKFQ KWFDDNYSQY SSIGQAVDFH DSGEPQKVYG FCGTGTKLID GVCTVIRTTE STP
|
| |