Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0062 |
Symbol | |
ID | 5774179 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 49278 |
End bp | 50438 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 641315679 |
Product | glycosyl transferase family protein |
Protein accession | YP_001581400 |
Protein GI | 161527574 |
COG category | [M] Cell wall/membrane/envelope biogenesis [S] Function unknown |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis [COG2246] Predicted membrane protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.963311 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGGTAG AAAAGCCAAA TAATCAGATC TCAATCATAA TTCCTACATA TAATGAATCT CAAAATATTC TCAACATTTT AAAATCAATT AAAGAAAATT TACCCAAAAA TATTTCGGCT CAAGCAATTG TTGTTGATGA TAATTCTCCT GATGGAACAG GGAAAATTGT TGATGATTAT CTAAAAAATT TGAAGAAAAT TACAAATTAT ACAATTGAAG TCATTCATAG AAAAACAAAA GATGGTTTAG GTTCTGCAAT TCTTAAAGGA ATTCAGCAAG CAACAGGCGA TACAATTGTT GTCATGGATT CTGATTTTTC TCATCCACCA CAAATTATTC CAAAATTAGT TGAATCAATA AAAAAATACC AATACGACAT TGCAGTTGCA TCACGTTACA TTAAAGGTGG TAAAATTGAA AATTGGTCTG CAAAAAGAAA ACTAATTAGT AAATTTGCAA CACTTATTGC AAAAAAAGGA TTGGGAATTA ATACAAAAGA TCCAATGTCT GGGTTTTTTG CATTCAAAAA AAATATTCTT AATGGACTAA ATATTGACGC AATTGGTTAC AAAATCCTTT TGGAAATTCT TGTTAAAACA AAAAATGTTT CAATTACAGA AATTCCATAC ACATTTCAAG ATAGAGAATT AGGTTCTAGT AAACTAAGTA TGAAAACAGT CTTTGACTAT TACAAATCGG TTTGGAAGCT TTACAGATAT GGAAAGCCAG AAGAAGAGAA AGAGAAGAGA AAGTCTGTGA AATTTCTTTA CAAAGCAGCA AGATTCTATA CAGTTGGAGC TTCTGGATTT GTAGTAAACT ATTTGATTTC ATTATTATTT GCAGGTGGAA TTTCAGATAT GTGGTACTTG CATGCAAATG TTATTGGAAT TATTGCATCA ATTTCAACTA ATTTTATTCT AAACAAAGCA TGGACATTTG GAGATAGAGA TTTCAGAATT AAAAAGACAA TGTCACAATA TGGCAAGTTT GCATTGTTTA GTTCGCTAGG TGCATTAGTA CAATTAGGAA TGGTGTATTT CCTAGTGGAT AGTGCTGAGA TTTCATATCC ATTAGCATTA ATTTTAGCAG TGGCTACAGC AGCTTTTGGA AACTTTGTAT TAAACAAGAA ATTTACCTTC AAAGAAAAAT TGCTAAACTA G
|
Protein sequence | MLVEKPNNQI SIIIPTYNES QNILNILKSI KENLPKNISA QAIVVDDNSP DGTGKIVDDY LKNLKKITNY TIEVIHRKTK DGLGSAILKG IQQATGDTIV VMDSDFSHPP QIIPKLVESI KKYQYDIAVA SRYIKGGKIE NWSAKRKLIS KFATLIAKKG LGINTKDPMS GFFAFKKNIL NGLNIDAIGY KILLEILVKT KNVSITEIPY TFQDRELGSS KLSMKTVFDY YKSVWKLYRY GKPEEEKEKR KSVKFLYKAA RFYTVGASGF VVNYLISLLF AGGISDMWYL HANVIGIIAS ISTNFILNKA WTFGDRDFRI KKTMSQYGKF ALFSSLGALV QLGMVYFLVD SAEISYPLAL ILAVATAAFG NFVLNKKFTF KEKLLN
|
| |