Gene Information |
Locus tag | Nmar_0165 |
Symbol | |
ID | 5773416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 152989 |
End bp | 153954 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 641315781 |
Product | glycosyl transferase family protein |
Protein accession | YP_001581499 |
Protein GI | 161527673 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
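Note: the coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration and not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names gene_length and protein_length are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information fields above.
START_BP = 152989
END_BP = 153954


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino acid count, assuming one terminal stop codon is excluded."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 966, matching Gene Length
print(protein_length(length_bp))  # 321, matching Protein Length
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.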
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.000128177 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATGAATG AAACCATGGA AACTAAAAAA AATTCTATGT CTGATGAAAT CTTCGTTTGT ATTCCTTCAT ATAATGCTGA ATCTACAATT GAGGAAGCCA TTAAACAATG TAAAAAATTT GCAACTAGTG TTTTAGTAAT TAATGATGGT TCTTCTGATA AAACTGAAGA AATTGCAAAA AAAGCTGGAG CAGAAATTAT TACACATAAA CAGAACAAAG GATATGGTGG CTCTATCAAA ACTGGTTTAT CTGAGGCTTT AAGACGTAGA GCAAAAGTAA CTATCACATT TGATGCTGAT TTACAACATG ATTCAAATGA TCTTCCAAAA ATTATTCAAC CAATTCTTAC CAACAAAGCA GATATTGTTA TTGGTTCTAG ATTTTTAGAA GATAATGATA ATGTTAAACC TTATAGAAAA TTTGGAATAA AATTGATAAC ACGACTAGTG AATTCTTTTT CTAAAAATAA TATAAGAGAT GCCGAAAGTG GTTTACGCGC TTATAACTAT GAATCTTTAA AAAAAATTGT TCCAAGTTTA GAAACACAAG GAATGGGAAT GTCTGCTGAA ATTCTTTTGA AAGCTGCTGT AAATCAATTA AAAATAATTG AAATTCCTAG AAAAGAAATG TATCCTGATA ATGTTCAAAC TTCTTCTCAG AACCCCTTAA AACACGGTTT AACTGTAGTT TTAACAATCA TCAAATTAAT TATTGAAACA AAACCATTAC CTGCATTTGG AATTCCATCT CTTGTTTTTT TCTTTATAAC TGGAATTAGT TCTTATTTTG TTGTTGAGTT TTACAATGAA ATTGGCAGAC TTCCTGTAGG ACTTACTATT TTTACTTTGA GCACTTTGAC TATAGCATTT TTCTTAATAA TGGTAGCAAC AATTTTGTAT GTATTGAGTA GAATTTCTGC AAAGTTAAAT TTTCAATTAC ATAATGATCT AAACTCAAAT AATTGA
|
Protein sequence | MMNETMETKK NSMSDEIFVC IPSYNAESTI EEAIKQCKKF ATSVLVINDG SSDKTEEIAK KAGAEIITHK QNKGYGGSIK TGLSEALRRR AKVTITFDAD LQHDSNDLPK IIQPILTNKA DIVIGSRFLE DNDNVKPYRK FGIKLITRLV NSFSKNNIRD AESGLRAYNY ESLKKIVPSL ETQGMGMSAE ILLKAAVNQL KIIEIPRKEM YPDNVQTSSQ NPLKHGLTVV LTIIKLIIET KPLPAFGIPS LVFFFITGIS SYFVVEFYNE IGRLPVGLTI FTLSTLTIAF FLIMVATILY VLSRISAKLN FQLHNDLNSN N
|
| |
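Note: the Gene sequence and Protein sequence fields are printed in space-separated blocks of ten characters. The sketch below, which is an illustration and not part of the record, shows one way to strip the spacing, compute GC content, and translate codons. It uses the standard codon assignments, which translation table 11 shares with table 1 (table 11 differs in the permitted start codons). Only the first 30 nucleotides of the gene are embedded, and clean, gc_content, and translate are hypothetical helper names.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the Gene sequence field above.
prefix = clean("ATGATGAATG AAACCATGGA AACTAAAAAA")
print(translate(prefix))  # MMNETMETKK, the first block of the Protein sequence
print(round(gc_content(prefix), 1))
```

Pasting the full Gene sequence field into clean should make the result of gc_content comparable to the GC content field, although that comparison has not been run here.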