Gene Information |
Locus tag | Nmar_0116 |
Symbol | |
ID | 5773206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 104553 |
End bp | 105578 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 641315736 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001581454 |
Protein GI | 161527628 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
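The length fields above follow from the coordinates by simple arithmetic. A minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length (the function names are illustrative and not part of the record):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(104553, 105578)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1026 341
```

Both values agree with the Gene Length and Protein Length fields of this record.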
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0000000347461 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAATTC TGATCATTTC TCCGACTCAA GAAGGAATTG GTGGTGTTGC TAGACATGTA CAAGGTCTTA CAAAATTTTT AAAAAATGAT GGGCATGAAG TAGATGTAAT CTCTTCTGAA AATACATTCA CAATTCCTAT ACGAAAATTA AAAAATCCTA GCTTCATGCT ATCTTCTTTT TTAAAAACAA AATTTTCAAA AAAATATGAT GTAGTACATG CTCAAAATGT TGTATCTGCG TTTGCAATGA AAAATGTTTT GGGAAAAAAA TTATTGGCAA TACACGGAAT TCATCATGAA CAAGTAGATC ATTTACATGG AAAAACTGCT GGAAATGTAG CAAAGGATTA TGAAGATAAA GCGTTAAACT GGGTTGACGC AATAACGGTT TCTTCAAAAG AAATGCTTGA TTATTACTCT CAAAAAGGAT TGAACACGTT TTTTCTCCCA AATGCATTGG ATATACAATC AATTACCAAA AAATCTAATC GAAAATTTGA CAAACAAATT GTTTATGCTG CTAGATTATC AAAAGAAAAA GGTATTCTTG AAGTATTGGA TGTTGCAGAA AAATTACCAC AAGACATTCA TCTTTTAATT TTAGGATCAG GGGTAGAAGA GAATAAAGTA AAAGAATTAT CAGAACTACA AAAAAATATT CATTTTTTAG GATATCAAAA TAGAGAAAAC ACTCTATCTA TAATTCGTGG TTCTGATTTG TTAATACAAC CATCTAGAAT GGAAGGTGGA CTAAGTTATA CTTTGTTGGA ATCTATGGCA TGTGGAACTC CAATTATATG TACTGATGTT GGTGGTGCTA AAGATACTTT ATCTCATATG AAAAATGCAT TTATTATCAA ACCTGAAAAT TCAACAGAAT TAAAAAATGC TATTAATCAA TTAATGAACA ACTCAAAACA AAGAGAGGAA CTAAAGAACA ATGCTTTGGA TGAAATCAAA AATCATGATT GGTCCGTTGT AGGGCCAAAA TATGTAGAAA TTTATCAAAA ATTACTTTCA TCATAA
|
Protein sequence | MKILIISPTQ EGIGGVARHV QGLTKFLKND GHEVDVISSE NTFTIPIRKL KNPSFMLSSF LKTKFSKKYD VVHAQNVVSA FAMKNVLGKK LLAIHGIHHE QVDHLHGKTA GNVAKDYEDK ALNWVDAITV SSKEMLDYYS QKGLNTFFLP NALDIQSITK KSNRKFDKQI VYAARLSKEK GILEVLDVAE KLPQDIHLLI LGSGVEENKV KELSELQKNI HFLGYQNREN TLSIIRGSDL LIQPSRMEGG LSYTLLESMA CGTPIICTDV GGAKDTLSHM KNAFIIKPEN STELKNAINQ LMNNSKQREE LKNNALDEIK NHDWSVVGPK YVEIYQKLLS S
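The protein sequence can be checked against the gene sequence by translating it codon by codon. A minimal sketch, assuming the amino acid assignments of translation table 11 match the standard genetic code (the tables differ only in permitted start codons) and that the spaces in the record are display formatting only; `translate` and `CODON_TABLE` are illustrative names:

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (third base varies fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAAAATTC TGATCATTTC TCCGACTCAA"))  # MKILIISPTQ
```

Passing the full gene sequence to `translate` in the same way would allow a comparison with the complete protein sequence once its spaces are removed.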