Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0120 |
Symbol | |
ID | 5774767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 109495 |
End bp | 110676 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 641315740 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001581458 |
Protein GI | 161527632 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000000000209107 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAATTG CTTTAACGTG TCCCGCATTT TTACCTGCTA CTCAGTTTGG AGGGATTTTA TTTCTATGTT TAGACATAGC AAGATTTGTT TCAAAAAACC ATGAAACAAC TGTATATACC ACAGATTTAG ATTTTGATAA CAGCGTCAAC AAATTTAATT CAGAGTTACC AAAGATTGAA AAATACGAAA AATTTGTAAT TAAAAGAAAC CATGTTTTTT TTAAAATAAA ATTATTTTTT ATAAATCCAG GATTGTTTTT TCAACTAAAG AAAGATAAGC CTGACGTAAT TCATGCCATA GGAATTAGAG GGTTTCAAGC ATTTGTTTCT GCAGTATACT CAAAAATTTA CAAAATTCCA TTATTATTAA GTGATCAAGG AGGATTACAT ACACATCCTG AATATCAAAA AGGGGCGGGT AAAATTCTAA ATAAAATTCA GGAGCCTTTA GTAAAATTTG TAATTAATCA AGCTTCACAT ATTATTGCAG CAAATGAATA TGAAAAATCA GTTTTTTTAA AATACTCAAA TGAGAAAAAA ATTACCATAG TGCATAATGG AATAGATTAC AGAAACTTTG CTGCAAATAA TATAGATTTT AAAGACAAAT ACAACATTAG TGAGTCATTT ATTTTATTTC TTGGCAGATT CACCAAAATT AAAGGGATTG ATTTACTGCT TTTATCATTC AAAAAAATTG TTGACAAGAA AAAATTTCAA GATTTAAAAC TTGTGATTTT AGGAGCCAAT TTTGGATATG AAAGAGAGAT GAATTCAATG ATAGAAAAAT TAAATTTAAA AGAAAATATT TTGGTTATAG AAAAACCTAC AAGAGGAGAG GTAATTTCAG CATATCATGC ATGTAAATTT CTTGTACTAC CATCAAGATG GGAAATGTCT CCACTAACTC CTTTGGAAGG ATTCGCTTGT AAAAAACCAA CAATTAGTAC GAATATCTTT GGAATACCAT ATGTTGTTTT GAATAACAAA AACGGTTTAC TTTTTGAACC AGAAAGTGTT GATGATTTAA AAGAAAAAAT TGAAATTTTG TTAGAAGACA AAGAATTGGT AAAAAAACTA GGAAGTAATG GTTACGAGTT TGTCAAAAAA GAATATTCTT CTGATAATAT GGGCAATCAA ATTTTGAAAC TATATGAAAA ATCTCAGAAA AAAATGGAAT GA
|
Protein sequence | MKIALTCPAF LPATQFGGIL FLCLDIARFV SKNHETTVYT TDLDFDNSVN KFNSELPKIE KYEKFVIKRN HVFFKIKLFF INPGLFFQLK KDKPDVIHAI GIRGFQAFVS AVYSKIYKIP LLLSDQGGLH THPEYQKGAG KILNKIQEPL VKFVINQASH IIAANEYEKS VFLKYSNEKK ITIVHNGIDY RNFAANNIDF KDKYNISESF ILFLGRFTKI KGIDLLLLSF KKIVDKKKFQ DLKLVILGAN FGYEREMNSM IEKLNLKENI LVIEKPTRGE VISAYHACKF LVLPSRWEMS PLTPLEGFAC KKPTISTNIF GIPYVVLNNK NGLLFEPESV DDLKEKIEIL LEDKELVKKL GSNGYEFVKK EYSSDNMGNQ ILKLYEKSQK KME
|
| |