Gene Information |
Locus tag | Nmar_0123 |
Symbol | |
ID | 5774409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 112659 |
End bp | 113699 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 25% |
IMG OID | 641315743 |
Product | glycosyl transferase family protein |
Protein accession | YP_001581461 |
Protein GI | 161527635 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
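The coordinate and length fields above are mutually consistent, and the check is simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 112659
END_BP = 113699


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1041, matches "Gene Length"
print(protein_length(length_bp))  # 346, matches "Protein Length"
```

Because the gene is on the minus strand, Start bp and End bp give the span on the replicon, not the direction of transcription; the length calculation is the same either way.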
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000000000010329 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAATACAG AAGAGCCATT AGTCAGCATT ATTATTTTAA ATTATAATGC AGGTAAATTA ATTGAAAACT GTATTGAATC AATTCATAAA AGTGATTACA AGAATTTTGA AATAATATTA GTTGATAATG TATCGACAGA TAATAGTCAA AATAAATGTA AAAAAAAATT TCCTGAAATT AAACTAATAC AAAATCAAGA AAATTTAGGA TATTGTGGAG GAAACAATAT AGGGATAAAA ACTGCCAAAG GGGAATTTAT TGTGATACTA AATCCAGATA CAATTGTAGA AAAATCTTGG TTAAAGGAAT TTCTTCAAGA GTATAAAAAA ATAGGTTTAG GCCTTTACCA ACCAAAATTA TTAGCATTAG ACGATACATC TAGAATAAAT TCTGCTGGAA ACATGATTCA AATTTTTGGT TTTGGGTATT CTTTTGGAAA GGGTGAGAAA GAAAATTCAA ATCATGATAA AAATTATCTA ATTAATTATG CTTCTGGTGC ATGCCTTTTT ACTACAAAGC AAGTATTAGA AAAAATCGGT TTCTTTGATG ATTTTTTGTT TGCATATCAT GATGATTTAG AATTGGGATG GAGAGCTAGA CAGTTAGGAA TTAAATCACA CTATGTTCCT AGGTGTGTGG TATATCATGC TGAAAGTTTT AGTTTTGGTT GGAGCAAGAA AAAATATTTT CTTTTAGAAA GAAATAGACA TTATTGTTTA CTGACACATT ATTCTAGAAA AACATTTTTT AAAATGCTAC CATCTTTGAT CATAATCGAA ATAATTGTAA TAATGTTTTA CTTATCAAAA GGAATGATAA AAGAAAAAAT TGAGGGATAT TCAAATATTT TAAAAAACTG GAATGGAATT AAGAAAAAAT ATTTAGAAAT AGAATCAAAG AAAGAAATTA AAGATGTAGA AATCATCAAA GAATTTAAAA ATCAAATTGA AATTCCAAGC ATAGTAACAG GAAGAATATA TTCAAAAAAA ATTAATTATA TTCTAAATAT CTTGTCAAAA TTTTTTATTA AAATTTTATA A
|
Protein sequence | MNTEEPLVSI IILNYNAGKL IENCIESIHK SDYKNFEIIL VDNVSTDNSQ NKCKKKFPEI KLIQNQENLG YCGGNNIGIK TAKGEFIVIL NPDTIVEKSW LKEFLQEYKK IGLGLYQPKL LALDDTSRIN SAGNMIQIFG FGYSFGKGEK ENSNHDKNYL INYASGACLF TTKQVLEKIG FFDDFLFAYH DDLELGWRAR QLGIKSHYVP RCVVYHAESF SFGWSKKKYF LLERNRHYCL LTHYSRKTFF KMLPSLIIIE IIVIMFYLSK GMIKEKIEGY SNILKNWNGI KKKYLEIESK KEIKDVEIIK EFKNQIEIPS IVTGRIYSKK INYILNILSK FFIKIL
|
| |
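The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below is a minimal illustration, not the database's own method: it strips the spacing, splits a nucleotide string into codons, and computes GC content as a percentage. The helper names `clean`, `codons` and `gc_content` are hypothetical. Only the first two blocks of the gene sequence are used here to keep the example short. Note that the gene starts with GTG, which translation table 11 accepts as an alternative start codon translated as methionine, which is why the protein sequence begins with M.

```python
def clean(seq_blocks: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq_blocks.split()).upper()


def codons(seq_blocks: str) -> list[str]:
    """Split a nucleotide sequence into complete codons, dropping any remainder."""
    seq = clean(seq_blocks)
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def gc_content(seq_blocks: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq_blocks)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence shown above.
first_blocks = "GTGAATACAG AAGAGCCATT"

print(codons(first_blocks)[:4])   # ['GTG', 'AAT', 'ACA', 'GAA'] -> M N T E
print(gc_content("GTGAATACAG"))   # 40.0 for this 10-base block only
```

Running `gc_content` on the full gene sequence, pasted as one string, gives the value to compare with the GC content field in the record.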