Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44117 |
Symbol | |
ID | 7203874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 1027730 |
End bp | 1029328 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | dolichyl pyrophosphate Glc1Man9GlcNAc2 alpha-1,3-glucosyltransferase |
Protein accession | XP_002186451 |
Protein GI | 219113735 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.449308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCGAT CATCTCTTAA TAGCGTCAGA GATACCGGCG GGGGGACATC TTCCAGCTTG ATTGTTGCAG TCGTTGCGGG GCTGGTGTTG CTTCGCGTTT TGGTTGGATA CCACCCTCAC TCGGGACAGG ACAATTACCA TGGCTTGCAC TCGGCCTATG GAGGAGACTT TGAGGCCCAG CGGCATTGGA TGGAGCTCAC ACTTCATCTC CCGGTTGGAG AATGGTACTG GTACGATCTC AGCTACTGGG GGTTGGACTA TCCTCCGATA TCAGCTTACG TTTCGTGGAT TTGTGGCTGG CTTTCGCATC GGCTCGTCGG CCCGGAATCT GTAGCTTTAG AAACTTCTCG AGGTTTCGAA AATCCGACGC ACAAGGCATT CATGCGGTCA ACGGTGATCG TGCTGGATCT TCTCGTTTAC GGGACGGCTG TCTGGTTTTG GACAATGCAT CGCCAATACG ACCGCAATCT TCCCGACTCG ACGCGATTGT GGAGATTTGC TCTCGCCATG TCGCAACCGG CAATTCTACT CATCGACCAC GGCCACTTCC AATACAATAC AACAGCACTG GGACTATCAC TATGGGCATT TTATTTCATG ACGTTGCCAG ACTTTTTTTA TTGCATGATT GGCTCATTCA TGTTTTGTGC AGCGCTTTCC TTTAAACAAA TGACGCTCTA CTACGCCCCA GCCGTATTTT TTTACCTGCT TGGACGGTGT TTCACAACCC GCGGTCGTTT CCTGGTACAG CGCTTCTATC TTTTAGGGAT GACCGTTGTT GCTACGACAT TTGCCTTGTG GTGGCCCTTT GTTGCGTTTG GTCCGGAAGG CACATCTCAC ATCGAAAGAG CTGCGCACGT CTTTCGGCGT ATTATTCCCT TACAACGGGG TCTGTTTGAA GGCAAAGTGT CCAATCTGTG GTGTGTCCTC TCGCTCAAGC CAATCCGGAT CCGCAAACTC ATACCCTCAC AGCTGCAGCC GCTTGCGGCC CTACTCCTAA CGTTGATATT CGCGGCACCT GCCTGCTATC GACTCTTTCG ACTAGGACAG AAGCAACAGC AGGACAATGA ACAGCATCAG GGGAAACTGA TTCTATACGG AGCCACTAGT AGCGCTCTCG CATTCTTCCT CGCAAGCTTT CAAGTGCATG AAAAGAGTTT GCTTTTGGCT CTGGGCCCCG CTTCTCTGTT GTTTTTTGAC GATGCGACCT TTGTGCAATG GTTCTCCGTC ATTGCAGCAT GGACACTATG GCCATTGTTG GTTGTTGATC GTCTGCAAGT CGCATACACC TGTATTATGA TTATTTTCAT TATGTTACAG CGATTGCTAC ACTCGCTACA ACAAGGATCG ACAGCATCTA CATCGGGATT CTTGGAACAT TTACCACTAT TGCGATGGGT TCCGCATTTT TCTGGATTGG TGATGCTGCT TTTGCACTTT ACGGAATTGG GTGTGACGAT TCCGCCCCAC TTACCGGATG TCTTCTCTGT ATTGTGGAGT ATCGCTGGTT GCGCGTTTTG TTCGTTGGCA TGGCTCGCTT CTTGTTGGCA TCTCTATGGC GCAAGTGAAC CTAACGTGAA AGGGACCAAG TTTGATTAG
|
Protein sequence | MSRSSLNSVR DTGGGTSSSL IVAVVAGLVL LRVLVGYHPH SGQDNYHGLH SAYGGDFEAQ RHWMELTLHL PVGEWYWYDL SYWGLDYPPI SAYVSWICGW LSHRLVGPES VALETSRGFE NPTHKAFMRS TVIVLDLLVY GTAVWFWTMH RQYDRNLPDS TRLWRFALAM SQPAILLIDH GHFQYNTTAL GLSLWAFYFM TLPDFFYCMI GSFMFCAALS FKQMTLYYAP AVFFYLLGRC FTTRGRFLVQ RFYLLGMTVV ATTFALWWPF VAFGPEGTSH IERAAHVFRR IIPLQRGLFE GKVSNLWCVL SLKPIRIRKL IPSQLQPLAA LLLTLIFAAP ACYRLFRLGQ KQQQDNEQHQ GKLILYGATS SALAFFLASF QVHEKSLLLA LGPASLLFFD DATFVQWFSV IAAWTLWPLL VVDRLQVAYT CIMIIFIMLQ RLLHSLQQGS TASTSGFLEH LPLLRWVPHF SGLVMLLLHF TELGVTIPPH LPDVFSVLWS IAGCAFCSLA WLASCWHLYG ASEPNVKGTK FD
|
| |