Gene Information |
Locus tag | PICST_29880 |
Symbol | ALG6 |
ID | 4837584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 1409541 |
End bp | 1411100 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640388899 |
Product | glucosyltransferase required for N-linked glycosylation pathway |
Protein accession | XP_001382493 |
Protein GI | 150863870 |
COG category | |
COG ID | |
TIGRFAM ID | |
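
The coordinate and length fields above agree with one another, and a short sketch can check this. It assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1409541, 1411100)
print(length_bp, protein_length(length_bp))  # prints: 1560 519
```

Both results match the Gene Length (1560 bp) and Protein Length (519 aa) fields.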
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.060757 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.449956 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCAAAGA CTAAGAAGAA GGCCAAGGGC TCCGGTAGCC AGTCCAGCAC TCATCACCAC CATCAAGATT CGTCTACTTC ACATTCCATC TTTAAAAACT CACCCGTTCA TGACCTTTTG CACAACTTTG AAAAGGCTCC AGACCAATGG GCTGCCAGAT ACGTGTTGGT TATGACAGCC ATATTGCTCC GTGCCGCTAT AGGACTCGGT GGCTATTCCG GAAAGGCCAC TCCGCCGATG TTTGGAGATT TTGAAGCTCA AAGACACTGG ATGGAGTTGA CGATTCATCT TCCAATTTCA CAGTGGTACT GGTTTGACTT ACAATACTGG GGTTTGGACT ATCCGCCTTT GACAGCTTAC CATCTGTATA TAATCGGGAA GATAGGGAGC TTCATCAATC CTGACTGGTT TCTGTTGAAT GCTTCACGTG GAATAGAAGG AAGCGACATC AAGTTCTTCA TGAGATTCAT GAGTTTGGTC AGCGAACTCG TTCTCTACAT CCCAGCAGTT TTAACGTTAG CCAATTTGAT GGGTAAGAAG TTCAACTTGA GCCGAATGGA CCAGATCATT ATCTCGTTAT TGACAATTAA CCAGGCCCAT CTTGTGTTGA TAGATCATGG TCATTTCCAG TTCAACTCGG TGATGTTGGG TTTCTTCATC TACGCCATGA TAGAGCTTAT AAATTCGAGC TATGTTATCG CCAGTGTATG GTTCATTGGT TGCATCAACT TTAAGCAGAT GGGCTTGTAC TACTCGACAT TTATTTTCGT GTTCATCCTA AGCCAACTCA AGAGCTTTGG CCAACTTGTA GGAGTAGGTG TAACTGTGAT TCTTTCACAA GCTGTCGTAT TATCACCATT CATCTCTGAC CCTAAACAAG CACTCCAGAT CCTTTACAGA GTGTTTCCCT TTAACAGGGG CTTGTTTGAA GACAAGGTCG CCAACTTCTG GTGTACCACC AATGTCCTAG TCAAGTACAG AGAGATCGTA GCTCCCCAGA CATTGTCCAA AATGGCCCTC ATTACAACTG TGCTATCGAT TTTGCCAATG AACATCTTGT TGTTCATCAA GTTGAGAAAG ACCAAAAACG TTATTCCTGG CTTGATCTAC GGATTCGCCG GCAATTCGTT AGCATTCTAC TTATTTTCGT TCCAAGTTCA CGAAAAGAGT ATCTTGATTC CATTGGTTCC ATCTACGTTG TTGCTACTTG TCGATCCTTC GCTCATAGAC ATCGTGCAAT GGATCAACAA CGTCGGGACG TTCAGTCTCT ATCCGTTGTT GAAGAAGGAC GACTTGGTTC TACAGTACTT TGTCAGCAAC TTCTTGATCA ACTGGTTGAT TGGCCGCAAG TTGCTTATGA AGAGTAGAAG TATGGTGTGG GACTTGATTA TCAAGGGCAG TTACTTGCTG TTAGTCGTAT ATCATATCAT CGACTATACT TCAGATCCGC CCGCACGTTA TCCCGATTTG TGGGTGATTC TTAACATCAG CATTTCATTC GCAGCCTTTG CCTTGTTCTG GTTGTGGTTG AACTTCCGCA TCTACAAGTT GAAGGTCTAG
Protein sequence | MAKTKKKAKG SGSQSSTHHH HQDSSTSHSI FKNSPVHDLL HNFEKAPDQW AARYVLVMTA ILLRAAIGLG GYSGKATPPM FGDFEAQRHW MELTIHLPIS QWYWFDLQYW GLDYPPLTAY HSYIIGKIGS FINPDWFSLN ASRGIEGSDI KFFMRFMSLV SELVLYIPAV LTLANLMGKK FNLSRMDQII ISLLTINQAH LVLIDHGHFQ FNSVMLGFFI YAMIELINSS YVIASVWFIG CINFKQMGLY YSTFIFVFIL SQLKSFGQLV GVGVTVILSQ AVVLSPFISD PKQALQILYR VFPFNRGLFE DKVANFWCTT NVLVKYREIV APQTLSKMAL ITTVLSILPM NILLFIKLRK TKNVIPGLIY GFAGNSLAFY LFSFQVHEKS ILIPLVPSTL LLLVDPSLID IVQWINNVGT FSLYPLLKKD DLVLQYFVSN FLINWLIGRK LLMKSRSMVW DLIIKGSYLS LVVYHIIDYT SDPPARYPDL WVILNISISF AAFALFWLWL NFRIYKLKV
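
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The following is a minimal sketch of how the gene sequence maps to the protein sequence under that table. The names `translate`, `STANDARD`, and `TABLE_12` are hypothetical helpers written for this illustration, not a database or library API, and the sketch handles only the unambiguous bases A, C, G, and T.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

STANDARD = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), STANDARD_AA)}
# Table 12 differs from the standard code in its sense codons only at CTG.
TABLE_12 = dict(STANDARD, CTG="S")


def translate(dna: str, table: dict = TABLE_12) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCAAAGA CTAAGAAGAA GGCCAAGGGC"))  # MAKTKKKAKG
# A CTG-containing stretch from the gene sequence, under both codes.
print(translate("CATCTGTAT"))            # HSY
print(translate("CATCTGTAT", STANDARD))  # HLY
```

The second example corresponds to the HSY residues in the protein sequence above. Translating with the standard code would give leucine at that position instead.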