Gene PICST_29880 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_29880 
SymbolALG6 
ID4837584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1409541 
End bp1411100 
Gene Length1560 bp 
Protein Length519 aa 
Translation table12 
GC content44% 
IMG OID640388899 
Productglucosyltransferase required for N-linked glycosylation pathway 
Protein accessionXP_001382493 
Protein GI150863870 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.060757 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.449956 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAGA CTAAGAAGAA GGCCAAGGGC TCCGGTAGCC AGTCCAGCAC TCATCACCAC 
CATCAAGATT CGTCTACTTC ACATTCCATC TTTAAAAACT CACCCGTTCA TGACCTTTTG
CACAACTTTG AAAAGGCTCC AGACCAATGG GCTGCCAGAT ACGTGTTGGT TATGACAGCC
ATATTGCTCC GTGCCGCTAT AGGACTCGGT GGCTATTCCG GAAAGGCCAC TCCGCCGATG
TTTGGAGATT TTGAAGCTCA AAGACACTGG ATGGAGTTGA CGATTCATCT TCCAATTTCA
CAGTGGTACT GGTTTGACTT ACAATACTGG GGTTTGGACT ATCCGCCTTT GACAGCTTAC
CATCTGTATA TAATCGGGAA GATAGGGAGC TTCATCAATC CTGACTGGTT TCTGTTGAAT
GCTTCACGTG GAATAGAAGG AAGCGACATC AAGTTCTTCA TGAGATTCAT GAGTTTGGTC
AGCGAACTCG TTCTCTACAT CCCAGCAGTT TTAACGTTAG CCAATTTGAT GGGTAAGAAG
TTCAACTTGA GCCGAATGGA CCAGATCATT ATCTCGTTAT TGACAATTAA CCAGGCCCAT
CTTGTGTTGA TAGATCATGG TCATTTCCAG TTCAACTCGG TGATGTTGGG TTTCTTCATC
TACGCCATGA TAGAGCTTAT AAATTCGAGC TATGTTATCG CCAGTGTATG GTTCATTGGT
TGCATCAACT TTAAGCAGAT GGGCTTGTAC TACTCGACAT TTATTTTCGT GTTCATCCTA
AGCCAACTCA AGAGCTTTGG CCAACTTGTA GGAGTAGGTG TAACTGTGAT TCTTTCACAA
GCTGTCGTAT TATCACCATT CATCTCTGAC CCTAAACAAG CACTCCAGAT CCTTTACAGA
GTGTTTCCCT TTAACAGGGG CTTGTTTGAA GACAAGGTCG CCAACTTCTG GTGTACCACC
AATGTCCTAG TCAAGTACAG AGAGATCGTA GCTCCCCAGA CATTGTCCAA AATGGCCCTC
ATTACAACTG TGCTATCGAT TTTGCCAATG AACATCTTGT TGTTCATCAA GTTGAGAAAG
ACCAAAAACG TTATTCCTGG CTTGATCTAC GGATTCGCCG GCAATTCGTT AGCATTCTAC
TTATTTTCGT TCCAAGTTCA CGAAAAGAGT ATCTTGATTC CATTGGTTCC ATCTACGTTG
TTGCTACTTG TCGATCCTTC GCTCATAGAC ATCGTGCAAT GGATCAACAA CGTCGGGACG
TTCAGTCTCT ATCCGTTGTT GAAGAAGGAC GACTTGGTTC TACAGTACTT TGTCAGCAAC
TTCTTGATCA ACTGGTTGAT TGGCCGCAAG TTGCTTATGA AGAGTAGAAG TATGGTGTGG
GACTTGATTA TCAAGGGCAG TTACTTGCTG TTAGTCGTAT ATCATATCAT CGACTATACT
TCAGATCCGC CCGCACGTTA TCCCGATTTG TGGGTGATTC TTAACATCAG CATTTCATTC
GCAGCCTTTG CCTTGTTCTG GTTGTGGTTG AACTTCCGCA TCTACAAGTT GAAGGTCTAG
 
Protein sequence
MAKTKKKAKG SGSQSSTHHH HQDSSTSHSI FKNSPVHDLL HNFEKAPDQW AARYVLVMTA 
ILLRAAIGLG GYSGKATPPM FGDFEAQRHW MELTIHLPIS QWYWFDLQYW GLDYPPLTAY
HSYIIGKIGS FINPDWFSLN ASRGIEGSDI KFFMRFMSLV SELVLYIPAV LTLANLMGKK
FNLSRMDQII ISLLTINQAH LVLIDHGHFQ FNSVMLGFFI YAMIELINSS YVIASVWFIG
CINFKQMGLY YSTFIFVFIL SQLKSFGQLV GVGVTVILSQ AVVLSPFISD PKQALQILYR
VFPFNRGLFE DKVANFWCTT NVLVKYREIV APQTLSKMAL ITTVLSILPM NILLFIKLRK
TKNVIPGLIY GFAGNSLAFY LFSFQVHEKS ILIPLVPSTL LLLVDPSLID IVQWINNVGT
FSLYPLLKKD DLVLQYFVSN FLINWLIGRK LLMKSRSMVW DLIIKGSYLS LVVYHIIDYT
SDPPARYPDL WVILNISISF AAFALFWLWL NFRIYKLKV