Gene PICST_35844 details


Gene Information

Locus tag: PICST_35844
Symbol: GLP1
ID: 4838574
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 510967
End bp: 512178
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 12
GC content: 45%
IMG OID: 640389889
Product: Glycerophosphodiester phosphodiesterase
Protein accession: XP_001384392
Protein GI: 150865251
COG category: [C] Energy production and conversion
COG ID: [COG0584] Glycerophosphoryl diester phosphodiesterase
TIGRFAM ID:
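
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Neither convention is stated in the record, but both are consistent with the listed values. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 510967
end_bp = 512178

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# The gene is listed as unspliced, so the whole span is coding sequence.
# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # matches the "Gene Length" field
print(protein_length, "aa")  # matches the "Protein Length" field
```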


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.313147
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATCCG ACTCCCTCGA CACATATACA TCTCCGGTCA TAGCGGGCCA CAGAGGATTC 
AAGGGCGAAT ACCCCGAGAA CACCCTTACG GGATTCAACA AGTGCTATGA AACCGGGGCC
ACGGTGATAG AAACAGACCT TTGGCTCACC CTCGACGAAG TAATTGTCAT CTCCCATGAT
CCCAATACGA AGAGAGTGTT TGTAGATTCC GAGGGTAATG AAACTGACTA CAACATTCCT
AAGACTAGCT ACGAGGAGGT GTTGAAGTAC TTGAAGACAA AAGAAGGTGG AGAACCGCTT
CTAACTTTCC GCGAAGTGTT GCAGTGGTTC GTAGACTATG TGAGCGAATC CAGATCTAAC
ATCCACAAGT TGATGTTGGA TATCAAGCGT CTTAATCCTG CCAAAGTGTT GAAGTTCATC
ATTGGCGATC TCCTTGCCGT CAACAACGAC ATCTCCTGGT GGTTCCACCG TATCCAGTTG
GGTGTATGGG ATTTAAATGT CGTCAAATAC ATGAACCAAG ACGAGTTCTT CCAGAGTTTA
GTCAAGAATT CTCACGGAAA GAATCCCTTG GGCTGGGTCT GGTTCGACGT GTTCCATATT
TCAGTATCGT GGAGAGACTC CATCCACTAC ATAAACTACA ATTTCTACCT TGATACACTC
AAGGATGAGG ATAGCAAGAC CGGAATTGTC CGGTTCAAGG TAACAGGAAT TTCTTTGCTC
TACTTCCTGA CGTGGTCAAC AGGATTTCTC ACCAAGTTCT TACCGTTGCT TCGTATCCAG
CGCTTGAAAT TATACTCTTG GACGATTAAT ACAGCGGTTC AGTATGACTT CTTGAGCAAG
GTCGGGAAAG TAGCTGATTT GCCAGAGTAC GGTGTCATTT CTGACTATCC GGACCAGATG
GTGAAACACA AAGAGGATGA AGAAAGAAAG GAAGAATTCG AAAAGAACTC TGTTGACGAA
TTATCGAGGT TGACTCCTTC CTCTACTGAT TACTACGACG AGGATGGCAA TTTGTCGGTG
AAGCTAACAT TCAGAATGAA ATTCGGAAAC TACTTCTACG AAAGTTTTCA GGCTTTGGCG
GGATCAAAGA GAATTACAGA TGAGGAGAAG CAGTTTGATC TTGAGGTTGA TGAGAATAAG
GTCGCTGTTG TCAAGGTTAG CCAACTTTTC ATCTGGGTTT TTTCTACGTG CCAGAAGTTG
GGTATTTTCT GA
 
Protein sequence
MTSDSLDTYT SPVIAGHRGF KGEYPENTLT GFNKCYETGA TVIETDLWLT LDEVIVISHD 
PNTKRVFVDS EGNETDYNIP KTSYEEVLKY LKTKEGGEPL LTFREVLQWF VDYVSESRSN
IHKLMLDIKR LNPAKVLKFI IGDLLAVNND ISWWFHRIQL GVWDLNVVKY MNQDEFFQSL
VKNSHGKNPL GWVWFDVFHI SVSWRDSIHY INYNFYLDTL KDEDSKTGIV RFKVTGISLL
YFSTWSTGFL TKFLPLLRIQ RLKLYSWTIN TAVQYDFLSK VGKVADLPEY GVISDYPDQM
VKHKEDEERK EEFEKNSVDE LSRLTPSSTD YYDEDGNLSV KLTFRMKFGN YFYESFQALA
GSKRITDEEK QFDLEVDENK VAVVKVSQLF IWVFSTCQKL GIF
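
The record lists translation table 12, the alternative yeast nuclear code in NCBI numbering, which reads CTG (CUG) as serine rather than leucine. The sketch below shows how that affects this gene. It builds the standard code from the usual TCAG-ordered amino acid string, derives table 12 by changing the single CTG entry, and translates a 30-nucleotide in-frame fragment copied from the gene sequence above (the row beginning TACTTCCTGA). The function and variable names are illustrative and do not come from any particular library.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... (first base slowest).
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(amino_acids):
    """Map each of the 64 codons to its one-letter amino acid."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, amino_acids))


STANDARD = build_table(STANDARD_AAS)
# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
YEAST_ALT = dict(STANDARD, CTG="S")


def translate(dna, table):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces used for 10-base grouping
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# In-frame fragment from the gene sequence above; it contains one CTG codon.
fragment = "TACTTCCTGA CGTGGTCAAC AGGATTTCTC"
print(translate(fragment, YEAST_ALT))  # YFSTWSTGFL, as in the protein sequence above
print(translate(fragment, STANDARD))   # YFLTWSTGFL under the standard code
```

Only the table 12 translation matches the listed protein sequence at this position, so a tool that defaults to the standard code would report a leucine where the record has a serine.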