Gene PICST_51972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_51972 
SymbolGBD1 
ID4851102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp939116 
End bp940201 
Gene Length1086 bp 
Protein Length361 aa 
Translation table 
GC content43% 
IMG OID640392810 
Productflavonol synthase 
Protein accessionXP_001387413 
Protein GI126274088 
COG category[R] General function prediction only 
COG ID[COG3491] Isopenicillin N synthase and related dioxygenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.842126 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTCC TGTCCAACAT AAGACCTTGG CAATCGCCAG AAGAAACTCA GGAAGATCTA 
GACTGGGCAA ATCTTACCAT TATTGACTTG GCCCACTTTG ATGAACCTGG CCAGAAGCAA
ATTCTTGCCA ATCAGCTAAA GGAGGCTCTG AACTCAGATG GTTTCTGGGC TGTGATTAAC
GGAGGATTTG ACCAAAATGA TATCAATGAA GCATTTGCCT ATGGAAGAAG TTTCTTCGAA
GATTACACTG AAGAAGAGAA GAAAGCGTTG GAAGTAGATT TCACCACTGG AAACTATTTT
GGCTACAAAG TCCGTGGAAA CAAGCCTGTT TTTGGCACCC AAGTCAGAGA CAACACCGAA
ACCCTCAACA TTGCCAAATT CACCAAAGAT GACACATTTG CCGAATACCA CAAGAACAAC
TTCATCCAAA ATAACCATGA TAAGTTAGCC CAACTCTCTC GGAAGGTGTT TGAAGTTGCC
CGTAAGTTAT TTATATTGTT TGCCATTATT CTCGAACTAG ATGAGAATTA TTTTGTTGAT
CGTCATCTTT ATGACGATCC CAGTGACGAT CTGCTTCGGT TCATGAAATA TCACCCAAGA
ACAAAAGAAG AAGACGCTCA GGTAGAGAAC ATATGGGCAA GAGCACATAC TGACTTTGGG
AGTTTGACCC TATTGTTCAA CCAGGTGGTA GCTGGCCTAC AGATCAAGTT GGCTGACGGA
GAATGGAAGT ATGTCAAACC GGTCACTGGT GGACTTATCT GCAACATCGG AGATACTTTA
AATTTCTGGT CTGGAGGATA TTTCAAGACC ACTATTCACA GAGTAGTGAG ACCTCCTGAA
GATCAGGTTA ATGCACCTAG AATTGGAGCC TTCTATTTCG TTCGTCCAGG AGACAAAGCC
CAAATACAAA TTGCCCCATC TCCGTTATTG AAGCGTTTAG GGTTATACAG AGAAACCGAA
CCTATTGGTG GTACGGAATA CGTGAGAAAG AGAGTCAAGG ATTACCATGA CGTGAAAGGT
TATAATAAGC AGGCCGACAA GGTATTCAAG TTGGGAGAGT TTGAGGTTAT TGACGGTTTT
AATTAG
 
Protein sequence
MTVLSNIRPW QSPEETQEDL DWANLTIIDL AHFDEPGQKQ ILANQLKEAL NSDGFWAVIN 
GGFDQNDINE AFAYGRSFFE DYTEEEKKAL EVDFTTGNYF GYKVRGNKPV FGTQVRDNTE
TLNIAKFTKD DTFAEYHKNN FIQNNHDKLA QLSRKVFEVA RKLFILFAII LELDENYFVD
RHLYDDPSDD LLRFMKYHPR TKEEDAQVEN IWARAHTDFG SLTLLFNQVV AGLQIKLADG
EWKYVKPVTG GLICNIGDTL NFWSGGYFKT TIHRVVRPPE DQVNAPRIGA FYFVRPGDKA
QIQIAPSPLL KRLGLYRETE PIGGTEYVRK RVKDYHDVKG YNKQADKVFK LGEFEVIDGF
N