Gene PICST_30002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30002 
SymbolBUD6 
ID4837319 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1712077 
End bp1714146 
Gene Length2070 bp 
Protein Length689 aa 
Translation table12 
GC content39% 
IMG OID640388634 
Productbud site selection protein 
Protein accessionXP_001383093 
Protein GI150864325 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.180708 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCTT CGTCCTCTCG CTCAAATAAC CGACGCCACT CAATGAATTC GATTGAATCT 
AGTGTTACTC GATTGCTAGT TTCTACGAAA CATCTTCTTG AGAGTCTAAC TCAATGGGCG
AGACAAGAGG CGGATGATAA ATACGTTTCT GATGCATATG TTAAATTGGG CAATAACTTT
CGAGCTGCAA CTAAAGCATT TACAAATAGC GGAGTAGACA TCTCAGACCT AGGAGATGTG
CCTCAAGCAT TGAGAATCAT CTTGGAAGCA GCGTTGAGTG AGGCACCCAG CCAAGAGAAT
TTAGATAGAT TCTTACCCAA TATCAGGAAT ATTATTGTTA GCTTGTTGCT GAATTTAAAA
GCAAAACAAG CAAAGGCGGA AGCAATCGCG AAGTCAAAAC GATTAGTTTC CGAACCCCCT
GCTTTTCATG GATCTGATTC CACCACAACG GCATCTTCGT CTTCAATACC AGATCTCGAT
TCTGCAACCT CTACTGCAAA GGCAGAGTAT CCAGATACTC AACCCAACGA TGCCTCTGAT
TTTAAGAAAA ATTTGGCCGT GAATGATGCG TTAACGCAAT TGCAGAAGGG TAATGTTCTC
CAACGCCGTG CTTCTAAACG TTTCAGTGCA TATCAGTTTG CGAAGCTAAC AAACGTATCC
CATAACACAG CTTTGCCAAG ATTAACACAA GAGTACACAA GTTCAAATGA GGATCAGAAG
CTCCCTGAAG ATGACGAAAT AGACGAAATA GAAAGGCCCA TTATGTCTAA TGTGGACATT
TCATTAGTAA ATATTTTCCT CAAAATCGGC AGTAAAACTA AAAAAGCTAA AATTTCATTG
CCAGTTTCCC TTGCTTCCGT CAGACTTCTA TTCGTGGAAA AATTTGCTTA CTCTCCAGGA
TCTTCCTCAT TTCCAGATAT ATATGTTCAA GATCCGAACA GTAACATTTC GTACGAACTC
GAGGATCTTG AAGATGTTAA AGATGGTACA TTGCTCAGTC TAAATGATCC TGATTCGCAG
GCATTTCTAA TAAAAGGTCT TGATTCAAAA GTTCAGTCAC TTTCAGCGAA GTTAGACTTG
ATGAGCTTGG AAATTCGCAA TCAGATTAGA GAAGAAGTTG GAGCTATAGA AATTCCGACA
CCTACTGTTA TTGCAACCCC GGTTTCTGAT ATTTCCAACC AGAAGGAAGT AAAATCTTTA
GACTCATCTG GAGCAAAGAA GGAATTGATT GCAATTGAAC AGGAATTGAA ATCAATTAGA
CAAATTCAAC GCGTCTCAAG TGATCAAGTA AGGCTGATTG TCAACGATCT AGTGCTGGCT
TCCAAGGCTG TTCATGACAA TGTACAGTCA CACAATGATT CTAACAGAGT ATATATGGAT
GCTTGTCACT CAAAATTGTC AGAAGAGTCT GATATCTTGT TGACTAAAGT TGATGACTTG
CAGGATGTAA TGGAAGCCCT AAGAAAGGAT GTTGCTCAGA GAGGTGTTAG AGTAGGTGAA
AAACAGCTTA AATCAACTCA AAAGGAGATA GATGACGCAA AAGTTTCGCT TCAGCTGATG
TCAGAATATA TTTCTCGAGA AAGAGTAGTA TGGAAAAAGA TATGGGAATC AGAACTTGAT
AAAGTCTGCG AAGAACAACA GTTTTTTAAT CTACAAGATG ATTTGACGCA GGATCTAGAA
GATGATATTA GGAAAATAGA GGAAACATAC AAGTTGATTG AGAAATGCTC TTTGGAGCAA
ATCAAACAAT CATCCTCCAA AAGAAATAGA GTAGCTGCTC AGTTGTATAT ACCTGAGCCT
GGAGAGAGCC TTCACAGTAT TAAGGATGCA GTTCTCAGCG AAGTTGCTGC ATTGAGACCA
AATCACGAAA GCAGAGTCTT GGCCATTGAG AGAGCTGAAA AATTAAGAGA AAGAGAGAGG
GAATTATTGA AGCTAAATAA ATTTCAAGAA GAACTAGGTG ATTTTGTTGT GGACAACAAA
CTAAAAAAGT CAGGAGGTAT TGAAGAGTTA GAAAAACAAA GACAGCTTAA GGACAGCGAA
AACTTAAAGA GTTCATTTGG GATCATTTAA
 
Protein sequence
MSSSSSRSNN RRHSMNSIES SVTRLLVSTK HLLESLTQWA RQEADDKYVS DAYVKLGNNF 
RAATKAFTNS GVDISDLGDV PQALRIILEA ALSEAPSQEN LDRFLPNIRN IIVSLLSNLK
AKQAKAEAIA KSKRLVSEPP AFHGSDSTTT ASSSSIPDLD SATSTAKAEY PDTQPNDASD
FKKNLAVNDA LTQLQKGNVL QRRASKRFSA YQFAKLTNVS HNTALPRLTQ EYTSSNEDQK
LPEDDEIDEI ERPIMSNVDI SLVNIFLKIG SKTKKAKISL PVSLASVRLL FVEKFAYSPG
SSSFPDIYVQ DPNSNISYEL EDLEDVKDGT LLSLNDPDSQ AFLIKGLDSK VQSLSAKLDL
MSLEIRNQIR EEVGAIEIPT PTVIATPVSD ISNQKEVKSL DSSGAKKELI AIEQELKSIR
QIQRVSSDQV RSIVNDLVSA SKAVHDNVQS HNDSNRVYMD ACHSKLSEES DILLTKVDDL
QDVMEALRKD VAQRGVRVGE KQLKSTQKEI DDAKVSLQSM SEYISRERVV WKKIWESELD
KVCEEQQFFN LQDDLTQDLE DDIRKIEETY KLIEKCSLEQ IKQSSSKRNR VAAQLYIPEP
GESLHSIKDA VLSEVAALRP NHESRVLAIE RAEKLRERER ELLKLNKFQE ELGDFVVDNK
LKKSGGIEEL EKQRQLKDSE NLKSSFGII