Gene PICST_31740 details

Gene Information

Locus tag: PICST_31740
Symbol:
ID: 4838701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 1566321
End bp: 1567697
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 12
GC content: 47%
IMG OID: 640390016
Product: predicted protein
Protein accession: XP_001384255
Protein GI: 150865153
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
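
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon of an unspliced CDS. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1566321, 1567697)
print(length_bp)                  # 1377
print(protein_length(length_bp))  # 458
```

Both values match the Gene Length and Protein Length fields of this record.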


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGCCC TACTCTTGGA TAGAGCCTTT GGTAAGGTAG CACCGCTTGA GTTTGCTACG 
ACCGTAACGG AGTCGTACTA TTCTTTACTC CAACAGAACG CACTTACACG CGTGTTTCCT
CCCAATTGTC ATACAAATGC AGCTATAAAC AGTCTCTCGT TAGAAACGGA AGACTACCAG
TACCTTTTGA GCGGATGTGC TGATTCTTCC ATCAAGCTCT GGGACCTAAA TTCACAACTG
GAGATAGACA ATGGTTCCAG CACCATCCAC CAGGATTTAA ACAAACAGCA TTCGGACTAC
GATATTTACG ACTACGATCA TCCGGTCCAG ACTTTTACCA ACATTGCTAC AGTTCCCCGG
AAGTCTGCCC ATACATTTGG TATTTCTGCC ATTCAGTGGT GGCCGTACGA TACAGGGATG
TTTGTTCTGG CCAGTTTTGA TCACACTGTG AAAATATGGG ATACCAATGA ACTCACACCG
GTACACTCTT TCGATGTTAC CAATCGGGTA TATGCCATCG ACCTCTCGGG AAGCGAGTCA
CCGAATGGCT TTTCTTCCTC GGCTTTGGTA GCTGTAGGCA GTGACCAACC ATTCATTCGG
CTCTTGGACT TGCGATCTAC TTCAAGTGCC CATACGCTCA CAGGTCACAA GGGGAAGACG
TTGGCTGTCA AATGGCATCC GCTCAATCCT AACTTACTTC TGTCTGGAGG ATTTGACGGT
GAAGTCAAGA TTTGGGATAT CAGGCGAAGC AAGAGTTGCC TTTGCCGCTT GGATATGCTC
CGTACCAACA ATCAAGCAGA CAGTGCAGAT AATCTTGCTA AAGCCTCGGT CAAAGCCCAT
CTGGGTCCTG TCAATGGTCT CGTCTGGAAT GAACAGGGTA CAGAGCTATA TACTGCTGGT
AACGACGACA AGGTGCGAGT CTGGGACATG ATTTCCTCTT TGGCTCCACC TATCAATAAA
TTGGTCAACT TTGGGCCATT GACACGAAAC AAGTATCCCC AGACTATCCC CATTATGCTT
AACCCCAGCT ATGAGACCGA GTTGCAGTAT TTATTATTTC CCTCTGATAA TAGCGACTTG
TTTGTATTCA GAACTGTTGA CGGCAAGATG GTTTCGCGAT TATCTAGAAA AGGCACCAAG
AACAGCGGTA GGACATGTTC TATGGTTAAT GCAGGGCCAT TTACAGGGAA GTATTATTGT
GGGACAATTG ATGGAGAAAT CATCGCCTGG TCGCCGCATT GGGAACAGCC CAATATTGAG
GATTTAGTCG AGGACACGAA CGAGGTGGAT GTTCAAGATG TCTTATCCAA GCGAAAGTTG
GCTGAAGAAG CTCGACGCAA CCTTGAGGAC GATCCCTACT TTAATGGCGA ACCGTAG
 
Protein sequence
MQALLLDRAF GKVAPLEFAT TVTESYYSLL QQNALTRVFP PNCHTNAAIN SLSLETEDYQ 
YLLSGCADSS IKLWDLNSQS EIDNGSSTIH QDLNKQHSDY DIYDYDHPVQ TFTNIATVPR
KSAHTFGISA IQWWPYDTGM FVSASFDHTV KIWDTNELTP VHSFDVTNRV YAIDLSGSES
PNGFSSSALV AVGSDQPFIR LLDLRSTSSA HTLTGHKGKT LAVKWHPLNP NLLSSGGFDG
EVKIWDIRRS KSCLCRLDML RTNNQADSAD NLAKASVKAH SGPVNGLVWN EQGTELYTAG
NDDKVRVWDM ISSLAPPINK LVNFGPLTRN KYPQTIPIML NPSYETELQY LLFPSDNSDL
FVFRTVDGKM VSRLSRKGTK NSGRTCSMVN AGPFTGKYYC GTIDGEIIAW SPHWEQPNIE
DLVEDTNEVD VQDVLSKRKL AEEARRNLED DPYFNGEP
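
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The following minimal sketch shows how that affects translation of this gene. It builds the standard codon table and overrides only CTG; alternative start codons are ignored. The helper names `codon_table` and `translate` are illustrative, not taken from any library.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg_as_serine: bool = True) -> dict:
    """Return a codon -> amino acid map; optionally apply the table 12 CTG change."""
    codons = ("".join(triplet) for triplet in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AA))
    if ctg_as_serine:
        table["CTG"] = "S"  # the only sense-codon difference in table 12
    return table


def translate(dna: str, ctg_as_serine: bool = True) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    table = codon_table(ctg_as_serine)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = table[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Bases 211 to 240 of the gene sequence above, which end in a CTG codon.
fragment = "ATCAAGCTCT GGGACCTAAA TTCACAACTG"
print(translate(fragment))                       # IKLWDLNSQS
print(translate(fragment, ctg_as_serine=False))  # IKLWDLNSQL
```

With the CTG override the fragment reproduces residues 71 to 80 of the listed protein sequence (IKLWDLNSQS); under the standard code the last residue would be leucine instead.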