Gene PICST_33508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33508 
Symbol 
ID4840803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp824548 
End bp825924 
Gene Length1377 bp 
Protein Length458 aa 
Translation table12 
GC content46% 
IMG OID640392118 
Productpredicted protein 
Protein accessionXP_001386351 
Protein GI150866682 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAACA CTACGTCTAT AGCCATGCCG CATGATCCCC AGCCTCCTAC AGCAGCTACT 
CCATCTACAA TTTCCACATC GCTGGCCATC GCTCCTCCTC ACACTGGACC TTCGGTGCTC
CACTCTCGGA GCATTAAGCA CCAGTGGTCT GGAGCTGCGA GCTTGTTGAA TCCATTGCCC
GCTGACGATG TTCGTAGTGG GTCTGAAACT GTAGAAACTG CTGTATCTGT AGTACAAGCT
GATGCTCACA GCGCCGGATC GATAGAGGCT GTAGCTGCTG CACGCTCGCT AGAAGCACAA
GAGGTTGCCC AAGCTGTTGA GTCTGTAGAT GCTGAAACAG TTGATATCAA CGGCGAAAGC
TACGGTCCCG ACGAGCAAGA TTCTTCGTCG TCTTCACCTT CTAATTCTAC CAACTTCCTC
TTGGGAGCGA AGAATGCCCA CCGCACAAAG TTGACCACAC ACGACATCAG ACTAATTCTA
TACTTCATTG TCCAAATTAA ACCATTTAAG TATGTGGGTG ATCGCTCACT TTCCCAGACG
AAGAAGTGGG AGTTGATTCA GCAGAAGTTT GCAAGCCACA AACATCTGGA TCATGAGAAG
GATAGGAAAA ACGACGACTC GCCCGTAGTA GTTCCCACCG TAAGAACGCT TCAAAGACAG
TTGGCTACTG CCATCCGTAA GGCTAGTATC AGACGTCACG AACGCAAGCA GGCTGGCATA
ATTGACTCGA GCCCTAGCAG GTCTCAGGAT GACGAATACT ATTTGTTTAA GCATATTTCA
GCAGACAGTT CGCTAACAGA ATTAGAAGCA GCATTACTTG ACCTCAATGA TCTTAGCGAT
AAATTGAAGA CTGGCAAATT AGCGAACACC TCTCACCTCT TCCAGGGAAG CATGGATACA
GAGGTGCAAC GAGGTGTCAC CAATTTGACA AGTATGACTT CGTCGTTGAG AGCCCTTATT
GACTCTACTA ATTCCGCCAA TGGAGCGATT GATACTCGCC TAACTTCTAC GTTGCGGGAG
TTGTCAGATA TCAAGGATGA CATTGGAGCT TTGTATGCCA ACGACAGATA CAGTTATTCT
AGTATTTCTC AGTCTATGCA AGCATTTGAT GACTTCTTGG CTAAGTCAGC AGACTTCCAG
AGTCAAGTAA TTAACGAAAA TCATTCCTTG TTCCTCGAGC TCGACAAGTT GATCAAGAAT
CACTATGACA AGTTAGAGGC AATTAACAAA AACTATGCCG ACTACAGGGA CGAAGTGAGC
GAAAAGATTG TATCGCTTCT CGCTGACAAA ATCCAACATT CCACGGAGGT TAAGAAAGAC
GTCCAAGATC GTATCCTTTC CAAATTGACT TCGTTAAGGG ACACCGTAAG GAGGTGA
 
Protein sequence
MSNTTSIAMP HDPQPPTAAT PSTISTSSAI APPHTGPSVL HSRSIKHQWS GAASLLNPLP 
ADDVRSGSET VETAVSVVQA DAHSAGSIEA VAAARSLEAQ EVAQAVESVD AETVDINGES
YGPDEQDSSS SSPSNSTNFL LGAKNAHRTK LTTHDIRLIL YFIVQIKPFK YVGDRSLSQT
KKWELIQQKF ASHKHSDHEK DRKNDDSPVV VPTVRTLQRQ LATAIRKASI RRHERKQAGI
IDSSPSRSQD DEYYLFKHIS ADSSLTELEA ALLDLNDLSD KLKTGKLANT SHLFQGSMDT
EVQRGVTNLT SMTSSLRALI DSTNSANGAI DTRLTSTLRE LSDIKDDIGA LYANDRYSYS
SISQSMQAFD DFLAKSADFQ SQVINENHSL FLELDKLIKN HYDKLEAINK NYADYRDEVS
EKIVSLLADK IQHSTEVKKD VQDRILSKLT SLRDTVRR