Gene PICST_38766 details

Gene Information

Locus tag: PICST_38766
Symbol:
ID: 4850783
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 45009
End bp: 46010
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table:
GC content: 42%
IMG OID: 640392491
Product: predicted protein
Protein accession: XP_001387663
Protein GI: 126273521
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2313] Uncharacterized enzyme involved in pigment biosynthesis
TIGRFAM ID:
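
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, that the gene is unspliced (as the record states), and that the reported gene length includes a terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length(45009, 46010)
print(length)                           # 1002
print(expected_protein_length(length))  # 333
```

Both results match the Gene Length (1002 bp) and Protein Length (333 aa) fields of this record.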


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0468003
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTATTAG CTCACAGGAG ACTTTTGTCT TCTTCTCAAG CTCGTGGATC ATTTCATATA 
GACATTTCGG AGGAAATCAA ACATGCATTG AATAGTCTGA AGCCCGTTGT ATCCTTGGAA
TCAACAATCA TAACCCATGG TTTGCCATTT CCCCAGAACT TCGAGATGGC TAAGCAAGTT
GAAGAAGTTG TCAGAGACAA TGGAGCTATC CCAGCAACTT GCGCTTTCAT AGACGGAAAA
CCTCGTGTGG GCTTGAGCGA ACTACTGTTA AAATATTTGG CGGAACAGGC TAATAAGGGT
AAGGCAAATA AGGTTTCTCG AAGAGATATT GGCTATACCA TGGCCAAAGG TTACAATGGA
GGAACCACGA TTGCTCTGAC AATGATTCTT TCCCATATGG CGGGAATCAA GGTGTTTGCT
ACTGGTGGTT TAGGTGGTGT TCATAAGGGT GGCCAGAATT CATTTGATGT TTCGGCAGAT
TTGACGGAAC TTGGTAGAAC ACCAGTTTCT GTGGTATGTT CTGGACCAAA GTCTATCTTG
GATATTGGTT TGACGCTTGA GTTCTTGGAG ACTCAAGGAG TTTTTGTAGG AACATACAAC
GACGATGGAA GGCTCGACGT TGAGGTACCT GGTTTCTACT GTCGTGAATC TGGCTATAGA
TCGCCATATG ATTTTTCCAG CTTCGAAGAG GCTGCCTCCA TTATCCATAA TCACAACAAT
ATCATGTCTC TTAATTCAGG GAACATATTC TGCATTCCTC CACCGAGAGA ATCGGCATTG
TCGTCTTCTT TCATAAGCAA AGTGATCGAC CGTGCCAATC AAGAAGCGAT TGCTCAAAAT
ATTTCGGGCA AGAATTTGAC TCCTTTCTTG TTATCAAAGA TTGCAGAAGA AACCAATGGC
AAATCTGTTG AATGTAATAT TAAATTTGTA TTAAATAATG CTAGAGCAGC CACCCAGATT
GCAACAAGCT TGAGTAAATT AGAGAACAAT GTGAGTATAT GA
 
Protein sequence
MLLAHRRLLS SSQARGSFHI DISEEIKHAL NSLKPVVSLE STIITHGLPF PQNFEMAKQV 
EEVVRDNGAI PATCAFIDGK PRVGLSELLL KYLAEQANKG KANKVSRRDI GYTMAKGYNG
GTTIALTMIL SHMAGIKVFA TGGLGGVHKG GQNSFDVSAD LTELGRTPVS VVCSGPKSIL
DIGLTLEFLE TQGVFVGTYN DDGRLDVEVP GFYCRESGYR SPYDFSSFEE AASIIHNHNN
IMSLNSGNIF CIPPPRESAL SSSFISKVID RANQEAIAQN ISGKNLTPFL LSKIAEETNG
KSVECNIKFV LNNARAATQI ATSLSKLENN VSI
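
The sequence blocks above are printed in space-separated groups of 10 characters, so they need to be cleaned before any calculation. The following is a minimal sketch of that cleanup plus a GC-content calculation of the kind that would underlie the GC content field. The helper names are hypothetical, and the example uses only the first row of the gene sequence, so its result is not expected to equal the 42% reported for the full gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence as printed in this record.
first_row = "ATGCTATTAG CTCACAGGAG ACTTTTGTCT TCTTCTCAAG CTCGTGGATC ATTTCATATA"
seq = clean_sequence(first_row)
print(len(seq))                       # 60
print(f"{gc_content(seq) * 100:.1f}%")
```

Passing the complete gene sequence block to `clean_sequence` and then to `gc_content` would give the value to compare against the record's GC content field.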