Gene PICST_33737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33737 
Symbol 
ID4841062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp364115 
End bp365281 
Gene Length1167 bp 
Protein Length388 aa 
Translation table12 
GC content45% 
IMG OID640392377 
Productpredicted protein 
Protein accessionXP_001386657 
Protein GI150866906 
COG category[I] Lipid transport and metabolism 
COG ID[COG1562] Phytoene/squalene synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.605326 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAAGGT GTAACCGTCT AGTTGTGCGA TCGGGTTTGC GAAGATCTTA CTCTTCAATT 
TCTTCTTCCA AGTACGAAGT GCAGTTGTTT AATGCTACTG AGAATATCAA TAAGCTCTTG
GAGACCCATG ATCGTTCTTC ATATATATTA GCACAGTACG TTCCCGAGCC AGCCCGAAAC
GCCTTCTTGG CTATACGGGC TTTCAACTTG GAGATCAATA AGATCAGCGA TGGTGGCAGC
AATACAGGCT CCGTCGCTTC CAAGGCATCT TCACAACTTT CCAAGTCAAT GGGAATCTCC
ACAGCAGACA TGAAGTTCAA GTTCTGGAGT GATTTAGTCG CCAGGGTGTT CACGGAAGAC
CCATATCTGG AAAAAGACAT CGGTGAACCC ATAGCCATAT TGTTGAGAGA TGCATTACGG
AACGACTTGA ACTTAGACGT TACTTATTTC CATCAATTCC TACAGACTAG AAGGCAATTC
TTGAAGTCCC CGACGTTCCA GACTGTAGAC GACATTTGTT CCTACGGTGA AGGAACCTAC
TCCCAATTGA ACTACCAGAC TCAAGCTCTA TTACTATCTC CGTCCATATC GCCTTCTGTG
ATTAGTCTTT TGGAACAATC AACATCGTTG CAGTCTAAGG TAAGTGATAT CGCTGCACAT
ATCGGGCAAG CCACAGCTGT CGGTGCCATG ATCTTGGGAA TGAACTACTA TGCTACGTCC
AGAAACCAGG TCACGTTGCC TGTGAATTTG ATGTCCAAGT ACGACTTGTC CCAGGAGTCA
GTGTTGAAAT TGGCCCAAGG ACACGTGAAA GAGAAGACAG AGGTAGATGC TATCAGAGAC
AAGTTGAAAA ATATCGTTTA CGAAACAGCC ACAACATCCA ATGATCATAT CCTTACAGCC
AGGGCCAAGT TATCACAATG CAAACAAGAG ATCAACGAGA TAGTCAGGGC CAACATGCAT
GACCAATTAC TTCAGAAGAA CTCCAAGCGT TGGCGGAAGT TCATGCCAGA TGTAATTTTC
ACTCCTTTCA TGGTAGCCAT TCCTACGACG TTGTACTTGA ACAAGTTAGA GAAACACGAC
TTTGATATTT ACCACCACAA GATGCAGCAG AAGGAATGGC GGTTGGCGTG GACTTCGTTC
AAGGACTACT ACCAGAGAAC GATATAG
 
Protein sequence
MLRCNRLVVR SGLRRSYSSI SSSKYEVQLF NATENINKLL ETHDRSSYIL AQYVPEPARN 
AFLAIRAFNL EINKISDGGS NTGSVASKAS SQLSKSMGIS TADMKFKFWS DLVARVFTED
PYSEKDIGEP IAILLRDALR NDLNLDVTYF HQFLQTRRQF LKSPTFQTVD DICSYGEGTY
SQLNYQTQAL LLSPSISPSV ISLLEQSTSL QSKVSDIAAH IGQATAVGAM ILGMNYYATS
RNQVTLPVNL MSKYDLSQES VLKLAQGHVK EKTEVDAIRD KLKNIVYETA TTSNDHILTA
RAKLSQCKQE INEIVRANMH DQLLQKNSKR WRKFMPDVIF TPFMVAIPTT LYLNKLEKHD
FDIYHHKMQQ KEWRLAWTSF KDYYQRTI