Gene PICST_33739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33739 
SymbolSAN1 
ID4840867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp366829 
End bp368650 
Gene Length1822 bp 
Protein Length607 aa 
Translation table12 
GC content43% 
IMG OID640392182 
Productmating-type transcriptional regulator (putative) 
Protein accessionXP_001386658 
Protein GI150866907 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5243] HRD ubiquitin ligase complex, ER membrane component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0111037 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAGCA ACAACCGTGA AACCACTAGT CCAGGCGGGA ATAACCCGTC TGCTGACTCT 
TCCACTAACA TGACAGATGC CGAATATATT GGCAATCTTT TGAACAACAC TAATACAAAT
GCAACTAATT CAAACTCAAC TAATTCAGGC CCGACTAATT CAAACTCAAC CAATTCAAAT
TCAACTAACA CAAGTACACA CAATACAAGC TCTTCTGGCA ATAATAGCAG TGCCACTGGC
AATGGAAATG GGAATACTTC CCCCATGAGA AGAATGCCTT CTTCAGCATC GTCCAATTCT
GGTAATACTT CATCATCTTC CAGCCGCATA TTCTCTGCAT TTAATAGAAG TCATAGAGCG
GAGTTAGGTC TTTTTGATAG ATTTAGAGAT ACGGTTTCGG GTTTGTACCG TTCCAACAGA
ACCAGCAGAG AAGGTAGTCA CCAGCCTAAC TCTGCTACAC AGACTTCTAC AGATATACCT
ACTGTTGTGC CACAGGCTGC TCAGCTTGAA ACAACTGCTA CAATCGATAC CAGCGGCCCT
ACCACTGGAC CTTCTGCTCT GACTTTCAAC AGTGAAAGTG ACTATACTCG TGCTATAGTC
ATCACCGTTA ACTATGTGTT TTCTGACGAG AACAACCCTC AGTATCCCAA CAGGGCCGGA
TCATTGATCA TGTCACTCCC CAATAATTCA TCCAATAGAG ACCCCAGAGT AATCCAGGAG
TTCATCAGAT TGGCCACACA AATGGCTTAT TCCACCATCA TTAACGGTTT ACACAAGGAG
AAAGGTGTAA CTCTTTGTAA ATTCAATTCT TTCCCGAGTG TTAAGGAAGC TGATTTGGGC
GATTCTCGAG CATGCTCTAT CTGTTTTGAC GAATTTGACA TGGTAGAAGC AGAAAAAGAA
AAATCATTGG TAGACGAAAG TGATGATGAG CTAGTTGTAG TGAAGAAGAG AAGAATAGAC
GAACTAAGAC TGGCTGCTAC GAGCGAAAGC AATAGCAGAG TACAATCCGA TGACGAATCA
GACTCGGCTA TTCCTCCCTC GAACAACATT CTTGAAAGCA CTTCCATCGA CTCCACAAAC
ACCGATCTAC CCAATACAGA CCCTGCAAGT ACATCGAATA CAAATCCAAC TAATACAAAT
ACACCCAGCA CCAATGAAGA GCTGGCAGAG CCAAAGTACT TATCTGAGTA CACTGGGGTT
TTTGATCATA GCCCAATCAG GATGACTTGT GGTCATATCT TTGGAAAGGA TTGTCTTTCG
GAATGGTTAA AAGAGCACAC TACTTGCCCA TTGTGTAGAG ATTCAGTCGC AGAACCAACG
TCAAGAACGA ATTCAAGCAA TGTGACAATC TTTAACTTAC CTACCAACTC TAGTCGTCCA
ACTACAACCG AAGCTCATGT TGACATTAAC GAGACTACGA CTCCCCAAGA AGAAACCTCA
ATTGACTCAG ATACCGTTTA TCGGGAAGCT GACAGATTCC ACTACTTTCC TTTATCAGGA
GGTTCAGCGC GTCCTTTGAG AAGAGTTCTC AGATCTGGAA GCGGTATATC TGATGCGGCA
GCAGAACAGA CAGAAAGAGA CTACTCTACC TCACACAGCC AGTTTTCTCA CATCTTGGGC
TTTCTTAGAA GACAAAGAAC AGGCAACCAT CCTGAGCCAT TGTTTCCAAC TGGAATTTCC
AGCAGAAGAA CAGCCAATGG TATTGAGACG AGACTGACTG ATGAAGATGA CGATGCTGCT
TCGGAAATTC TCGATTTCAT GAATTTGCAT GAGTTTAATC CTCCGGTCAA TTCTGATTCA
GCGAACACTA TTTCTGAAGC AG
 
Protein sequence
MNSNNRETTS PGGNNPSADS STNMTDAEYI GNLLNNTNTN ATNSNSTNSG PTNSNSTNSN 
STNTSTHNTS SSGNNSSATG NGNGNTSPMR RMPSSASSNS GNTSSSSSRI FSAFNRSHRA
ELGLFDRFRD TVSGLYRSNR TSREGSHQPN SATQTSTDIP TVVPQAAQLE TTATIDTSGP
TTGPSASTFN SESDYTRAIV ITVNYVFSDE NNPQYPNRAG SLIMSLPNNS SNRDPRVIQE
FIRLATQMAY STIINGLHKE KGVTLCKFNS FPSVKEADLG DSRACSICFD EFDMVEAEKE
KSLVDESDDE LVVVKKRRID ELRSAATSES NSRVQSDDES DSAIPPSNNI LESTSIDSTN
TDLPNTDPAS TSNTNPTNTN TPSTNEESAE PKYLSEYTGV FDHSPIRMTC GHIFGKDCLS
EWLKEHTTCP LCRDSVAEPT SRTNSSNVTI FNLPTNSSRP TTTEAHVDIN ETTTPQEETS
IDSDTVYREA DRFHYFPLSG GSARPLRRVL RSGSGISDAA AEQTERDYST SHSQFSHILG
FLRRQRTGNH PEPLFPTGIS SRRTANGIET RSTDEDDDAA SEILDFMNLH EFNPPVNSDS
ANTISEA