Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_33739 |
Symbol | SAN1 |
ID | 4840867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | - |
Start bp | 366829 |
End bp | 368650 |
Gene Length | 1822 bp |
Protein Length | 607 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640392182 |
Product | mating-type transcriptional regulator (putative) |
Protein accession | XP_001386658 |
Protein GI | 150866907 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5243] HRD ubiquitin ligase complex, ER membrane component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0111037 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAGCA ACAACCGTGA AACCACTAGT CCAGGCGGGA ATAACCCGTC TGCTGACTCT TCCACTAACA TGACAGATGC CGAATATATT GGCAATCTTT TGAACAACAC TAATACAAAT GCAACTAATT CAAACTCAAC TAATTCAGGC CCGACTAATT CAAACTCAAC CAATTCAAAT TCAACTAACA CAAGTACACA CAATACAAGC TCTTCTGGCA ATAATAGCAG TGCCACTGGC AATGGAAATG GGAATACTTC CCCCATGAGA AGAATGCCTT CTTCAGCATC GTCCAATTCT GGTAATACTT CATCATCTTC CAGCCGCATA TTCTCTGCAT TTAATAGAAG TCATAGAGCG GAGTTAGGTC TTTTTGATAG ATTTAGAGAT ACGGTTTCGG GTTTGTACCG TTCCAACAGA ACCAGCAGAG AAGGTAGTCA CCAGCCTAAC TCTGCTACAC AGACTTCTAC AGATATACCT ACTGTTGTGC CACAGGCTGC TCAGCTTGAA ACAACTGCTA CAATCGATAC CAGCGGCCCT ACCACTGGAC CTTCTGCTCT GACTTTCAAC AGTGAAAGTG ACTATACTCG TGCTATAGTC ATCACCGTTA ACTATGTGTT TTCTGACGAG AACAACCCTC AGTATCCCAA CAGGGCCGGA TCATTGATCA TGTCACTCCC CAATAATTCA TCCAATAGAG ACCCCAGAGT AATCCAGGAG TTCATCAGAT TGGCCACACA AATGGCTTAT TCCACCATCA TTAACGGTTT ACACAAGGAG AAAGGTGTAA CTCTTTGTAA ATTCAATTCT TTCCCGAGTG TTAAGGAAGC TGATTTGGGC GATTCTCGAG CATGCTCTAT CTGTTTTGAC GAATTTGACA TGGTAGAAGC AGAAAAAGAA AAATCATTGG TAGACGAAAG TGATGATGAG CTAGTTGTAG TGAAGAAGAG AAGAATAGAC GAACTAAGAC TGGCTGCTAC GAGCGAAAGC AATAGCAGAG TACAATCCGA TGACGAATCA GACTCGGCTA TTCCTCCCTC GAACAACATT CTTGAAAGCA CTTCCATCGA CTCCACAAAC ACCGATCTAC CCAATACAGA CCCTGCAAGT ACATCGAATA CAAATCCAAC TAATACAAAT ACACCCAGCA CCAATGAAGA GCTGGCAGAG CCAAAGTACT TATCTGAGTA CACTGGGGTT TTTGATCATA GCCCAATCAG GATGACTTGT GGTCATATCT TTGGAAAGGA TTGTCTTTCG GAATGGTTAA AAGAGCACAC TACTTGCCCA TTGTGTAGAG ATTCAGTCGC AGAACCAACG TCAAGAACGA ATTCAAGCAA TGTGACAATC TTTAACTTAC CTACCAACTC TAGTCGTCCA ACTACAACCG AAGCTCATGT TGACATTAAC GAGACTACGA CTCCCCAAGA AGAAACCTCA ATTGACTCAG ATACCGTTTA TCGGGAAGCT GACAGATTCC ACTACTTTCC TTTATCAGGA GGTTCAGCGC GTCCTTTGAG AAGAGTTCTC AGATCTGGAA GCGGTATATC TGATGCGGCA GCAGAACAGA CAGAAAGAGA CTACTCTACC TCACACAGCC AGTTTTCTCA CATCTTGGGC TTTCTTAGAA GACAAAGAAC AGGCAACCAT CCTGAGCCAT TGTTTCCAAC TGGAATTTCC AGCAGAAGAA CAGCCAATGG TATTGAGACG AGACTGACTG ATGAAGATGA CGATGCTGCT TCGGAAATTC TCGATTTCAT GAATTTGCAT GAGTTTAATC CTCCGGTCAA TTCTGATTCA GCGAACACTA TTTCTGAAGC AG
|
Protein sequence | MNSNNRETTS PGGNNPSADS STNMTDAEYI GNLLNNTNTN ATNSNSTNSG PTNSNSTNSN STNTSTHNTS SSGNNSSATG NGNGNTSPMR RMPSSASSNS GNTSSSSSRI FSAFNRSHRA ELGLFDRFRD TVSGLYRSNR TSREGSHQPN SATQTSTDIP TVVPQAAQLE TTATIDTSGP TTGPSASTFN SESDYTRAIV ITVNYVFSDE NNPQYPNRAG SLIMSLPNNS SNRDPRVIQE FIRLATQMAY STIINGLHKE KGVTLCKFNS FPSVKEADLG DSRACSICFD EFDMVEAEKE KSLVDESDDE LVVVKKRRID ELRSAATSES NSRVQSDDES DSAIPPSNNI LESTSIDSTN TDLPNTDPAS TSNTNPTNTN TPSTNEESAE PKYLSEYTGV FDHSPIRMTC GHIFGKDCLS EWLKEHTTCP LCRDSVAEPT SRTNSSNVTI FNLPTNSSRP TTTEAHVDIN ETTTPQEETS IDSDTVYREA DRFHYFPLSG GSARPLRRVL RSGSGISDAA AEQTERDYST SHSQFSHILG FLRRQRTGNH PEPLFPTGIS SRRTANGIET RSTDEDDDAA SEILDFMNLH EFNPPVNSDS ANTISEA
|
| |