Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_34942 |
Symbol | |
ID | 4837287 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 777944 |
End bp | 779767 |
Gene Length | 1824 bp |
Protein Length | 471 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640388602 |
Product | predicted protein |
Protein accession | XP_001382920 |
Protein GI | 150864195 |
COG category | [K] Transcription |
COG ID | [COG5576] Homeodomain-containing transcription factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.442314 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTCGG ATGAAAACAG CACAACCTCC AACAAGAGGA CTAGAGCATC TGGTGAGGCT CTCGATTTCT TGCTCCAGGA ATTTGAAAAA AACCCCAACC CTTCCACAGA GCAACGGAAA GATATCTCCA GCAAGACGAA CATGTCCGAA AAGGCCGTTC GGATCTGGTT TCAGAACCGT AGAGCCAAAC TCCGGAAATT TGAACGGTTG AACCGTTTGC AGACTGGAGG TTCAAGCATT CACTCGTCCC GTTCCAATAG CATCAGCAAT ATAAGCCCTA TCCACCCCAA CTATGGAAAC CAGGCTATTC CCATCGAGAT CAACGAAAAG TACTGCTTTG TTGACTGCAC TTCACTTAGT GTAGGCTCCT GGCAGAGGAT TAAGTCAGGT TACCATGACG AAAGACTGCT CCGTAACAAC CTCATCAACT TGTCGCCCTT CACGATTAAC TCGGTAATGA CCAGTGTAGA TTTGCTAGTG ATTTTGTCTA AGAAAAATTG CGAAATTAAC TATTTCTTCC TGGCCATCTC CAACAACTCC AAGATCCTCT TCCGCATATT TTATCCTATA TCTTCTGTTG CTACCTGTTC TTTGCTCGAT AATAATATCA CCAAAGAGAA CAGCGAGCTT CGTGTTAGTT TGACTCACCA GCCCAAGTTT TCGGTGTACT TCTTCAACGG AATCAACTCA CAAGCTAACC AGTGGTCCAT CTGTGACGAT TTCAGCGAAG GTCAACAAGT CAGTCAGGCT TACACTTCAG AAGGAGGTAC GTCCATTCCT CACGTGTTGG TCGGAATCAA GAGCTCTTTG CAGTACTTGA ACTCGTTTAT AGCTGACAAC AATAACCTGA CCTACTCACA ATTCCCGACC TCGGTAACAC CATCATTCCA ACAACCATTC CAAGAAGACA ATACAAGCAG AAATTCCAAC ATCAATAATA CCAACAATAC CAGCAATATC AATACTACTA AAATCAATAA TAATAATCAT GATTTCTTCA GTACGGAAGA TTTGCTCTGG GATGAAACCT CATCTTTGGC TCCTACTAAT ACCAACAGAT CCAATACACC ATTGCCATTT CCACAATCTT CCTCTGGTTC TATCAGCAAC GGCAGTCATT TGAGAAATGC CCAGATGTCT GGATTCAGCC CGTTGGCCGA TTTCAACTCT GACACTTCTC CTAACTCCAT AGGTAGTACC AACAGTAATC AAATCAATGT CAAGAACTCG AGCTCTCATA CTCCTGCCTT ACAACAACAC CATCTGCTCC AGCTGGTGCC TCAATACCAG CCACATACCT CCACGTTCAA TTCTAGCGCG AATACACCAC ATAGAATATA TTCGCGTCAT TCCATACCAC AGCATAACTC CCATAGCAAT ATACCTGAAA CCAGTTCTGT TGATGGTTAC GACGTTTTCA ACACAGCAAA CACCCCCGAC TTCTTCACTA CACTTAGTGG AGATGGCGGT CAGACTCCTT CCAATATGTT AAACCACGAG AACAGCCCCT CCATGAACTC CCAAGGCCAT TCTAATGGTG TTAATAGTAA CACTATCTCT ATTCATGCCT ATACACACCA GAACAATAAC CACAACGATT TCTTGGACTC AATACAAACG TTTCCATCGT CGCATGACTT CGAATTTGGA CTTGGTGGGG ATTTGGCTTC CAACGGTGAA ACTCCTTTGG GTGGAGCTTT TGATGGTGCT CCTAATGGTG GAAATGCTGC TGGTAACAAT AATAGCAGTA ACAATAACAA TAATGCTACT AGTGGTGGCA CTTCCAGCAA CGTCGATAGC TTTATAGACT TTGGCAGCCA CTGA
|
Protein sequence | MSSDENSTTS NKRTRASGEA LDFLLQEFEK NPNPSTEQRK DISSKTNMSE KAVRIWFQNR RAKLRKFERL NRLQTGGSSI HSSRSNSISN ISPIHPNYGN QAIPIEINEK YCFVDCTSLS VGSWQRIKSG YHDERSLRNN LINLSPFTIN SVMTSVDLLV ILSKKNCEIN YFFSAISNNS KILFRIFYPI SSVATCSLLD NNITKENSEL RVSLTHQPKF SVYFFNGINS QANQWSICDD FSEGQQVSQA YTSEGGTSIP HVLVGIKSSL QYLNSFIADN NNSTYSQFPT SMSGFSPLAD FNSDTSPNSI GSTNSNQINV KNSSSHTPAL QQHHSLQSFS NTPDFFTTLS GDGGQTPSNM LNHENSPSMN SQGHSNGVNS NTISIHAYTH QNNNHNDFLD SIQTFPSSHD FEFGLGGDLA SNGETPLGGA FDGAPNGGNA AGNNNSSNNN NNATSGGTSS NVDSFIDFGS H
|
| |