Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_30840 |
Symbol | SFU1 |
ID | 4837767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 1076135 |
End bp | 1077595 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 12 |
GC content | 51% |
IMG OID | 640389082 |
Product | GATA type transcriptional activator of nitrogen-regulated genes |
Protein accession | XP_001383491 |
Protein GI | 150864605 |
COG category | [K] Transcription |
COG ID | [COG5641] GATA Zn-finger-containing transcription factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTCCC TAACAGCACC CGCGTCTTCT GAGACAGCGA CACCAGTTAA ACAGGAGGAC TCCTCGACAC TCGAGCACAT CCAGGTGCCG TCCACTGCTA GCCTTGTCAC TCCAGCTACG CCAGCAGCAC CAACAACTCC GGCTACACCG ACTTCGTGTC TGACGGCTGC ACTTGCGACT GTAACATCTC CGCAAGATGG CCAACAATGC TCAAACTGTG GCACTACCAA AACTCCACTT TGGAGGAGGG CGCCTGATGG AACCCTCATC TGCAATGCCT GTGGGTTGTA CCTTCGTTCA AACAACACCC ATAGGCCCGT AAATCTCAAG CGTCCTCCCA ACACGATACC CATAACCAAA ACCGAAGAAG GATCCTGTAA AGGCGATGGC AGATGTAACG GAACTGGAGG ATCGGCTGCC TGTAAAGGAT GTCCCGCCTA CAACAACCGG GTTGTAGTAA GCAAACGCGA GAAGTCTGCA TCTACACCTC CCATGGAATC GACTCCGGCC ACATCACCAC AGCCAGAAAA GAGAGTAGCT ACCGATGTAG ACGAAGACTC TCTAGCCATT GCCTGTTTCA ACTGTGGAAC CACTATTACT CCACTCTGGA GAAGGGATGA TGCCGGTAAT ACTATCTGCA ATGCCTGTGG ACTATACTAC AGATTGCATG GGTCGCATCG TCCAATCAGG ATGAAAAGAA CGACCATCAA GAGAAGGAAG AGGAACATGG CTTCGGGCAA GAAAGATGCT TCGGCCTCTG ACTCAAACAC AGAAGACAAA GAGCCTCGTA GTCCGGAGGA CGCTCGAAAT ACCCATAGTC CACAATTGAA TACTCTTCAT TCCCCTACTG CAGGTTTGGG CTCTCCTAAT GGGTCGACTC TACCTCCAAT TTCGTACAAC CATCCTCTCC TGAGACTGGC TCCTCCGGTG TCAAACTCTC CTACTTCGTA CTATCCTCCA TACACCCCTG GAGGACGTAT TCCAAACGGT CCAGGTCCTC TACCAGGTCC TCCACCACCT CAACCGGCTT CGCAGTACGG CTTCTCCATC CCCCAAACCC ACCAACCACC TTTGCATTAT GGAATATACA ACCAAAATTC GGACATAAAA CTTCCTCTGA TCCAGCTCCA CGAAGCTGGG CCCCACAGCC GGCACCTTTT GGCACCTATC ATCAAGAAAG AAGCCATTCC GTCAATCTCT ACATCCAAAT CTGCATCAAT TATCCCGCCT GTGTCGGAAT TCGTAGGTTC TCGACAATCT ACTCCGGGAG TATCTGATGG TATCAAAAAG AGATCGGCGT CTTCAACTCC AATTGCGATC GATTTTACGT CGACTTTCAG GTCTCCTGCT GCTTCTAACG GTAATACGTC TACTATATCT GATACTAACA ACGATAGTGA AGGCAGCAAA AAGGACCACC GCACGCACGC ATTGTCGATT GGAGGCTTGT TAAATGGATA G
|
Protein sequence | MSSLTAPASS ETATPVKQED SSTLEHIQVP STASLVTPAT PAAPTTPATP TSCSTAALAT VTSPQDGQQC SNCGTTKTPL WRRAPDGTLI CNACGLYLRS NNTHRPVNLK RPPNTIPITK TEEGSCKGDG RCNGTGGSAA CKGCPAYNNR VVVSKREKSA STPPMESTPA TSPQPEKRVA TDVDEDSLAI ACFNCGTTIT PLWRRDDAGN TICNACGLYY RLHGSHRPIR MKRTTIKRRK RNMASGKKDA SASDSNTEDK EPRSPEDARN THSPQLNTLH SPTAGLGSPN GSTLPPISYN HPLSRSAPPV SNSPTSYYPP YTPGGRIPNG PGPLPGPPPP QPASQYGFSI PQTHQPPLHY GIYNQNSDIK LPSIQLHEAG PHSRHLLAPI IKKEAIPSIS TSKSASIIPP VSEFVGSRQS TPGVSDGIKK RSASSTPIAI DFTSTFRSPA ASNGNTSTIS DTNNDSEGSK KDHRTHALSI GGLLNG
|
| |