Gene PICST_30840 details


Gene Information

Locus tag: PICST_30840
Symbol: SFU1
ID: 4837767
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 1076135
End bp: 1077595
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 12
GC content: 51%
IMG OID: 640389082
Product: GATA type transcriptional activator of nitrogen-regulated genes
Protein accession: XP_001383491
Protein GI: 150864605
COG category: [K] Transcription
COG ID: [COG5641] GATA Zn-finger-containing transcription factor
TIGRFAM ID:
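
The gene and protein lengths listed above follow from the start and end coordinates. The short sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the record does not state either convention, but its numbers are consistent with both. The function name is hypothetical.

```python
def lengths_from_coordinates(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa).

    Assumes 1-based inclusive coordinates and an unspliced CDS whose
    last codon is a stop codon (not counted in the protein length).
    """
    gene_length = end_bp - start_bp + 1   # inclusive range
    codons = gene_length // 3             # includes the stop codon
    protein_length = codons - 1           # drop the stop codon
    return gene_length, protein_length


gene_len, protein_len = lengths_from_coordinates(1076135, 1077595)
print(gene_len, protein_len)  # 1461 486
```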


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTCCC TAACAGCACC CGCGTCTTCT GAGACAGCGA CACCAGTTAA ACAGGAGGAC 
TCCTCGACAC TCGAGCACAT CCAGGTGCCG TCCACTGCTA GCCTTGTCAC TCCAGCTACG
CCAGCAGCAC CAACAACTCC GGCTACACCG ACTTCGTGTC TGACGGCTGC ACTTGCGACT
GTAACATCTC CGCAAGATGG CCAACAATGC TCAAACTGTG GCACTACCAA AACTCCACTT
TGGAGGAGGG CGCCTGATGG AACCCTCATC TGCAATGCCT GTGGGTTGTA CCTTCGTTCA
AACAACACCC ATAGGCCCGT AAATCTCAAG CGTCCTCCCA ACACGATACC CATAACCAAA
ACCGAAGAAG GATCCTGTAA AGGCGATGGC AGATGTAACG GAACTGGAGG ATCGGCTGCC
TGTAAAGGAT GTCCCGCCTA CAACAACCGG GTTGTAGTAA GCAAACGCGA GAAGTCTGCA
TCTACACCTC CCATGGAATC GACTCCGGCC ACATCACCAC AGCCAGAAAA GAGAGTAGCT
ACCGATGTAG ACGAAGACTC TCTAGCCATT GCCTGTTTCA ACTGTGGAAC CACTATTACT
CCACTCTGGA GAAGGGATGA TGCCGGTAAT ACTATCTGCA ATGCCTGTGG ACTATACTAC
AGATTGCATG GGTCGCATCG TCCAATCAGG ATGAAAAGAA CGACCATCAA GAGAAGGAAG
AGGAACATGG CTTCGGGCAA GAAAGATGCT TCGGCCTCTG ACTCAAACAC AGAAGACAAA
GAGCCTCGTA GTCCGGAGGA CGCTCGAAAT ACCCATAGTC CACAATTGAA TACTCTTCAT
TCCCCTACTG CAGGTTTGGG CTCTCCTAAT GGGTCGACTC TACCTCCAAT TTCGTACAAC
CATCCTCTCC TGAGACTGGC TCCTCCGGTG TCAAACTCTC CTACTTCGTA CTATCCTCCA
TACACCCCTG GAGGACGTAT TCCAAACGGT CCAGGTCCTC TACCAGGTCC TCCACCACCT
CAACCGGCTT CGCAGTACGG CTTCTCCATC CCCCAAACCC ACCAACCACC TTTGCATTAT
GGAATATACA ACCAAAATTC GGACATAAAA CTTCCTCTGA TCCAGCTCCA CGAAGCTGGG
CCCCACAGCC GGCACCTTTT GGCACCTATC ATCAAGAAAG AAGCCATTCC GTCAATCTCT
ACATCCAAAT CTGCATCAAT TATCCCGCCT GTGTCGGAAT TCGTAGGTTC TCGACAATCT
ACTCCGGGAG TATCTGATGG TATCAAAAAG AGATCGGCGT CTTCAACTCC AATTGCGATC
GATTTTACGT CGACTTTCAG GTCTCCTGCT GCTTCTAACG GTAATACGTC TACTATATCT
GATACTAACA ACGATAGTGA AGGCAGCAAA AAGGACCACC GCACGCACGC ATTGTCGATT
GGAGGCTTGT TAAATGGATA G
 
Protein sequence
MSSLTAPASS ETATPVKQED SSTLEHIQVP STASLVTPAT PAAPTTPATP TSCSTAALAT 
VTSPQDGQQC SNCGTTKTPL WRRAPDGTLI CNACGLYLRS NNTHRPVNLK RPPNTIPITK
TEEGSCKGDG RCNGTGGSAA CKGCPAYNNR VVVSKREKSA STPPMESTPA TSPQPEKRVA
TDVDEDSLAI ACFNCGTTIT PLWRRDDAGN TICNACGLYY RLHGSHRPIR MKRTTIKRRK
RNMASGKKDA SASDSNTEDK EPRSPEDARN THSPQLNTLH SPTAGLGSPN GSTLPPISYN
HPLSRSAPPV SNSPTSYYPP YTPGGRIPNG PGPLPGPPPP QPASQYGFSI PQTHQPPLHY
GIYNQNSDIK LPSIQLHEAG PHSRHLLAPI IKKEAIPSIS TSKSASIIPP VSEFVGSRQS
TPGVSDGIKK RSASSTPIAI DFTSTFRSPA ASNGNTSTIS DTNNDSEGSK KDHRTHALSI
GGLLNG
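
The record lists translation table 12, the alternative yeast nuclear code, which reads CTG as serine rather than leucine. The sketch below is a minimal, dependency-free illustration of how the gene sequence maps to the protein sequence under that table. The helper name translate_table12 is hypothetical and this is not necessarily how the database produced its translation. The function stops at the first stop codon and ignores the whitespace used to group the sequence into blocks of ten.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 12, listed in TCAG codon order.
# It differs from the standard table at one codon: CTG encodes S, not L.
TABLE_12_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate_table12(dna: str) -> str:
    """Translate a coding DNA sequence using translation table 12."""
    seq = "".join(dna.split()).upper()   # strip block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":                    # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence in this record.
print(translate_table12("ATGTCGTCCC TAACAGCACC C"))   # MSSLTAP
print(translate_table12("GGAGGCTTGT TAAATGGATA G"))   # GGLLNG
```

Under the standard genetic code the CTG codon in "ACTTCGTGTC TGACGGCTGC" would give TSCLTA; with table 12 it gives TSCSTA, which matches the protein sequence shown in this record.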