Gene PICST_85698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_85698 
SymbolSUL1 
ID4841147 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp738029 
End bp741073 
Gene Length3045 bp 
Protein Length963 aa 
Translation table12 
GC content43% 
IMG OID640392462 
Productsulfate transporter Sulfate/bicarbonate/oxalate exchanger SAT-1 and related transporters (SLC26 family) 
Protein accessionXP_001386734 
Protein GI126140424 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.525688 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GGAGTAAGGA AATACTCCAT TTCCGAAGCG TTTGGCTCGT TCCTGAATGG CGCAGTTGGC 
AGTCCCGTTC CCTTTGACTT TGGAGGAAGA AGCTATGTCT CCATGAGCCA TCGTCCCACT
AGTGTAAACC ATTTCAATTC TCCCATTGTA AACTCTGGTT CAGAAATCCA CGAGCAAACA
GTCAACTTGG CCAACATCGG AGTCACAGAC GATAATGTAG ACCATGAACC ACGTGAAGAA
GACGAAACTT CCATAGTGGA GGACTTTTCA ACTTCGCACT CTTCTACCTT ACCCAAACCC
ATTCGTGGAA GAGCTGCACC ACAAGCAGAT ATAGCATCAA TAACCAGTAC AACACCATTA
ATATCCAAAT CACCTAAGAA TTTATATATT GAAGAGCAAT CGTTGTTCAC GTCATCCGCC
AAAATTGGGT TGGATTTGGA AGCTGGTGCC CCAGACCAAA CTGCTATTGG AACTGGTCAA
GATGCGACTT TGAAACTCAG ACCCACAGTA TCTGATCACA AGGAAGACCT CAGTCAGCAA
CACCCGCCCA GCTTATTGCA TCAGTATGTT GTCAAGCCCA TCCACTACGT GCCCGCTGTC
TTCCTTGGTA CACTTTTGAA CATCTTAGAT GGCTTATCGT ATGGGATGAT CATGTTCCCA
GTTAGTGAGG CTGTCTTCTC ATCGTTGGCT CCGGCGGGTT TGTCAATGTT TTACATGTCT
TGTATTGTTT CTCAATTGGT TTATTCACTT GGAGGATCGG CATTCAGGTC GGGAATAGGT
TCGGAAATGA TCGAGGTCAC TCCCTTCTTC CATACAATGG CGCTTTCCAT TGCTTCAGAA
ATGGCCAACG AGTCCCAGGC AGCTATAATC GCCACTACTA TCACTTCATA CGCTCTTTCT
TCTATAGTCA CGGGGCTTGT ATTCTATATC TTAGGTAAAT GCCGCTTGGG TGTTCTTGTG
GGATTCTTTC CTCGTCACAT TCTAGTTGGC TGTATCGGAG GAGTGGGTTA CTTCTTGGTA
GCAACCGGTG TAGAAGTCTC GTCCAGATTG GAAGGAGGCT TGGAGTACAA CTACGAGACA
TTCAAGTACT TGTTCTGCAA CACATTGACT TTGGCAGAAT GGACTCTCCC ATTGCTTTTA
GCTATATTTC TCATTGCTCT TCAGCACAAG TTCCACAACT CTTTGCTTGT TCCCTTGTAT
TTCATAGCTG TGTTCATTCT CTTCCATGTT ACAGTCTTAG TCGTTCCTTC TTGGAACTTG
CAAAGTGCTA GAAATAGCGG CTGGGTATTC CCGGCAGTGG AAGATAACGA ACCATGGTAT
GAATTCTACA CTTATTACAA GTTCAACTTG GTGGATTGGC TTGCCGTACT CAAACAAGTA
CCTTCGATGT TGGCATTGAC ATTCTTTGGT ATCTTGCATG TTCCCATTAA CGTTCCAGCA
CTTGCTGTAA CGGTGGGTAT GGATGAAGTA GATGTAGACA GAGAATTGGT AGCTCACGGA
TATTCAAATG TTTTGTCTGG TTTAGTAGGC TCTATTCAAA ACTACTTGGT GTACACCAAT
TCTGTTCTTT TCATCAGAGC AGGTGCCGAT GACCGTTTGG CTGGTGTTTT GCTTGCCATT
GCTACTGGCG CAGTGATGAT GACTGGTCCT GTCGTCATTG GCTACATCCC AGTATGTGTG
GTTGGAGCTT TGATCTTCCT TTTGGGTTAC GAATTATTGA AGGAATCTGT CTACGATACC
TGGGGCCGTT TGAGAAACAT CGAATACACC ACTGTCATCA TCATTGTTAT CACAATGGGT
GCCTTTGACT TTGTCTTTGG TATATTGGTA GGTATCTTGC TTGCCTGTCT TTCATTTGTA
GTTGAAGCTG GTAGGAGTCC AGTTGTCCAG GGAGTATACT CTGGTTCTGT AGCTAGATCT
ACAGTTTTAA GACATCCTAA ACAACAAGAG TTCTTGAAGG ATGTCGGTGA CCAAATCTGC
GTTATCAAAT TACAAGGGAC TGTCTTTTTC GGTTCCATTG GTGGAGTTGA AAATGCTATC
AGAGGTAAGT TTTCGCAGGA TGAGTTCAAA TTAAATCCCA TCAAATTCTT GATTATCGAT
ATGAAAGGAG TTAGTTCCAT TGATTTTTCT GCGGCTGAAG GTTTCAGAAG AGTCCTCAAT
TTAACCAATG AATTTAACAC CCGTTTGATT TTCTCCTCCG TTCAAGAAAA CGACGATATC
ATCAAGGGTT TGCGTGATGC TGGTTTGTGG GATAATAGTA GCGGAGAAAA TCCAATTGAA
TTGTTTAACA CATTGAATTA CGCATTGGAA TGGTGTGAAA ACTCCTTCTT GAGATATTAC
AAGACTGTTA AGAGAAAGGA GCAAAATGTC AATATTCGTT CTGTACCGAA CAGTGGCAGC
ACCAATCTTT CCCCTGGAAG AAAGTCCATT ATCTCCCCTC AGAACAATTC TCTTAATACC
AGACAGTTAA TGAGTATGAA CTTTGACATC GGAACTCCTA GAACTACTCA AGTCTATAAT
GCTGCAACTA GAACTGTTCA AGATGAACAG AAGTCACAGA CCAGGTACTA TTCCTCGTCC
GACTCGTCGT TCAAAAAACA ACCACTACCA TTAATCATGA TAACTTTCCA AGGCTTATCG
GACAAGGACG AAGAGTTCTG GTCGGCTATC ACTCCGTACT TCCAAAAGGA AAAAATCCCA
GAAGATATTG AATTCTACAA TACAACCTCT GAGCATGCTG CCTTCTTCAT AGTGGAGTCT
GGGTTAATCA GATCCGTTTA CAAGCTAGAA GAGGGACGCG AATTGCATTC AAGTATCTTG
CCATTGACTG CGTTTGGAGA TCTTTTTCAA ACAAGACGTT ATAGAAAAAT CAGCTACACC
ACTGTTAGCG ATTCCGTAGT GTGGAAGCTT TCTGATTCCA AACTAAGTGA GATGTTGAAG
ACCAAAGAAG GGCAATCGTT GTACAATGAG TTGTTGAAGA TCGAGACGGA GTTGGTGAGA
GAACGTTTTG ACACCATGAC GGCTAATTTG GTTATTGCTG GTTAA
 
Protein sequence
MSHRPTSVNH FNSPIVNSGS EIHEQTVNLA NIGVTDDNVD HEPREEDETS IVEDFSTSHS 
STLPKPIRGR AAPQADIASI TSTTPLISKS PKNLYIEEQS LFTSSAKIGL DLEAGAPDQT
AIGTDLSQQH PPSLLHQYVV KPIHYVPAVF LGTLLNILDG LSYGMIMFPV SEAVFSSLAP
AGLSMFYMSC IVSQLVYSLG GSAFRSGIGS EMIEVTPFFH TMALSIASEM ANESQAAIIA
TTITSYALSS IVTGLVFYIL GKCRLGVLVG FFPRHILVGC IGGVGYFLVA TGVEVSSRLE
GGLEYNYETF KYLFCNTLTL AEWTLPLLLA IFLIALQHKF HNSLLVPLYF IAVFILFHVT
VLVVPSWNLQ SARNSGWVFP AVEDNEPWYE FYTYYKFNLV DWLAVLKQVP SMLALTFFGI
LHVPINVPAL AVTVGMDEVD VDRELVAHGY SNVLSGLVGS IQNYLVYTNS VLFIRAGADD
RLAGVLLAIA TGAVMMTGPV VIGYIPVCVV GALIFLLGYE LLKESVYDTW GRLRNIEYTT
VIIIVITMGA FDFVFGILVG ILLACLSFVV EAGRSPVVQG VYSGSVARST VLRHPKQQEF
LKDVGDQICV IKLQGTVFFG SIGGVENAIR GKFSQDEFKL NPIKFLIIDM KGVSSIDFSA
AEGFRRVLNL TNEFNTRLIF SSVQENDDII KGLRDAGLWD NSSGENPIEL FNTLNYALEW
CENSFLRYYK TVKRKEQNVN IRSVPNSGST NLSPGRKSII SPQNNSLNTR QLMSMNFDIG
TPRTTQVYNA ATRTVQDEQK SQTRYYSSSD SSFKKQPLPL IMITFQGLSD KDEEFWSAIT
PYFQKEKIPE DIEFYNTTSE HAAFFIVESG LIRSVYKLEE GRELHSSILP LTAFGDLFQT
RRYRKISYTT VSDSVVWKLS DSKLSEMLKT KEGQSLYNEL LKIETELVRE RFDTMTANLV
IAG