Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_85698 |
Symbol | SUL1 |
ID | 4841147 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | - |
Start bp | 738029 |
End bp | 741073 |
Gene Length | 3045 bp |
Protein Length | 963 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640392462 |
Product | sulfate transporter Sulfate/bicarbonate/oxalate exchanger SAT-1 and related transporters (SLC26 family) |
Protein accession | XP_001386734 |
Protein GI | 126140424 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.525688 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GGAGTAAGGA AATACTCCAT TTCCGAAGCG TTTGGCTCGT TCCTGAATGG CGCAGTTGGC AGTCCCGTTC CCTTTGACTT TGGAGGAAGA AGCTATGTCT CCATGAGCCA TCGTCCCACT AGTGTAAACC ATTTCAATTC TCCCATTGTA AACTCTGGTT CAGAAATCCA CGAGCAAACA GTCAACTTGG CCAACATCGG AGTCACAGAC GATAATGTAG ACCATGAACC ACGTGAAGAA GACGAAACTT CCATAGTGGA GGACTTTTCA ACTTCGCACT CTTCTACCTT ACCCAAACCC ATTCGTGGAA GAGCTGCACC ACAAGCAGAT ATAGCATCAA TAACCAGTAC AACACCATTA ATATCCAAAT CACCTAAGAA TTTATATATT GAAGAGCAAT CGTTGTTCAC GTCATCCGCC AAAATTGGGT TGGATTTGGA AGCTGGTGCC CCAGACCAAA CTGCTATTGG AACTGGTCAA GATGCGACTT TGAAACTCAG ACCCACAGTA TCTGATCACA AGGAAGACCT CAGTCAGCAA CACCCGCCCA GCTTATTGCA TCAGTATGTT GTCAAGCCCA TCCACTACGT GCCCGCTGTC TTCCTTGGTA CACTTTTGAA CATCTTAGAT GGCTTATCGT ATGGGATGAT CATGTTCCCA GTTAGTGAGG CTGTCTTCTC ATCGTTGGCT CCGGCGGGTT TGTCAATGTT TTACATGTCT TGTATTGTTT CTCAATTGGT TTATTCACTT GGAGGATCGG CATTCAGGTC GGGAATAGGT TCGGAAATGA TCGAGGTCAC TCCCTTCTTC CATACAATGG CGCTTTCCAT TGCTTCAGAA ATGGCCAACG AGTCCCAGGC AGCTATAATC GCCACTACTA TCACTTCATA CGCTCTTTCT TCTATAGTCA CGGGGCTTGT ATTCTATATC TTAGGTAAAT GCCGCTTGGG TGTTCTTGTG GGATTCTTTC CTCGTCACAT TCTAGTTGGC TGTATCGGAG GAGTGGGTTA CTTCTTGGTA GCAACCGGTG TAGAAGTCTC GTCCAGATTG GAAGGAGGCT TGGAGTACAA CTACGAGACA TTCAAGTACT TGTTCTGCAA CACATTGACT TTGGCAGAAT GGACTCTCCC ATTGCTTTTA GCTATATTTC TCATTGCTCT TCAGCACAAG TTCCACAACT CTTTGCTTGT TCCCTTGTAT TTCATAGCTG TGTTCATTCT CTTCCATGTT ACAGTCTTAG TCGTTCCTTC TTGGAACTTG CAAAGTGCTA GAAATAGCGG CTGGGTATTC CCGGCAGTGG AAGATAACGA ACCATGGTAT GAATTCTACA CTTATTACAA GTTCAACTTG GTGGATTGGC TTGCCGTACT CAAACAAGTA CCTTCGATGT TGGCATTGAC ATTCTTTGGT ATCTTGCATG TTCCCATTAA CGTTCCAGCA CTTGCTGTAA CGGTGGGTAT GGATGAAGTA GATGTAGACA GAGAATTGGT AGCTCACGGA TATTCAAATG TTTTGTCTGG TTTAGTAGGC TCTATTCAAA ACTACTTGGT GTACACCAAT TCTGTTCTTT TCATCAGAGC AGGTGCCGAT GACCGTTTGG CTGGTGTTTT GCTTGCCATT GCTACTGGCG CAGTGATGAT GACTGGTCCT GTCGTCATTG GCTACATCCC AGTATGTGTG GTTGGAGCTT TGATCTTCCT TTTGGGTTAC GAATTATTGA AGGAATCTGT CTACGATACC TGGGGCCGTT TGAGAAACAT CGAATACACC ACTGTCATCA TCATTGTTAT CACAATGGGT GCCTTTGACT TTGTCTTTGG TATATTGGTA GGTATCTTGC TTGCCTGTCT TTCATTTGTA GTTGAAGCTG GTAGGAGTCC AGTTGTCCAG GGAGTATACT CTGGTTCTGT AGCTAGATCT ACAGTTTTAA GACATCCTAA ACAACAAGAG TTCTTGAAGG ATGTCGGTGA CCAAATCTGC GTTATCAAAT TACAAGGGAC TGTCTTTTTC GGTTCCATTG GTGGAGTTGA AAATGCTATC AGAGGTAAGT TTTCGCAGGA TGAGTTCAAA TTAAATCCCA TCAAATTCTT GATTATCGAT ATGAAAGGAG TTAGTTCCAT TGATTTTTCT GCGGCTGAAG GTTTCAGAAG AGTCCTCAAT TTAACCAATG AATTTAACAC CCGTTTGATT TTCTCCTCCG TTCAAGAAAA CGACGATATC ATCAAGGGTT TGCGTGATGC TGGTTTGTGG GATAATAGTA GCGGAGAAAA TCCAATTGAA TTGTTTAACA CATTGAATTA CGCATTGGAA TGGTGTGAAA ACTCCTTCTT GAGATATTAC AAGACTGTTA AGAGAAAGGA GCAAAATGTC AATATTCGTT CTGTACCGAA CAGTGGCAGC ACCAATCTTT CCCCTGGAAG AAAGTCCATT ATCTCCCCTC AGAACAATTC TCTTAATACC AGACAGTTAA TGAGTATGAA CTTTGACATC GGAACTCCTA GAACTACTCA AGTCTATAAT GCTGCAACTA GAACTGTTCA AGATGAACAG AAGTCACAGA CCAGGTACTA TTCCTCGTCC GACTCGTCGT TCAAAAAACA ACCACTACCA TTAATCATGA TAACTTTCCA AGGCTTATCG GACAAGGACG AAGAGTTCTG GTCGGCTATC ACTCCGTACT TCCAAAAGGA AAAAATCCCA GAAGATATTG AATTCTACAA TACAACCTCT GAGCATGCTG CCTTCTTCAT AGTGGAGTCT GGGTTAATCA GATCCGTTTA CAAGCTAGAA GAGGGACGCG AATTGCATTC AAGTATCTTG CCATTGACTG CGTTTGGAGA TCTTTTTCAA ACAAGACGTT ATAGAAAAAT CAGCTACACC ACTGTTAGCG ATTCCGTAGT GTGGAAGCTT TCTGATTCCA AACTAAGTGA GATGTTGAAG ACCAAAGAAG GGCAATCGTT GTACAATGAG TTGTTGAAGA TCGAGACGGA GTTGGTGAGA GAACGTTTTG ACACCATGAC GGCTAATTTG GTTATTGCTG GTTAA
|
Protein sequence | MSHRPTSVNH FNSPIVNSGS EIHEQTVNLA NIGVTDDNVD HEPREEDETS IVEDFSTSHS STLPKPIRGR AAPQADIASI TSTTPLISKS PKNLYIEEQS LFTSSAKIGL DLEAGAPDQT AIGTDLSQQH PPSLLHQYVV KPIHYVPAVF LGTLLNILDG LSYGMIMFPV SEAVFSSLAP AGLSMFYMSC IVSQLVYSLG GSAFRSGIGS EMIEVTPFFH TMALSIASEM ANESQAAIIA TTITSYALSS IVTGLVFYIL GKCRLGVLVG FFPRHILVGC IGGVGYFLVA TGVEVSSRLE GGLEYNYETF KYLFCNTLTL AEWTLPLLLA IFLIALQHKF HNSLLVPLYF IAVFILFHVT VLVVPSWNLQ SARNSGWVFP AVEDNEPWYE FYTYYKFNLV DWLAVLKQVP SMLALTFFGI LHVPINVPAL AVTVGMDEVD VDRELVAHGY SNVLSGLVGS IQNYLVYTNS VLFIRAGADD RLAGVLLAIA TGAVMMTGPV VIGYIPVCVV GALIFLLGYE LLKESVYDTW GRLRNIEYTT VIIIVITMGA FDFVFGILVG ILLACLSFVV EAGRSPVVQG VYSGSVARST VLRHPKQQEF LKDVGDQICV IKLQGTVFFG SIGGVENAIR GKFSQDEFKL NPIKFLIIDM KGVSSIDFSA AEGFRRVLNL TNEFNTRLIF SSVQENDDII KGLRDAGLWD NSSGENPIEL FNTLNYALEW CENSFLRYYK TVKRKEQNVN IRSVPNSGST NLSPGRKSII SPQNNSLNTR QLMSMNFDIG TPRTTQVYNA ATRTVQDEQK SQTRYYSSSD SSFKKQPLPL IMITFQGLSD KDEEFWSAIT PYFQKEKIPE DIEFYNTTSE HAAFFIVESG LIRSVYKLEE GRELHSSILP LTAFGDLFQT RRYRKISYTT VSDSVVWKLS DSKLSEMLKT KEGQSLYNEL LKIETELVRE RFDTMTANLV IAG
|
| |