Gene PICST_80830 details


Gene Information

Locus tag: PICST_80830
Symbol: SUL2
ID: 4851365
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1615054
End bp: 1618303
Gene Length: 3250 bp
Protein Length: 824 aa
Translation table:
GC content: 43%
IMG OID: 640393073
Product: high affinity sulfate permease
Protein accession: XP_001387542
Protein GI: 126274406
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID: [TIGR00815] high affinity sulphate transporter 1
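
The length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the coding region ends with a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_cds_length(protein_aa: int) -> int:
    """Coding bases for a protein, assuming one trailing stop codon."""
    return (protein_aa + 1) * 3


length = gene_length(1615054, 1618303)
cds = expected_cds_length(824)

print(length)        # 3250, matching the listed Gene Length
print(cds)           # 2475 coding bases for an 824 aa protein
print(length - cds)  # 775 bases outside the coding region
```

Under these assumptions, the 3250 bp gene sequence is longer than the 2475 bp needed to encode the 824 aa protein, which suggests the listed sequence includes flanking non-coding bases on either side of the coding region.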


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
AATCAGTAAG ATAAACCAGC CGTAGCAGCA GCGTTTTAGG CGAAACGTTC ATAACCACAT 
TCTGAGGCAG TCAACACAGG GCAAATTTTG TATAGAGACT TTAACTGGAA ATCACAGCCT
AACCACAAGA ATTCCATCAA TTTCTCAGCC ATTTTGCAAC GACAAACCAG GCTCTCTAAT
AAAAAGAAAA GCTAATTTAA TTGCATTCCC CAGTAGGCAA TTTAAGATAT ATTGGCAATC
AAAAGAAGTC ACCTAAGGCA TCGAAAGCTG CCGAATCGAC CAACAAAAAC CTGGTGTAAT
ACATCCAAAC AAGCCCCCTG TTATCGGAGC AATTAACCGG CGTAGGTTGC AGCAGAAAAT
AAGATCTTCT CAATTTCGCC GTGTCCATTA CCCCCTCTGT AGATTCCAGA TTAAGACGGA
CAATAGAAAT ATCCTGTTGC ATATCTTCCT GAGTATTATT TGCATAGACA GTTCGTATTT
CAGTGTCGCC CTATTGCAAT TCGGGTACTA GCATTTAATC ACTTTCCTTG TCTATAAAGT
CAAGACGATC TGCCCGTTTT CTTACCCAGC TTTTTTCACT TATTATTTTT CCATTGCATT
TTCTCTGCAT AGCATTGCAT ACTAGCCGCT AGCAGCTTTT AGCATCCAAT TCAATATGGC
TGATATTCTC CACGCCTTGA ATTCCAACAA CCCTCCGGAT TCCCAGGTGC CTCATATCTT
GGACGCTGAA GAGATCATCT CGCACAGATC CAAGTCGCAC GAGTTCTCCG ACATCAGAAC
CCACGAATAT GCCTCCTTCG GAAACAACAC CGTCGTTCCC GAGTATACAG AAAGAGAAGT
CACTGTTGTA GACTGGGCCA AACATGTCTT TGGCCATCCC TTGCAGAAGA TTGCCAACTA
CTTCATCTCG TTGTTCCCAA TTGCTAAGTG GATCTTGCAC TATAACGGTA AATGGCTCTA
CGGTGACTTG GTTGCTGGTA TCACTGTTGG TGTCGTGTTG GTTCCTCAGT CTATGTCGTA
CGCTCAGTTG GCTGGATTAG AGGCTCAATT TGGTTTGTAC TCTTCTTTTG TCGGTGTTTT
CATCTACTCG TTCTTTGCCA CGTCGAAGGA TGTTTCCATT GGTCCTGTTG CTGTCATGTC
GCTTCAGGTG TCGAAGGTTA TTGCTCATGT TCAAGGAAAG GTTGGTGACA AGTACGCACC
AGAAGTCATT GCTACCTTCT TGTCTTTGAT CTGTGGTGGT ATCGCAGCAG GTATCGGTAT
CTTGAGACTT GGTTTCATAT TAGAATTCAT CTCGATTCCT GCCGTGATTG GGTTCATGAC
TGGTTCTGCC CTTAACATTA TCTCTGGTCA AGTTCCAGGG CTTATGGGCT TCAACTCGTT
GGTTAACACC AGAGCATCTA CCTACAAGGT TATCATCAAC ACCCTTAAGA ACTTGAAGCA
TAGCAATTCC GACGCCGCCT TTGGTTTGAT TCCTTTGTTC ATCTTGTACG TATGGAAATT
CTCGACTGAC TACGGTCAAA AGAAATACCC AAAGTACAAG TACTGGTTCT TCTACATCCA
GCAGTTGAGA AATGCCATTG TAATTATCGT AGCTACTGCT ATTTCCTGGG GTATTGTACA
TCCCAAGAAG GTAGCCTGGA AAGGCGACCC CAAGAAGTTC AAGGGTCCTA TCTCTACTCT
TGGAACTGTG CCAAGGGGTT TGAGAAATGT TGGTGTCATG ACTGTTCCAG ATGGAATCAT
AGATGCCATG TCGTCAGAAA TTCCAGTCTC TACTGTCATC TTGTTGTTGG AACACATCGC
CATCTCCAAG TCATTTGGTA GAATCAATGA TTACAAAGTT GTTCCTGATC AAGAAGTTAT
TGCTATCGGT GTCACCAACT TGATTGGTAC TTTCTTCAAT GCTTATCCAG CCACCGGTTC
GTTCTCTAGA TCGGCTTTGA AGGCTAAGTG TGGTGTCAGA ACTCCTATTG CTGGTATCTT
TACTGGTGCC GTTGTCTTGT TGGCTTTGTA CGCTTTAACC TCTGCTTTCT TCTACATTCC
TAAGGCAACC CTTAGTGCCA TCATTATCCA TGCTGTGTCG GATTTGATTG CCAACTACAA
AGTCACCTGG TCTCTTTGGA ATATTTCTCC CATTGACTGT GGTGTCTTTA TTGTCTGTGT
CTTGATCACC GTCTTCTCCT CCATCGAAAA CGGTGTCTAT TTCGCTGTCT GTGCCTCCGT
TGCTATTTTG TTATTCAGAA TCGCTAAGCC AACGGGCCAG TTTTTGGGTA GAATCCAAGT
GGCTGAAGTT GTTAACCCCG TCATCGAAAC CTTAGAGAGC GATTTGTACT CTATCAAAAA
AGAGGGTAAC TTGTCTAACT CATCCAATTC TTCCAACGAT GAAATTGAAA TCAACCAAGT
CATCAGCAAC AACTCCAATT ACGACGCCAA GAAGACCAAG TCCCAGACTA AGGGCCAAAA
GTCTGTTGTC AACACCGAAG CTTTGTTGAA GAACAACCCC AGAGTCAAGT TCCACACCAG
ATGGGTTCCT TTGACGAAAG AAAACATCAA TTCTGACATC AATGTCCAGC CTCCTCCTCC
AGGTGTGATT GTTTTCAAGC CTATCGAATC TTTCACCTAC CCTAACTCGT CTTCACAAGT
TGATCGTGTC TCTGATGAAG CCAAGAGATT GACAAGACGT GGTAAGCCAT ATGATCTCTC
AGATACTGGT TCTAGACCTT GGAATGATCC TGGTCCTTTG AGATGGAAGC TTCCCTTCAT
CAAGAACGAG ACAGCCAAGG AAGTCAACGA ACATGACCAC AGACCTTTGT TGCGTATCAT
TCACTTTGAC TTTAGCACTG TTTCTTCTAC CGATGTTACT TCTATCCAAG CCTTGGTTGA
TTTGAGAAAG GCTCTCAACA TCTACGCCGA CCGTGAAGTT GAGTTCCACT TCTCTGGTAT
CTTGTCTCCA TGGATCAGAA GAGGGTTGTT GAATGCTGGT TTTGGTACTT ACGAAGATGG
TTTGGTCAGT GAAAATACCT ACGTCAACAT CGCTGCTGAC CAAGATATCG AAAGAGGCGA
TAACTCCGAA TACGTCGCTG CTATCTCTAC CGATACTCCA TTCTTTCACC TCGATATTCC
TGACTACTAG GTGTTCTTTC CGATAGATTT TCATGTATAA ATAGGGTTCA CGCTTCATTT
TACTTCTTTA ATTGCAACGA TTCTGGGGAT ATAGGTGTAA TACTTACCTA TTATTATTTA
TTTCCACCTC
 
Protein sequence
MADILHALNS NNPPDSQVPH ILDAEEIISH RSKSHEFSDI RTHEYASFGN NTVVPEYTER 
EVTVVDWAKH VFGHPLQKIA NYFISLFPIA KWILHYNGKW LYGDLVAGIT VGVVLVPQSM
SYAQLAGLEA QFGLYSSFVG VFIYSFFATS KDVSIGPVAV MSLQVSKVIA HVQGKVGDKY
APEVIATFLS LICGGIAAGI GILRLGFILE FISIPAVIGF MTGSALNIIS GQVPGLMGFN
SLVNTRASTY KVIINTLKNL KHSNSDAAFG LIPLFILYVW KFSTDYGQKK YPKYKYWFFY
IQQLRNAIVI IVATAISWGI VHPKKVAWKG DPKKFKGPIS TLGTVPRGLR NVGVMTVPDG
IIDAMSSEIP VSTVILLLEH IAISKSFGRI NDYKVVPDQE VIAIGVTNLI GTFFNAYPAT
GSFSRSALKA KCGVRTPIAG IFTGAVVLLA LYALTSAFFY IPKATLSAII IHAVSDLIAN
YKVTWSLWNI SPIDCGVFIV CVLITVFSSI ENGVYFAVCA SVAILLFRIA KPTGQFLGRI
QVAEVVNPVI ETLESDLYSI KKEGNLSNSS NSSNDEIEIN QVISNNSNYD AKKTKSQTKG
QKSVVNTEAL LKNNPRVKFH TRWVPLTKEN INSDINVQPP PPGVIVFKPI ESFTYPNSSS
QVDRVSDEAK RLTRRGKPYD LSDTGSRPWN DPGPLRWKLP FIKNETAKEV NEHDHRPLLR
IIHFDFSTVS STDVTSIQAL VDLRKALNIY ADREVEFHFS GILSPWIRRG LLNAGFGTYE
DGLVSENTYV NIAADQDIER GDNSEYVAAI STDTPFFHLD IPDY
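
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before they can be counted or analyzed. The following is a minimal sketch, assuming the only formatting to remove is whitespace; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is the final line of the protein sequence.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# Final line of the protein sequence as displayed in the record.
last_line = "DGLVSENTYV NIAADQDIER GDNSEYVAAI STDTPFFHLD IPDY"
residues = clean_sequence(last_line)

print(residues)       # DGLVSENTYVNIAADQDIERGDNSEYVAAISTDTPFFHLDIPDY
print(len(residues))  # 44
```

Applying `clean_sequence` to the full gene or protein block and taking `len()` gives a value that can be compared with the Gene Length and Protein Length fields, and `gc_fraction` on the cleaned gene sequence can be compared with the listed GC content.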