Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_80830 |
Symbol | SUL2 |
ID | 4851365 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 1615054 |
End bp | 1618303 |
Gene Length | 3250 bp |
Protein Length | 824 aa |
Translation table | |
GC content | 43% |
IMG OID | 640393073 |
Product | high affinity sulfate permease |
Protein accession | XP_001387542 |
Protein GI | 126274406 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | [TIGR00815] high affinity sulphate transporter 1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AATCAGTAAG ATAAACCAGC CGTAGCAGCA GCGTTTTAGG CGAAACGTTC ATAACCACAT TCTGAGGCAG TCAACACAGG GCAAATTTTG TATAGAGACT TTAACTGGAA ATCACAGCCT AACCACAAGA ATTCCATCAA TTTCTCAGCC ATTTTGCAAC GACAAACCAG GCTCTCTAAT AAAAAGAAAA GCTAATTTAA TTGCATTCCC CAGTAGGCAA TTTAAGATAT ATTGGCAATC AAAAGAAGTC ACCTAAGGCA TCGAAAGCTG CCGAATCGAC CAACAAAAAC CTGGTGTAAT ACATCCAAAC AAGCCCCCTG TTATCGGAGC AATTAACCGG CGTAGGTTGC AGCAGAAAAT AAGATCTTCT CAATTTCGCC GTGTCCATTA CCCCCTCTGT AGATTCCAGA TTAAGACGGA CAATAGAAAT ATCCTGTTGC ATATCTTCCT GAGTATTATT TGCATAGACA GTTCGTATTT CAGTGTCGCC CTATTGCAAT TCGGGTACTA GCATTTAATC ACTTTCCTTG TCTATAAAGT CAAGACGATC TGCCCGTTTT CTTACCCAGC TTTTTTCACT TATTATTTTT CCATTGCATT TTCTCTGCAT AGCATTGCAT ACTAGCCGCT AGCAGCTTTT AGCATCCAAT TCAATATGGC TGATATTCTC CACGCCTTGA ATTCCAACAA CCCTCCGGAT TCCCAGGTGC CTCATATCTT GGACGCTGAA GAGATCATCT CGCACAGATC CAAGTCGCAC GAGTTCTCCG ACATCAGAAC CCACGAATAT GCCTCCTTCG GAAACAACAC CGTCGTTCCC GAGTATACAG AAAGAGAAGT CACTGTTGTA GACTGGGCCA AACATGTCTT TGGCCATCCC TTGCAGAAGA TTGCCAACTA CTTCATCTCG TTGTTCCCAA TTGCTAAGTG GATCTTGCAC TATAACGGTA AATGGCTCTA CGGTGACTTG GTTGCTGGTA TCACTGTTGG TGTCGTGTTG GTTCCTCAGT CTATGTCGTA CGCTCAGTTG GCTGGATTAG AGGCTCAATT TGGTTTGTAC TCTTCTTTTG TCGGTGTTTT CATCTACTCG TTCTTTGCCA CGTCGAAGGA TGTTTCCATT GGTCCTGTTG CTGTCATGTC GCTTCAGGTG TCGAAGGTTA TTGCTCATGT TCAAGGAAAG GTTGGTGACA AGTACGCACC AGAAGTCATT GCTACCTTCT TGTCTTTGAT CTGTGGTGGT ATCGCAGCAG GTATCGGTAT CTTGAGACTT GGTTTCATAT TAGAATTCAT CTCGATTCCT GCCGTGATTG GGTTCATGAC TGGTTCTGCC CTTAACATTA TCTCTGGTCA AGTTCCAGGG CTTATGGGCT TCAACTCGTT GGTTAACACC AGAGCATCTA CCTACAAGGT TATCATCAAC ACCCTTAAGA ACTTGAAGCA TAGCAATTCC GACGCCGCCT TTGGTTTGAT TCCTTTGTTC ATCTTGTACG TATGGAAATT CTCGACTGAC TACGGTCAAA AGAAATACCC AAAGTACAAG TACTGGTTCT TCTACATCCA GCAGTTGAGA AATGCCATTG TAATTATCGT AGCTACTGCT ATTTCCTGGG GTATTGTACA TCCCAAGAAG GTAGCCTGGA AAGGCGACCC CAAGAAGTTC AAGGGTCCTA TCTCTACTCT TGGAACTGTG CCAAGGGGTT TGAGAAATGT TGGTGTCATG ACTGTTCCAG ATGGAATCAT AGATGCCATG TCGTCAGAAA TTCCAGTCTC TACTGTCATC TTGTTGTTGG AACACATCGC CATCTCCAAG TCATTTGGTA GAATCAATGA TTACAAAGTT GTTCCTGATC AAGAAGTTAT TGCTATCGGT GTCACCAACT TGATTGGTAC TTTCTTCAAT GCTTATCCAG CCACCGGTTC GTTCTCTAGA TCGGCTTTGA AGGCTAAGTG TGGTGTCAGA ACTCCTATTG CTGGTATCTT TACTGGTGCC GTTGTCTTGT TGGCTTTGTA CGCTTTAACC TCTGCTTTCT TCTACATTCC TAAGGCAACC CTTAGTGCCA TCATTATCCA TGCTGTGTCG GATTTGATTG CCAACTACAA AGTCACCTGG TCTCTTTGGA ATATTTCTCC CATTGACTGT GGTGTCTTTA TTGTCTGTGT CTTGATCACC GTCTTCTCCT CCATCGAAAA CGGTGTCTAT TTCGCTGTCT GTGCCTCCGT TGCTATTTTG TTATTCAGAA TCGCTAAGCC AACGGGCCAG TTTTTGGGTA GAATCCAAGT GGCTGAAGTT GTTAACCCCG TCATCGAAAC CTTAGAGAGC GATTTGTACT CTATCAAAAA AGAGGGTAAC TTGTCTAACT CATCCAATTC TTCCAACGAT GAAATTGAAA TCAACCAAGT CATCAGCAAC AACTCCAATT ACGACGCCAA GAAGACCAAG TCCCAGACTA AGGGCCAAAA GTCTGTTGTC AACACCGAAG CTTTGTTGAA GAACAACCCC AGAGTCAAGT TCCACACCAG ATGGGTTCCT TTGACGAAAG AAAACATCAA TTCTGACATC AATGTCCAGC CTCCTCCTCC AGGTGTGATT GTTTTCAAGC CTATCGAATC TTTCACCTAC CCTAACTCGT CTTCACAAGT TGATCGTGTC TCTGATGAAG CCAAGAGATT GACAAGACGT GGTAAGCCAT ATGATCTCTC AGATACTGGT TCTAGACCTT GGAATGATCC TGGTCCTTTG AGATGGAAGC TTCCCTTCAT CAAGAACGAG ACAGCCAAGG AAGTCAACGA ACATGACCAC AGACCTTTGT TGCGTATCAT TCACTTTGAC TTTAGCACTG TTTCTTCTAC CGATGTTACT TCTATCCAAG CCTTGGTTGA TTTGAGAAAG GCTCTCAACA TCTACGCCGA CCGTGAAGTT GAGTTCCACT TCTCTGGTAT CTTGTCTCCA TGGATCAGAA GAGGGTTGTT GAATGCTGGT TTTGGTACTT ACGAAGATGG TTTGGTCAGT GAAAATACCT ACGTCAACAT CGCTGCTGAC CAAGATATCG AAAGAGGCGA TAACTCCGAA TACGTCGCTG CTATCTCTAC CGATACTCCA TTCTTTCACC TCGATATTCC TGACTACTAG GTGTTCTTTC CGATAGATTT TCATGTATAA ATAGGGTTCA CGCTTCATTT TACTTCTTTA ATTGCAACGA TTCTGGGGAT ATAGGTGTAA TACTTACCTA TTATTATTTA TTTCCACCTC
|
Protein sequence | MADILHALNS NNPPDSQVPH ILDAEEIISH RSKSHEFSDI RTHEYASFGN NTVVPEYTER EVTVVDWAKH VFGHPLQKIA NYFISLFPIA KWILHYNGKW LYGDLVAGIT VGVVLVPQSM SYAQLAGLEA QFGLYSSFVG VFIYSFFATS KDVSIGPVAV MSLQVSKVIA HVQGKVGDKY APEVIATFLS LICGGIAAGI GILRLGFILE FISIPAVIGF MTGSALNIIS GQVPGLMGFN SLVNTRASTY KVIINTLKNL KHSNSDAAFG LIPLFILYVW KFSTDYGQKK YPKYKYWFFY IQQLRNAIVI IVATAISWGI VHPKKVAWKG DPKKFKGPIS TLGTVPRGLR NVGVMTVPDG IIDAMSSEIP VSTVILLLEH IAISKSFGRI NDYKVVPDQE VIAIGVTNLI GTFFNAYPAT GSFSRSALKA KCGVRTPIAG IFTGAVVLLA LYALTSAFFY IPKATLSAII IHAVSDLIAN YKVTWSLWNI SPIDCGVFIV CVLITVFSSI ENGVYFAVCA SVAILLFRIA KPTGQFLGRI QVAEVVNPVI ETLESDLYSI KKEGNLSNSS NSSNDEIEIN QVISNNSNYD AKKTKSQTKG QKSVVNTEAL LKNNPRVKFH TRWVPLTKEN INSDINVQPP PPGVIVFKPI ESFTYPNSSS QVDRVSDEAK RLTRRGKPYD LSDTGSRPWN DPGPLRWKLP FIKNETAKEV NEHDHRPLLR IIHFDFSTVS STDVTSIQAL VDLRKALNIY ADREVEFHFS GILSPWIRRG LLNAGFGTYE DGLVSENTYV NIAADQDIER GDNSEYVAAI STDTPFFHLD IPDY
|
| |