Gene Information |
Locus tag | PICST_32535 |
Symbol | SUL4 |
ID | 4839771 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 45623 |
End bp | 48268 |
Gene Length | 2646 bp |
Protein Length | 847 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640391086 |
Product | Putative sulfate transporter |
Protein accession | XP_001385342 |
Protein GI | 150865928 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.162178 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTAGTC CCAATCCTTC TATTCGTCAC ATCTCCAGCA AAAGAGTCAA CGCCGTCAGC TTCGATCGCC AACTGCTTCT TGACAGAGAG AGCCCACAAC AGGACTATCT GGGCATAACA AATTCACTAT CGCCACGGAG TTCCAATACC AAATGGGGAT CGACTCCCAT GGGAGGTTTC TCATCCCCTC CTACTAAGAA CATAGCACCT GCTTTACACA GACACCTTGT AGATACTGCT GATGTAGAAT CTGTTGTTTA TAGAAGTACA CCTACGGTTC ACCATTCCAG CAGTCTTCAC TATGTTAATC CTCCTCCTCC AGGTTCAGAT TCAACATCTC CTTCAGGCGA TTACAACGAA GGCTACAGCG ACTATATGCC AGCAACTACG AGACACCTTG AGAACTTGGA GTGGAACGAC ATTCTTCCAT ACTACTTGCC ATGCATGTCA TGGATAAAAG AGTACAATTT TCGCTATTTT GTTGGGGACC TCATCGGTGG ATTGACGTTG GTCTTTTTCC AACTACCACT TTCGCTTTCT TACGCAACAA CCTTGGCGAA AGTGCCAGTA CTCTCAGGTT TGTTGTCGCT AGGTGTGACT CCCTTGATCT ACGCTGTATT CGGAAGTGTT CCCCAGATGG TTGTTGGTCC AGAAGGTCCT ATATCGCTTA TTGTTGGCCA GGCTGTGGAG CCGCTTTTGC ACCATTCCAA CAAGAAGCAT TTGGACCCGC TTGAGTTTGT AGTGGCTATA ACTTTTGTGA GTGGAGCGAG TCTTCTCGGG TTTGGCTTAG GAAGATTTGG CTTCTTAGAT AACGTTCTTT CTGCTTCTTT GTTGAAGGGA TTCATAAGTG GAGTCGGTAT CGTTATGGTT ATAAACGCTT CGATCGTAGT TTGTGGATTA GAGAAACTAC TTCAGGAAAT TGCCGATGAC CCCAATGAAA TGGATATCCA TTCCCCTTTT GACAAAGTAC GATTCTTCAT TCACCACTAC CAAGAAACAA ACCCTTTGAC TTTTAAGATC AGTATGACAG GGTTTGTCAT CATTATGGCA TTGAGAATAT TCAAGAAATA TGCCGACAAA AGGCCCGATA AAAGATTCAG AAATGCAGCC TACATCCCAG AAATCTTATT GGTTGTCAGT ATTTCAACAT ATCTTTGTTC GAAGTTGCGC TGGGATCTCG ATGGTATTTC TATCATTGGA AAGATCAAAA ATGACGGCCT GGTAAAGCTT TATAACCCAT TCTCCAAGAA GATCTTGCCA TTGTACAAAA CCCTCAGTAC ATCGGGATTC TTGTGCGCAA TGTTAGGTTT CTTTGAATCT ACCACTGCAT CTAAATCATT AGGCTCTACT TACGATTTGC CCATTTCTTC CAACAGAGAG TTGGTTGCTT TAGGGTTCAT CAATATTGTG GCATCTCTGT TTGGTGGATT ACCTTCTTTC GGAGGTTATG GAAGAAGTAA GATCAATGCA ATGTCAGCCA AGACCACCAT GCTGGGTGCT ATAATGGGGA TTTGTACCTT ATTTACCGTT TTCTTTCTTC TCGATTACCT TTACTTTGTT CCTGAGTGTA TGCTCAGTGT GATTACAGCA GTGATCGGGA TCCTGTTGAT AGAAGAAGCC CCATATGAGT TATATTTCCA TTGGCAAAGT GGGGGATACA ACGAGTTGAT CACATTCGCT GTTACTGTAA TGACTACTTT GTTTTTCTCG ATGGAAGGAG GTATTGCTGT TGGATTGGTG TATCTGTTGA TTCGAGTAAT TAGACATTCT GCGGAATCCA GAATTCAGAT TTTGGGTAGA 
TACCCAGGTT CGAACACTTT CCTTGATGCC GATATTCCTG ATGCTTCCCT TCTTCATCTC CAAGTTCCAG ATAGCATAGA TGGACAGCTT AACGGAGTTG GTTCTTCAAG CTTTAGTGGA TCTGAGAAGT TGCTCAACTC ACAATTGAAT GTATTTGCTG ATGGAAATTT CACGCACTTG AATACTCATG TACTTGAAGA GATCGAGGGT TGTTTGATTA TCAAGATTCC AGAACCATTG ACATTCACCA ACAGTAGCGA CTTACGGACC AGACTTCAGA GAGTTGAAAT GTATGGATCT ACCAAAGCTC ATCCTGCTCT GAAGAGAAGT AGAAGTCCAG CGATGACAAA GTACATTATC TTTGATTTGC ATGGCATGAC AGATATTGAT TCGTCTGCAG CAAAGATATT GACCGAACTA CTTACAAGCT ACAAGAGAAG GAATATCCAT TCATTTTTTG TTAGAGTGAA CAAGAATGCC AGATTGAGAA TACGATTACG CAAGACAGGC ATAGTACAGA TGCTTCTTGA TGATTTAGAG GATGTAAAAT ACTTTGAAGC TCAAAAGAGA TCTGTGTTCT CTCGGATGCG AGGACGTCGA AGCTTCTCTG AAATTTCAGC CGAATACAAT GAAGAAGAAA TAGCTAATAT CCCTGTGGAT TTAGACGATA CTGCCGAAAT TTACGATTTG ATAGAAAGCA ACGAAGAGCC CTACTTCAAC CATATCAGTG ATGCGTTGAA GGTCATTGAT TTCTACGAAG TGAATGAATG TACTCGTACC AGCGAGTACT TAGAAGTGGA TCAGATGGAG AGACGATCCA GTCTCCCAGA AGATATTCTT GTTTGA
|
Protein sequence | MTSPNPSIRH ISSKRVNAVS FDRQSLLDRE SPQQDYSGIT NSLSPRSFSS PPTKNIAPAL HRHLVDTADV ESVVYRSTPT VHHSSSLHYV NPPPPGSDST SPSGDYNEGY SDYMPATTRH LENLEWNDIL PYYLPCMSWI KEYNFRYFVG DLIGGLTLVF FQLPLSLSYA TTLAKVPVLS GLLSLGVTPL IYAVFGSVPQ MVVGPEGPIS LIVGQAVEPL LHHSNKKHLD PLEFVVAITF VSGASLLGFG LGRFGFLDNV LSASLLKGFI SGVGIVMVIN ASIVVCGLEK LLQEIADDPN EMDIHSPFDK VRFFIHHYQE TNPLTFKISM TGFVIIMALR IFKKYADKRP DKRFRNAAYI PEILLVVSIS TYLCSKLRWD LDGISIIGKI KNDGSVKLYN PFSKKILPLY KTLSTSGFLC AMLGFFESTT ASKSLGSTYD LPISSNRELV ALGFINIVAS SFGGLPSFGG YGRSKINAMS AKTTMSGAIM GICTLFTVFF LLDYLYFVPE CMLSVITAVI GISLIEEAPY ELYFHWQSGG YNELITFAVT VMTTLFFSME GGIAVGLVYS LIRVIRHSAE SRIQILGRYP GSNTFLDADI PDASLLHLQL LNSQLNVFAD GNFTHLNTHV LEEIEGCLII KIPEPLTFTN SSDLRTRLQR VEMYGSTKAH PASKRSRSPA MTKYIIFDLH GMTDIDSSAA KILTELLTSY KRRNIHSFFV RVNKNARLRI RLRKTGIVQM LLDDLEDVKY FEAQKRSVFS RMRGRRSFSE ISAEYNEEEI ANIPVDLDDT AEIYDLIESN EEPYFNHISD ALKVIDFYEV NECTRTSEYL EVDQMERRSS LPEDILV
|
| |
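The record above lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The following is a minimal Python sketch, not part of the original record, that illustrates this on the first 90 bases of the gene sequence and recomputes the gene length from the start and end coordinates. The function names `build_table`, `translate`, and `gene_length` are hypothetical helpers introduced here for illustration. The sketch assumes that the first 90 bases contain no intron, which is consistent with the first 30 residues of the listed protein sequence.

```python
from itertools import product

# NCBI orders codons with bases T, C, A, G; the first base varies slowest.
BASES = "TCAG"
# Amino acids of the standard code (table 1) in that codon order.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(ctg_as_serine=True):
    """Return a codon -> amino acid dict.

    With ctg_as_serine=True this mimics translation table 12,
    whose only difference from the standard code is CTG -> Ser.
    """
    codons = ["".join(c) for c in product(BASES, repeat=3)]
    table = dict(zip(codons, STANDARD_AAS))
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(dna, table):
    """Translate a DNA string codon by codon, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# First 90 bases of the gene sequence, copied from the record (spaces kept as listed).
FIRST_90 = (
    "ATGACTAGTC CCAATCCTTC TATTCGTCAC ATCTCCAGCA AAAGAGTCAA "
    "CGCCGTCAGC TTCGATCGCC AACTGCTTCT TGACAGAGAG"
)

if __name__ == "__main__":
    print(translate(FIRST_90, build_table(ctg_as_serine=True)))
    print(translate(FIRST_90, build_table(ctg_as_serine=False)))
    print(gene_length(45623, 48268))
```

The two translations differ only at residue 25, where the CTG codon yields S under table 12 and L under the standard code. Because the gene is marked as spliced, translating the full genomic sequence this way would not reproduce the 847 aa protein; introns would have to be removed first.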