Gene PICST_51416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_51416 
SymbolSUL3 
ID4851342 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1545872 
End bp1548247 
Gene Length2376 bp 
Protein Length781 aa 
Translation table 
GC content44% 
IMG OID640393050 
Productputative sulfate transporter 
Protein accessionXP_001387941 
Protein GI126274383 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.549171 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCAG ACGAAACTAC ACGGCTACTC CGTAATGCGA ACTCATCTCC AGTTCCAGCT 
ATCCGACTAG AGTCTGGGCC AACAGCTTCT CGACTTCCAT CCAGACCACC ACTTTTTTCT
ATTCCTTCGT CGCAATCTGT GCGCAGCTAC AAATCGTTGA ACACCGTTAA TGATCCCATA
GTTGATTCCT ACCTCAATGC CAATTCGAAC AGCAATAGCA ATATAAACAA CGCTGGCTAT
GGAGCCAATA ACAGCGATGA CTCTTCGTCC ATTGCTCGGT TCAATGTCTT CAAAAACGCC
AGTTCCGACT TTGATTTCAA AGCCTACGTC GCTTACTATT TGCCGATCCT TAATTGGTTG
CCCAACTACG ATGCCAAAAA CAGTTTGCTT GGAGACACCC TTGCCGGACT CTCGTTGGCA
TCTTTTCAGA TGCCTCTTGT AATGTCTATA GCTACCTCAC TAGCCCATTT ACCGCCTGTG
ACTGGTCTCT ACTCAATAAT TGCAGGAGCC ACGGTTTATG CCATTTTTGG CTGTGTTCCA
GTTCTAGTCG TCGGACCATC GCCGTCTTCA GCTATTATAT ATGGCCAGGT CATAGAGCAG
ATCCGCCATG CTGGCCTCTT TGAGTCGTTT ACTCAGTTAG AGTTGTCATC TGCGATGTCA
TTTTCTCTAA GTGCTGTGCT TCTTGGAGCT GGGTTTCTTC GTTTTGGTTA CTTAGATAAT
GTTCTTAGTA GAGCCCTTCT CAAGGGATTC ATTGCTGCCA TGGGCTTTAT TATGATCATA
AATGAGTTTA GCTCAGAGTT GGGCATGCTC GAGTTATCTA AGACTCAGCC TCATCTCACC
AACGTCGATA AGGTCATGTT TGTAGTTCGC AACTGGCGCA AAACACATGT TCTTTCAGCT
TGCATTTCAG GTATCACTCT CGCTATTGTT CTTGCTGTAA GATATGTAAA GGGCATTCTA
GTCAGTAAAC ATAATTTTAA ACTGGCTGTG TACTTTCCGG AACTAATGTT GATGGTCGTG
ATCACAACTA TTCTTTGTCG GTACTATCGT TGGGACTTGC AAGGAGTCGA CATCGTTGGT
GATATTGCTA GCGGCTCTTC AGCATCTTTA CATATCATCA ATCCTATCGA TGTCAGCAAA
TTGGCATTGT ACAAACATAC TTTCCACGCT GCATTTTTGT GCACCATATT GGGCTACTTC
GACTCCACCA CTGCAACCAA AGCATTGGGA GCAAAGTACA ACTACAATGT TTCATCCAAC
CGTGAGTTGA TAGCCTTGGG TTCAACTAAT CTTGTTGTCA GTTTGGTTAG TGGTATGCCT
TCCTTTGGTG CCTTTGGAAG ATCCAAAATC AATTTGTTAG CTGGTGCAAC TACACCTATG
GCAGGTCTCA TTATGGCCGC AGTAACTTTA TTGACCATTT CATATCTCTT GCCGCTTATT
CGCTTTCTTC CTGAGTGTGT ATTGGCATTA ACAACGACAA TAATTGGTAT TACCGTTTTG
CAAGAGGTCC CACACGATCT TCAATTCTTC TGGAACATCC GAGGCTACGA TGAGCTCACA
GTATTTTTTC TCGTCATGTT GACTACAGTA TTCTGGAGTG CTCAAGCGGG CTTGACACTA
GGGGTATTGG TTGCCATCGC AAGAGTTATA AAACACAGCA GCCGATCGAG GATACAAATT
ATGGGTAGAG TACCTAACAC CAACGTATTC AGAAATGCTG ATACTTTGAT AGAAGAAAGC
TTTGCGGCAT TTGACGAATC TGTTGGAAGC ACAGTGAATT CACCGAATAT GTCAGTAGAA
AACTTGACAG CAAACATGAG TCCCGACTTC CACACCAGCA GTACCGATAA GTATTCTGCC
CTTGTGGATG AAATTGAGCA GATTGAAGGG GTTCTTCTCA TAAAAATCCC TGAACCTTTG
AACTTTGCTA ATGTCAGCAA CTTGAAAAGC AAACTTAGTC GAATCGAAAA ATACGGAACA
TTGTTGGTGC ATCCCTCGCA ACCCACAAGA AGAGACTTCA ATAACAATAC CATCAAATTC
ATCATCTTCG ATTGCAAGGG AATGAACTCT ATCGACTCGT CTGCTACCCA AGTACTCTAC
GAAGTTGTGA GAAAGTATAC CCAAGAGGAT AAAATTAATG TGTGCTTCTC ACGAGTTCCG
GCAGATGCAG TGGTGAGAGA TAAGTTTCGG ATGTCGGGGA TTACTGAGAT GATTAATGGC
AGTTACCATA GCTATAGTGT GACTCGAAGC AACAATTCCC TTACCAACAT GTCGCTGCTT
GAATTTTCGT ATCCTGTGTC TTTGTCAGGT ATGGGTGACG GATTTTTTCT CAGTATCGAC
CAGGCGCTCA AGTCGATCGA CTTACAAAAT GTATAG
 
Protein sequence
MNPDETTRLL RNANSSPVPA IRLESGPTAS RLPSRPPLFS IPSSQSVRSY KSLNTVNDPI 
VDSYLNANSN SNSNINNAGY GANNSDDSSS IARFNVFKNA SSDFDFKAYV AYYLPILNWL
PNYDAKNSLL GDTLAGLSLA SFQMPLVMSI ATSLAHLPPV TGLYSIIAGA TVYAIFGCVP
VLVVGPSPSS AIIYGQVIEQ IRHAGLFESF TQLELSSAMS FSLSAVLLGA GFLRFGYLDN
VLSRALLKGF IAAMGFIMII NEFSSELGML ELSKTQPHLT NVDKVMFVVR NWRKTHVLSA
CISGITLAIV LAVRYVKGIL VSKHNFKLAV YFPELMLMVV ITTILCRYYR WDLQGVDIVG
DIASGSSASL HIINPIDVSK LALYKHTFHA AFLCTILGYF DSTTATKALG AKYNYNVSSN
RELIALGSTN LVVSLVSGMP SFGAFGRSKI NLLAGATTPM AGLIMAAVTL LTISYLLPLI
RFLPECVLAL TTTIIGITVL QEVPHDLQFF WNIRGYDELT VFFLVMLTTV FWSAQAGLTL
GVLVAIARVI KHSSRSRIQI MGRVPNTNVF RNADTLIEES FAAFDESVGS TVNSPNIPDF
HTSSTDKYSA LVDEIEQIEG VLLIKIPEPL NFANVSNLKS KLSRIEKYGT LLVHPSQPTR
RDFNNNTIKF IIFDCKGMNS IDSSATQVLY EVVRKYTQED KINVCFSRVP ADAVVRDKFR
MSGITEMING SYHSYSVTRS NNSLTNMSLL EFSYPVSLSG MGDGFFLSID QALKSIDLQN
V