Gene PICST_83361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_83361 
SymbolPHO91 
ID4838906 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1656072 
End bp1658957 
Gene Length2886 bp 
Protein Length837 aa 
Translation table12 
GC content45% 
IMG OID640390221 
Productlow-affinity phosphate transporter 
Protein accessionXP_001384269 
Protein GI150865165 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID[TIGR00785] anion transporter 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATCCTAGCGA TATCCTTCCA GAGCAGCCAA TATCTCATTT ATCCTGTGAA TCCTACTAAG 
AATTATTGAT ATTGTCATAA TTACAACCAT ACTTCTATCT ATTCTATCAA TAGGAGATAT
TCCACAATCA CATCTCGCAA TCTGAGAAAG CACATACTTC AACAAAATAC AACAGTTCAA
CAAAATCAAC AATATCGTAT AGACTATCGT AGATATGAAG TTCTCCCATT CATTGCAGTT
TAATGCTGTG CCCGAATGGC TGTCTAAATA CATCGCCTAT ACTACCCTAA AGAAGCTCAT
CTACTCGCTC CAGAGAGACA ACCTCAGGAG ATCTCACCAG TCAGTGGTTT CGCAGGATGA
AGACCTCGAA GCTGGTGCAC ATCTCATGGC TGACGAATCA CAGACGTATG GAGCCACGGC
TGGTGCTGAA TCTCCGTCGT CAGTGTTTCT TGCTGCTCTC GATGCTGAGT TGAAGAAGAT
CGATGACTTC TACCAGCTCC AGGAAGCGTT CATCTTCAAG AGTATTGACG AAATTGTCAA
CGACATCGAA AACTTTGAAC ACGAGCTTGA TGGGTCGTCC ATGCTCAATG GCAAAAACCT
GAACTTCTTG AATCTGAAAC TGAATCACTT GAGACAGACC TCCGATGCCC TGAACGAAAA
CTACATCACC AACACTACTG ACCCCGATAC GGAATTTACA CAGCCAGAAG AAGAAGACGA
CGATGACGAT GACGATATGC GCGACTTTGT CGGCCACGAC CAGGCCAGCA GGGGCAGGAA
GAAGCTGAAT ATTTCGATTG GTTCACAATT GGTTAAGTCT CGTTCCATCG ATGAGTATCT
CCGTTCACCA AAATCGCCCA AGATCTGGAA TAATATTAAC AATGCTGCTT CCTCTTCGGC
CAATTTGCCT CCACAGTTGA TTTTGCTCAG TGAAAGCAGA ATTATCTTGA GAAAGAGAAT
TATCGGTCTT TTCACGACGT TGTCTGAATT GAAGTCTTAC ATCGAGTTGA ACCAGACTGG
TTTCAAAAAA GCGTTGAAGA AGTTCGATAA GTCGCTCGAC ACAAACTTGA AAGATGGTTA
CTTGGACAGT TTGCCAAAGA GATCGTACAT CTTCCAGGAT TCTACCATTG ATAGAGTCAA
CGACCGTTTG CAATCCTTAA CAAGACTCTA TGCCTTGATC TGTAATCATG GTGATGATTT
GGAGGCTGCA AAATCTGAGT TGTCGATCCA TTTGCGTGAG CATGTCGTGT GGGAAAGAAA
CACTGTGTGG AGAGACATGA TTGGTTTGGA AAGAAAGACG TATGCTGCCA ACTCGAAGCT
GATATCTGTA GCTGACCGTC TCAGAATGGA AAAGAGTGCT GAAGATGGTG GAAAGCTTGA
AGGGCAACTT ATCCACAAGT TCAATATCTC GCTTAAAAAC CCTGTCTTGA TCAAGTTGGT
TTTGATAGCC ACGGTCACGA TCGTGTTGTT GAACTTTTCT CCTTTTGCTG ATAGAGCCCA
GAAGAACTGC TTTGCATTGT TGATCTGTTC TTCTCTTTTG TGGGCTACTG AAGCCATCCC
CCTTTTTGTA ACCTCTTTGT TGATCCCTTT GTTGATTGTC AATCTCGGCG TGTTGAAAAA
TGATGATGGC TCAGACATGG ACTCTGTAGA CTCTTCGAAG TTTATATTAT CGACTATGTG
GAACTCCGTC ATATTGTTGT TATTGGGAGG TTTTACGTTG GCAGCGGCAT TGTCGAAATA
CCACATCGCC AAGTTGATAT CTACGTGGAT CTTGTCCAAG GCTGGTACCA ACCCTTCCGT
TGTTTTGCTT ACAATCATGG GTGTGGCCTT ATTTGCTTCC ATGTGGGTTT CGAATGTCGC
TGCGCCAGTG CTTTGTTTCT CACTTATTCA GCCATTATTG AGAACATTGC CTAAGGGCTC
CCAGTTCATC AATGCCCTTA TCTTGGGTAT AGCTCTAGCA TCCAACGTCG GAGGTATGGC
TTCTCCGATC GCTTCTCCCC AGAACATGAT AGCCATAGGG GCCATGGACC CACAGCCTTC
GTGGGGCCAA TGGTTCGTCA TTGCATTGCC TGTTTGTGTT TTGTCGTTGT TGCTCATCTG
GATCTTCTTG TTGATAACAT TCAACTGTAA CTCTAGAAAC ACCCATCTTG TTCCCATCAG
AACCATTGAT AACAAGTTTT CAGGAGTGCA GTGGTTCATC ATCTCCATTT CTGTTCTAAC
CATCTTCCTC TGGTGCATTG CTTCGCGTAT CACCGGCATC TTTGGTGAAA TGGGGATCAT
AGCTCTTCTT CCAATCATCT TGTTCTTTGG CTCGGGCTTG TTGTCTACAG AAGATTTCAA
CAACTACCCT TGGAACATTG TCGTGTTGGC TATGGGTGGT ACTGCTTTAG GTAAAGCTGT
TGCGTCATCT GGATTGTTGA ATACCATAGC CGTTGCTATC CAGAAACAAG TAGAGGACTT
CTCTTTGTTT GGTGTAGCAC TTACCTTTGG CTTCTTGATC TTGACAATGG CCACTTTTGT
TTCGCACACG GTGGCTGCGC TCATCATCAT CCCATTGGTC TCTGAAATTG GTAACAACAT
GGAAGAGCCC CATCCCAGAC TCCTCATCAT GATTAGTGCC TTATTGTGTT CCGCAGCCAT
GGGATTGCCT ACATCCGGGT TCCCTAATGT TACTGCCATT TGCATGACCG ACGAGTTTGG
TAAGCCGTAC TTGACCGTGG GGACGTTTAT CACCCGTGGT GTGCCAAGTT CAGTTATAGC
CTACGCCATC ATTGTGTCCA TCGGCTACGC TACGATGAAG TTGATCAACT TCTAAACTGA
TCGAATACCC CGCCGTAAGG CCAGACTACT ACATAGTAAA CTCATAATAC ATATCTGACA
TAAAGG
 
Protein sequence
MKFSHSLQFN AVPEWSSKYI AYTTLKKLIY SLQRDNLRRS HQSVVSQDED LEAGAHLMAD 
ESQTYGATAG AESPSSVFLA ALDAELKKID DFYQLQEAFI FKSIDEIVND IENFEHELDG
SSMLNGKNSN FLNSKSNHLR QTSDASNENY ITNTTDPDTE FTQPEEEDDD DDDDMRDFYL
RSPKSPKIWN NINNAASSSA NLPPQLILLS ESRIILRKRI IGLFTTLSEL KSYIELNQTG
FKKALKKFDK SLDTNLKDGY LDSLPKRSYI FQDSTIDRVN DRLQSLTRLY ALICNHGDDL
EAAKSELSIH LREHVVWERN TVWRDMIGLE RKTYAANSKS ISVADRLRME KSAEDGGKLE
GQLIHKFNIS LKNPVLIKLV LIATVTIVLL NFSPFADRAQ KNCFALLICS SLLWATEAIP
LFVTSLLIPL LIVNLGVLKN DDGSDMDSVD SSKFILSTMW NSVILLLLGG FTLAAALSKY
HIAKLISTWI LSKAGTNPSV VLLTIMGVAL FASMWVSNVA APVLCFSLIQ PLLRTLPKGS
QFINALILGI ALASNVGGMA SPIASPQNMI AIGAMDPQPS WGQWFVIALP VCVLSLLLIW
IFLLITFNCN SRNTHLVPIR TIDNKFSGVQ WFIISISVLT IFLWCIASRI TGIFGEMGII
ALLPIILFFG SGLLSTEDFN NYPWNIVVLA MGGTALGKAV ASSGLLNTIA VAIQKQVEDF
SLFGVALTFG FLILTMATFV SHTVAALIII PLVSEIGNNM EEPHPRLLIM ISALLCSAAM
GLPTSGFPNV TAICMTDEFG KPYLTVGTFI TRGVPSSVIA YAIIVSIGYA TMKLINF