Gene PICST_81069 details


Gene Information

Locus tag: PICST_81069
Symbol: PHO81
ID: 4851581
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 2203155
End bp: 2207134
Gene Length: 3980 bp
Protein Length: 1302 aa
Translation table:
GC content: 40%
IMG OID: 640393289
Product: positive regulatory protein of phosphate pathway
Protein accession: XP_001386771
Protein GI: 126274926
COG category: [R] General function prediction only
COG ID: [COG0666] FOG: Ankyrin repeat
TIGRFAM ID:
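
The reported gene length can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive; the record itself does not state the convention, and the function name `span_length` is hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(2203155, 2207134)
print(gene_length)  # 3980, matching the listed Gene Length
```

Under that assumption the coordinates and the listed length of 3980 bp agree.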


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.351661
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTTG GAAAGTATCT CGCCTCGAGA CAGTTGGAGC TACCTGAGTA CTCCGGTCAT 
TTTATAGACT ACAAGGCTCT CAAGAAATTG ATCAAGCAGT TGGCCATTCC CACCACAACT
TCCACCAATG ACGAATCCAA TAGACCGCTC ACCCAGGCGG AAATACAGCA GACACTCAAA
GAAAACAAGG CGCTGTTCTT CTTCCGTGTG GAGCGTGAGT TGGACAAAGT CAACTCGTTC
TACTTGGAGA AGCAAGCCAA TTTGGCCATC AACCTCGACT TGTTGGTGAT GAAAAAAAAC
GAGCTCTTGA CCAAAAGTGC CTACTTCATC AACCAGCAGA ATAACCTGTC AAACGGTGGC
GGTCCCACCT CGAATCCTCT GGCCAATTCT ATCAACGCCA ATTTCAGAAA CTCCATCTCC
TACTTGAACT TGTACCAGAA CTTCAAAAAA ATACACCAGG ACTTGATCAG ATTACAGCAG
TTTATAGAGC TCAACGAGAC TGGTTTTTCC AAAGTTGTCA AGAAATGGGA CAAACGGTCG
AAGTCTCACA CAAAGGAGTT GTTCATCCTG ACGGCTGTCT CCGTCCAGCC TGTCTTTCAC
AAGAACGAAA TCAACGAGTT GTCAGACTTG GTCACACAGT CGCTCTTTGA CTTGGAATCG
ATAATGGACG GTGATTATTC AACATTAAAC AACTACTCGT CATCTTCTCT TGCAGCTACT
TCCGTGGCCT CTTCTTCGGT ACAGCCACCT ATACTAAAGG TCTTATCCAG AACCAACTCC
ACGTCCAATA TCAATATTCC GAGTGACCCA TCTGATTCTG CCAGCGAAAG ACACCAGTCG
ATCAGCTCTT TGCCAATTCC CGGCAACTTC CCAGCCTCTC GCAATAGCTC TATTATCAAC
TTACAGAATA ACGAAATTGA CGAGTTATAC TCCAGCTTCG TCAACGTAGC CACCATTAAG
GATCCAGACT TGTCGATCTT ATCGAGATGG GTCGACAAAG TGAAGAGCTC ATCCAAGAGT
GGCAGAAATG AGTCTTCTGC GTCGATTTCG TCCAGTCTGT CTGGAAAGCC TCAATTCAAC
TCTATAGCCA AATTCAAGTT GTCCAAGATC TTTCTATTAT CAGTTGCAAA CTTGAAGATC
TCTGACAGCT TCTTAGAGTC TTTCCTCGAT TTGATCCACT ACGAGATCGA CTTTGCATTT
ATCAATGACG AATTCAACAA CAACAAGAAT ATTCTCCACG AATGCTGCTC TATCCCGCCA
GACTCTACCC ACGAGAGAAG CCATCACGTC ATTATCAACA ACGGTGTCAA GGTTATCAAT
TCAACGGATT CGATAAACCA TTCTAGAACG TTCATCATAC AACACATCAT GAACAACTTG
GACCAACAAC AGAGAGCAAA TCTCTTGATC TGCAAGGATT TCAACGGAAG AACCTGTTTG
CATTACGCTT CTCAGAATAA TCGGTTAGAT TTACTTGAGA TATTGACTAC ATATTTCCCC
AAGGAGCATA TAGACGATTT GGACAACGAT TCAATGTCTC CATTATTATT AGCTATTAAG
CACGGAAACC TCAATGTCAT CAAGAAGTTG GTACAACTGG GAAGTAACTG CTTTCCTCAG
AATGATGAAA GTAAATTGCA ATATTTGCCT ATCAATTATG CCTGTAAGTT TGGCGAATAC
AAGACGTTGG AGTATTTGCT CTCTTACAGC AAACCTACTA ATAATAATGA ACAGTTGAAC
CAGCAAGATG TGGAAGGGTT GCCACCTTTG CATGTTATTG CTAGGTCAGG TCACTATAAG
CTAATCAAGT TGTTGATTCA GTACGGTGCC GAAATCAACC GTGTAGATGG TTTCAACAAG
TGGACACCCA TCTTTTATGC AGCTTCTGAA GGACATGTCA AAACTACACA AGAATTAGTG
AAGTTGGGTG CAAAGTTGAA CATTGTGGAC GAGGATGGTT ACAACGTCTT GTACTACTGT
GTTGTAGAGG GTCATATCAA CGTGTTGAAT GAATTATTGA ATCACTACAA TACAATTCAA
TCCAACCAAA AGCGTGAAAA CATTAATACA ATCAACACAA TAATAAGTGA GGAAATCATC
GCTAGCAAAC AACAACAGAA TTCCAGTACT GATAGTAACA ATTCTATGTC AATCTTGGTG
GAGAATGATG ACGACTCTGA GGATAGCGGA AGCATCGATA AGAATAATGT GGATAGTATT
CCGGACTTGC AATTACCACC TCCAATCTTG CCTTTGAGAA GATACGGTCA TAATTTCTTA
GAACAGAAGG TATTGATTGA ATTGATCTTC CCCAGTGATG CAGAGTTTAT CAATTTCTTT
AACTCGACTA CAGATTTAAA ACCAGGCAGA ATAACCTTGA CATCCAATAA CTCTGATATT
GTTCCTAGAA ATATCTTGCT TCCTATTGCA GACGACACCA AAGCGATCAA CAATTGCGTT
TTCCAGACGG AAGTTGATTC ATTGAATGAG TTCAGAATTG ACTTTGAGAT CTTTCCCAAG
TTTGGTACCA GATTAATTGC AAAGACTACC GCTTTATCTT TTTCGCAGAT TGACACCTCA
TCTCCAGAAA TAAACTCAAT TCAACTTCCT CTCTTTGACT TAAGATTGAG AAACATTGGT
GAATTGAAAT TTAATTACCA AGTCATTTTT CCGTTTTCTG GTACATTGTT GGAGACGTCG
AAGTTTGACA CTTACTGGAA ATCGTCTACA AGCTTTGTAA AGAATAGGCA GACATTGAAA
TTGAACGCTG CTGGAGGATT GTCACCTAAT AACTTTTTGT CGCCTTCTAG CATTAACTCT
ATGATTGGTT CTTCTTCAAC TGCTTCCAAG AGTAATGCAT CTGGAAATGG CAACAGCAAC
TTGATTAACA ATCCTCTTAC CTCTACCAAT GAATTAGTCT CTGGCTCTAC ACCGTCTTCA
ATCGTTACTG CTACGTCTTT ATCTGGAGAA TATCTCAGAA TCAAGATATG TTTGCTTAAT
GACGGTACAC CGGTTGTATG CCCTCATTGG TCTATTCCTA TTACTGAAAA TATTGACCTC
TACTTGCCTA ATTTGTCCTT AGAACAATTA AGCTCCATTA CAAACGACTT ATTTGATTAC
GGCAAGGTAA TTCACGATTT GTCTGGAATG ACTATTAAAG ATATTGTCTT GATAAAGAAA
CTTTTGAGAA TCATTTACTT ACCATTGGAT GTGTTATTGG AGATTTTAAA TGTTGAAATC
AATTTGAACT TAGAGTTAGT TTTTCCATCC CTTTACGAAT TGGAGAATTT GCCATTCATT
GGTAACATAC AGCTGAACTT GAACAACTTT ATTGACTTCA CTTTGAATGA CGTCTTTAAT
CACATTCGGT CATCTAAATT GAGCCATGGT TCTTCTGGAA GATCAATCAT CTTTCTCTCC
TCGAATTCTC TTATTTGCAA GATTTTGAAC TGGAAACAAC CAAATTTTCC TGTTTTCTTG
ATCATGAATG GCATTGCTTA CAATGGCAAT AAACGAAAGT TTGAATCCAG AACTGCCAAC
GGCTTATTGA TTGAAGATAT TCAAAAGTCA AAGGCTGCCA GCGGACATCA GCCAATCACT
CCAGATTCAA AACCACCCAA CAAGTCATCT CGCGATAAAA TTGTCAGCAA CCACCAGGAA
GTTACAATTA GATCTATCAA AGAGGCTGTG AACTTCACGA TCAACAACAA TTTGATTGGT
TTAATCACTT CAATTCACTT GTTGGATTTG GTTCCCAAGT TGGTCCCTTT GATTAGATCC
AGAGGTCTAG TGTTGGTTGC TTCCAGTGAT GTTGGTGACG AGGAGAACGA AGAATCTCTT
CAGAAGGAAT TGGATTCGTA CACGAGAACC GAAATCAACG GTTTGAGATT TGATGACGTC
TTGAGCTTCA AAGAAGATAT CACAATGTGA GGTACATAGC ATAGGTTTTA TTTATTGTAT
ATTAATGTAT CGGTTACTTT
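
The gene sequence above is laid out in blocks of 10 bases, 60 bases per line. As a minimal sketch, the blocks can be joined and a GC fraction computed as follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input, so the printed value is not the gene-wide GC content.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and newlines from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from the record, used as sample input only.
first_line = "ATGAAGTTTG GAAAGTATCT CGCCTCGAGA CAGTTGGAGC TACCTGAGTA CTCCGGTCAT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full gene sequence through the same two helpers would give a value to compare with the listed GC content.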
 
Protein sequence
MKFGKYLASR QLELPEYSGH FIDYKALKKL IKQLAIPTTT STNDESNRPL TQAEIQQTLK 
ENKALFFFRV ERELDKVNSF YLEKQANLAI NLDLLVMKKN ELLTKSAYFI NQQNNLSNGG
GPTSNPLANS INANFRNSIS YLNLYQNFKK IHQDLIRLQQ FIELNETGFS KVVKKWDKRS
KSHTKELFIL TAVSVQPVFH KNEINELSDL VTQSLFDLES IMDGDYSTLN NYSSSSLAAT
SPPILKVLSR TNSTSNINIP SDPSDSASER HQSISSLPIP GNFPASRNSS IINLQNNEID
ELYSSFVNVA TIKDPDLSIL SRWVDKVKSS SKSGRNESSA SISSSLSGKP QFNSIAKFKL
SKIFLLSVAN LKISDSFLES FLDLIHYEID FAFINDEFNN NKNILHECCS IPPDSTHERS
HHVIINNGVK VINSTDSINH SRTFIIQHIM NNLDQQQRAN LLICKDFNGR TCLHYASQNN
RLDLLEILTT YFPKEHIDDL DNDSMSPLLL AIKHGNLNVI KKLVQLGSNC FPQNDESKLQ
YLPINYACKF GEYKTLEYLL SYSKPTNNNE QLNQQDVEGL PPLHVIARSG HYKLIKLLIQ
YGAEINRVDG FNKWTPIFYA ASEGHVKTTQ ELVKLGAKLN IVDEDGYNVL YYCVVEGHIN
VLNELLNHYN TIQSNQKREN INTINTIISE EIIASKQQQN SSTDSNNSMS ILVENDDDSE
DSGSIDKNNV DSIPDLQLPP PILPLRRYGH NFLEQKVLIE LIFPSDAEFI NFFNSTTDLK
PGRITLTSNN SDIVPRNILL PIADDTKAIN NCVFQTEVDS LNEFRIDFEI FPKFGTRLIA
KTTALSFSQI DTSSPEINSI QLPLFDLRLR NIGELKFNYQ VIFPFSGTLL ETSKFDTYWK
SSTSFVKNRQ TLKLNAAGGL SPNNFLSPSS INSMIGSSST ASKSNASGNG NSNLINNPLT
STNELVSGST PSSIVTATSL SGEYLRIKIC LLNDGTPVVC PHWSIPITEN IDLYLPNLSL
EQLSSITNDL FDYGKVIHDL SGMTIKDIVL IKKLLRIIYL PLDVLLEILN VEINLNLELV
FPSLYELENL PFIGNIQLNL NNFIDFTLND VFNHIRSSKL SHGSSGRSII FLSSNSLICK
ILNWKQPNFP VFLIMNGIAY NGNKRKFESR TANGLLIEDI QKSKAASGHQ PITPDSKPPN
KSSRDKIVSN HQEVTIRSIK EAVNFTINNN LIGLITSIHL LDLVPKLVPL IRSRGLVLVA
SSDVGDEENE ESLQKELDSY TRTEINGLRF DDVLSFKEDI TM