Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_86239 |
Symbol | PHO87 |
ID | 4851128 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 997351 |
End bp | 1000468 |
Gene Length | 3118 bp |
Protein Length | 963 aa |
Translation table | |
GC content | 42% |
IMG OID | 640392836 |
Product | phosphate permease |
Protein accession | XP_001387832 |
Protein GI | 126274114 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0471] Di- and tricarboxylate transporters |
TIGRFAM ID | [TIGR00785] anion transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.182724 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTTT CACATTCGCT CAAGTTCAAC GCAGTTCCGG AATGGCAGGA CAACTATGTC AATTATCCGG CGCTCAAAAA AGTCATCTAC AAGTTGCAGC AAGACCAGTT GCAGGTCGCT AACGCTCCCA AAGACGGCTC GGTTCCAGTA GACAACGTTA ACAGAACAAC CGTTTCCGAC TTGGTAGAGA CCTACAAGTT CAAGAAAAGT GCTAGAACCG CTGCTGAGGC TGAAGAGGAC GACGATGACG ACGATATCAT CTACGAAGAT GAAAAACAGC CTACAAGCTC ATCTCGCTTC AAGAAAAAAT TTATCAACCG CTTCAAGAAG AGTTCGCCTG CTATTCCTGA AAATGCCTCG GTTTCTGGCT CCACTACTAC TGACTTGGAG AAGGCCGTCG GGTCAATGTC TTCTGCTACT GAGATGATAG AGGACAACTC CAAGAAATCT TCTACTGAAG CTTTCACAGT AGAAGTCGAC AATTCGACTA TAGTTTCCTT CACTTCGCAA GTGCATGCGC AATCGCCTGC AGCCTCGGAA AACGACTACT CTAGTCCGGC CACCACCGCT TTTGACCCTT TGAAGGTGTT TTCCAAGCAG TTATTGATCC AGTTGCTGAA GATCAACGAG TTCTACCAGA CAAAGGAACA GCAAGTGTTC ACTGGCTACG ACAACTTGAT CAAAGACTTG GAAGCAAACA ACGTCAATAT TGACGAGGTT TTCAAGTTCA CCCAGGCTTA TCACTACGAA AACCCTGAAA TTGTTAATGC TGATGACCAC CACCAGTACC ACATCAAGTC GACTCTCACC AGAGTCACAA CAAACGCCTC TGTTTTTGAC CACATCAACC ATTTGGGCAA CGATTACGAT GCTGATCACC ACGGAGACTT GGAAAAACAA TCTCATATTC ACATTGATGA AGACGAAGAC GATGATGATG ACGACGATGA TGACGATGAA GACAACCACA GCGAAGATTC CGTCTTGTTG TCTCACACAG ACTTCAATGT CAAGCAACAG AAGAAGATTA CGTTGAAGAA GAAGTCCATC ACTCTCTTCA TCAACCTCTC GGAATTGAAG TCGTTCATCG AGTTGAATAG AGTAGGCTTC ACCAAAATCT GTAAGAAGTT CGACAAAACT TGTAACTACT CCATCAAAGA CGACTTCATC CAGAACTACT TGCCCAACAA TTCCCGTGTC TTCTACCCAG AAACTATTGA AGACTTGGAC TACAAATTGA ATCAGATTGT CAAGATTTAC GCTTACCTCT CCAACAGATT GACTCCCAGA GCCACGAAGG AAGACTTGGA AAGTGTGAAG AACGACTTGA GATCCCATTT GCGTGATCAC ATCGTTTGGG AACGTAACAC CGTCTGGAAG GATTTGTTAT CGTTGGAAAA GAAGTCCTAC AACTTGGACC TCAATGTAGA CTCCAACCAG GCCAAGATGG GCGATGAAGG TTTGCAGAAC TCACTCTTGA ACATGAAACT TACCACGATC AACTTGCCCT TCTCGTTGTT CGGCTATAGC CATGTCAAAG TCCCTTCATT CTTCTTCACC ACCCAAATGA TCAAGTTGCT GATCATAATC ATAGTTTTTA TTGTCTTATT GACAGTTAAA ACCTTCAACG ATCCTGTTCA AGCTCGTTGT TTGGCTCTTT TGGTCGCTGC TGCTATGCTC TGGGCTTCTG AAGCTTTACC ATTGTTCACC ACTGCTCTTC TCATTCCATT GTTAGTTGTC ACCATGAAGG TTTGTAAAGT TGATGGCTCT GACGAACCTA TGGATGGTGT AACTGCTTCG CAGTACATCT TGTCAACCAT GTGGAACTCT ACTATCATGA TCTTAATCGG TGGTTTTACT TTGGCTGCTG CATTATCTAA GTACAACATA GCCAAGGTCT TGTCTTCTTA CATCTTAGCT TTTGCTGGTA CCAAGCCTCG TAACGTTTTG ATTTCCATCA TGTCCGTAGC CTTGTTCTTG TCGATGTGGA TTTCCAACGT CGCTGCCCCA GTCTTGTGTT TCTCGTTAAT TCAACCGGTC TTGAGAAGTG TCCCTACAGA CTCTCCAGTG GCTCAGGTTA TGGTTTTGGG TATTGCTTTG GCTGCTAATG TTGCTGGTAT GTCTTCTCCA ATTTCATCTC CTCAAAACGT TGTTGCTCTT CAGTATATGG ACCCAAATCC AGGTTGGGGA AAGTGGTTTG CAGTTTCTAT TCCAGTATCC ATACTTTCCT TGATAGGTAT CTGGTTGATG TTGATCTTCA CTTTCAAGAT CAACAATATT AAATTAAAGG CCTACAAGCC AATCAGAGAA AAGTTCACCA CCAAGCAATA CTTTGTTAGT ATTGTCACCA TTCTCACCAT CTTATTATGG TGTGTCATGA CTAAGATTTC GGGCACTTTC GGTGAAGCTG GTCAAATCTC ATTCATTCCA ATCGTCTTAT TCTTTGGTAC TGGTTTGTTG AAGACTGATG ATATCAACAA CTACCCATGG TCCATTGTTC TTTTGGCTAT GGGTGGTATT GCCTTAGGTA AAGCTGTCAG CTCTTCTGGA TTATTAGGTA CCATCGCTAT GGCCTTACAG AAGAAAATCA TGCACTTTGA TGTCTTTGTC ATTCTTATCA TCTTCGGTAT CTTGATGTTG GTCATTGCTA CTTTTGTCTC TCACACTGTT GCTGCTTTGA TCATTTTGCC ATTGGTCAAG GAAGTCGGCG ATGCCTTGCC TCATCCTCAT CCTTTGATCT TGGTCATGGG TACTGCTATG ATTGCTTCCT CGGCAATGGG ATTGCCAACA TCAGGTTTCC CTAATGTCAC AGCGATCAGT ATGCGAGACG AAGTAGGAAA GAACTACTTA ACCGTCAACA CATTTATCAC CAGAGGTGTT CCAGCATCTA TAATTGCATA CATTATCGTT ATTACCGTTG GTTACGGTAT CATGTCTGCC ATTAAATTCT AAAAGGACGT TTTATTTTGT TGTTTTTTTG TTTTTTTCAT TCATTGCATT AGGTTGGTTT TTGTTAAATG TTCATCACAC TATAGAATTT TGGTTTCAAC CTCCAAAAGT TTCCCATTTC TTGTTTTATT CTACGAAGTT TGTACTAAAA TAAATGTTTA ATTACTAT
|
Protein sequence | MKFSHSLKFN AVPEWQDNYV NYPALKKVIY KLQQDQLQVA NAPKDGSVPV DNVNRTTVSD LVETYKFKKN EKQPTSSSRF KKKFINRFKK SSPAIPENAS VSGSTTTDLE KAVGSMSSAT EMIEDNSKKS STEAFTVEVD NSTIVSFTSQ VHAQSPAASE NDYSSPATTA FDPLKVFSKQ LLIQLLKINE FYQTKEQQVF TGYDNLIKDL EANNVNIDEV FKFTQAYHYE NPEIVNADDH HQYHIKSTLT RVTTNASVFD HINHLGNDYD ADHHGDLEKQ SHIHIDEDED DDDDDDDDDE DNHSEDSVLL SHTDFNVKQQ KKITLKKKSI TLFINLSELK SFIELNRVGF TKICKKFDKT CNYSIKDDFI QNYLPNNSRV FYPETIEDLD YKLNQIVKIY AYLSNRLTPR ATKEDLESVK NDLRSHLRDH IVWERNTVWK DLLSLEKKSY NLDLNVDSNQ AKMGDEGLQN SLLNMKLTTI NLPFSLFGYS HVKVPSFFFT TQMIKLLIII IVFIVLLTVK TFNDPVQARC LALLVAAAML WASEALPLFT TALLIPLLVV TMKVCKVDGS DEPMDGVTAS QYILSTMWNS TIMILIGGFT LAAALSKYNI AKVLSSYILA FAGTKPRNVL ISIMSVALFL SMWISNVAAP VLCFSLIQPV LRSVPTDSPV AQVMVLGIAL AANVAGMSSP ISSPQNVVAL QYMDPNPGWG KWFAVSIPVS ILSLIGIWLM LIFTFKINNI KLKAYKPIRE KFTTKQYFVS IVTILTILLW CVMTKISGTF GEAGQISFIP IVLFFGTGLL KTDDINNYPW SIVLLAMGGI ALGKAVSSSG LLGTIAMALQ KKIMHFDVFV ILIIFGILML VIATFVSHTV AALIILPLVK EVGDALPHPH PLILVMGTAM IASSAMGLPT SGFPNVTAIS MRDEVGKNYL TVNTFITRGV PASIIAYIIV ITVGYGIMSA IKF
|
| |