Gene PICST_29107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_29107 
SymbolHIP1 
ID4851843 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2985971 
End bp2991734 
Gene Length5764 bp 
Protein Length529 aa 
Translation table 
GC content40% 
IMG OID640393551 
Producthistidine permease 
Protein accessionXP_001387137 
Protein GI126275764 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0833] Amino acid transporters 
TIGRFAM ID[TIGR00913] amino acid permease (yeast) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0754466 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCTTA ATGGCACCAC TTCGTGGCAA AAGATATAAG AACAATATTA CTTGCTGGTT 
CCTCCATTTA TATTTCAATT TCGGTTTTTC CTTCACAATC GATATATTCT ACTAATTTCC
ATACTAATAT TCTACAATGA TCGCTCAATT AGGTTTGAAC ACCAAGATTC CTTACCATTT
CTTGTTCTGG GGAGTTGCAT TTGGTGGTTC GTCTTTCTAT TCATTTATCG TTTCTCCACT
TGTTTTCAAA AAGTTGCCTA GAGAAGAATT CAGTAACTTG CAAACCCAGG TGTTCCCCAC
TTATTTTTCT GGTCAAATCG TAGCTCCTAT TATCTTGGGG TTGACTTCTC CATTGAAGCT
TTGTCCATTC ACTACTGGTT TACTTGTCGC TTCTTCTGTC GGAGGATTGT TGAACTTCTA
CTGGTTGATG CCAGTTTGCC GCAACCTCAA GGAGAAGAAG AACAAGTTGA TTGCTGATAA
ATTGGATACT ACTGAATCTG GTGAACCAAC TGAAGAGTAC ACTGCTACCA TCAAGAAGTT
TGGTGCCTAC CACGGTCTTT CCTCTTTGGC TAATGCCTTG TCCATTGTCT CTCTCGGTTT
CTACGGTGTC TTGTTGGCCA AGAAACTTGT CTAAGCAGAG GACATTTATA TAAAAACAAT
ATGTTTGGGA GTTGAAATCT GAAGAAGCAG AATTTCAACA ATGGAAAATT TATGTACATA
CTACCTCGAA TTTGATCAGG GAGCCTATTC CCAAATATAT ATATGAATAT TATGTAGCAG
ACGTGCCCGT AACATTATCA AGCTATATTA TTAAGCTAAC GTGAACTGAT TAAACTATGG
TACAAAATAT GAGACGGTGG CTATTACTGG CTTATTTATT CTTCTTAGCT GCAATTGCGG
CTTCAGCCTT TGCAACTTCA GCTTTGTCTT TGGCGATTTC TTCAGCATAG ACCTTTGGAT
AGTAGACGAA AAGAAGAAGG TATCCGAGAC CAAAAGTACC TCCAATGAAC AAAACAGCAT
GTGGAGCGAT TTTTGTCATT GGCATATAGG GGTCAAGGTA CAATTGTCTC ATTATATATG
CGTAATGGAA GAGAATTGCA CACATCACCA AGAAACTCAC AGCAAAGAAC TTGACATCCT
TGATGAGCTT GGGGGTGACG TATTTTTCAT AGAGAGTCTT GGTTGGTCTC TTTGTAGAGC
CTACGGCAGC AGATTCGGAC ATGGCTATTT TCTGTATTGA CCTTTTACCC TTGGAGGAGA
TTTCGGCACG GAATCGAAAG AAGAATCTGG AGATATTGGT GCCGAACTCT TTTTTGAAGA
TGAGCCAAAC ATGTAAGAGG ACAGCGACAT TTGAATTGAT ATTTTTCTGG TAGCAGCTCG
CGCGACATTT TTGCTATTCC ACACTGCAGG CTGACTTGAC TTCAAGTACA TTTTACAGTG
TGCTAACTTT GCTTACGTCT GTGTGTATGG CAGGAACGAC GTTTATGCAC TCAAAACTCC
GATCCTCGAT GCATAACATA ACACCATATC CGTGGCTTTA GGCAGATTGA TTTGATGTTG
TAGCAAAATA TCGATAAAAA TACTACTGCT AGTAATAATA ATCCCATGGC CAGGTCATTC
TATGGTATAA CTTACTAGTT GTTTCCTGCC TACTGTTCCC CTCCATGTGG ATCTAAAAAA
GCCACTGGAA CCTCACTATA TTGTCAGGAA GACGATTTCG GTTGGAGTTC TGCTGTATTT
TAGTCTCAGC TGATAAATAT ATTCGTATAC GAGAACTATC CAATACAGGA ATGCACCTTC
CACTAAGTTT GCACCCCTTT AGTCCGCGAT CTTTGCTTCT GGAGTTAGGA TCTAGAATGA
ATGATAATGA ATATGATAAT GAAAATTAAC AGGTCATAGA GAACGTGCAT TGAAAGTATC
CATGCTGCTG AAGTACGTCC TCGAGGCAGA CCAAATCTCG ATTTTAGTAT TTATAAGCAC
AGTAGGATAG AGCTGAAGGA TCCTTCTACG GCAATCATTC AGAACCAAAA AGAGTCTCAT
CGCCTTCGGA GTGGTGATAT CACCAAATAC TGATACAATT CCGACGTGGT CTTCCGGCAA
TAAATTTTTA TTGAATGTTA AATTTCCGTG GTTGATTTCC TTCAACGTAG TTTTTCCGGT
TACTCCCTAG TAGGCCCACA CTGCATGTTA GCAGACGCCG GCTAAAAATA AAAACACACC
ACTAAACTCA AAATTACGAT CTTTAAGAAT AATTACTGCA TGTCGCGGTA AATGACCTGA
TCATAGTATT AGCGAGAACG GTTATCTACA GAATAGGGAG CCAAGAGTAA GGTGATTCAG
TTCATAGTTA CACATCATGG CAGCGATGCC TGTGGCTGTT TTCGGAAACG TTTCCTGTGT
CCACTGGCTC CAAACTATAT TTAAGCAATC TTCTTTTCCC TGTACCATTC CCGTACTCCG
AACAGAAAAA TATTTCAATG TTCCATAGTT AAACAAAATA GGTTTCATTA ATCGACTGCA
AATTATATAT ATTTACAATG AGCCAGCAAG GAGTTTTCAC AGAAGAATTC TTCCCGGTCG
ATCCCAAAGT CTTATCCCCC AATTTCGACG GTCAGGCGGA GGACCAGACC TGGTTGAATG
ATTTGCTAGC CAAATCAGCA CCTGTCACTA CCAGCAATAG CTTCACAGGG TTGTCGAACT
ATGACAACCC CAAAAGTACA CTACAAATAC TAAACCAACC TATAGACTTT GCTAGACTTT
TCGAGAGCGA CAGCTTGTTC AGCACCGACG GATTTGTCTC TAGCTCGGAC GTTTCTTCAA
GTAACAGTCC CAACGTCAAA GTTGAACCTG AAGAAAACGA AGGAGTTCCA GATATGGCCG
TTATAGACAA TAAATTAAGA AAGATCCAAT CCATGCCAGA TCACACTCCT AATATGCTCG
AAACTATTTT TGACACTAAA GACATTCTCA AACAAGAACA GTCACTCGTT AACTCCCCTA
AGAAGAACCC TGCAGAAGAT ATTAATCCTG TTTTAAAGAA GTCTCACTCC TTCATTGGCA
TTGGCGAAGT CAAGAAGCCT ACCAGAAGAA GAACCCCAAG AAAGAGATTG ACCGAGACCC
AGAAGGAGGC TCACAACAAG ATTGAGAAGA AGTATAGAAT TAATATCAAT GCTAAGATTG
CAGGGTTGCA AAAGATAATA CCCTGGGTTG CTCTTGACAA GACTGCATTT GAAACTGGAA
GAAAAGAGGA CGACGACGAG ACTTCCAACT GTAGCAGATT GAACAAGTCC ATTATCTTAG
AGAAGGCTAC TGACTATATC TTGTACATGC AGCAGAACGA AAATAGACTT TTGGAAGAGA
ATAGATTGCT CAAAAGAGAA TTAGAGCAGT TGAGAGCCAG CTACAATGCT CTCACGAGAT
AATTTGCACT TTCTTTAAAC AATTATTGCC CTGGAGGAGA AAATTTTAGA TAGATACATT
TTAGTTCTTT TGTATGATAG AATGACTTAC GATATTAATG AATTGGATAG ACCTGACAAT
TATAAAACGT ATTATGATGG CGTATACAAT TATGCGGTAA GGACAGAGCT GCAAGGCTGT
TCCTGAAAGC TCGCCCCTGG AGTTGCAACG CCGTTCCCAT CAAGGCATTT CACGGATGTA
CATTGCCGCC ATATAGAAGG TTGCACTCTT TTCTATTGTG AAAAAATTTT TTGATGCGGT
CTTCGCGAGT TAATCCGAAT TTCGCACCCA TACCGAATGT TCACAAGCAT GACACATTTT
CGCCGTTTGA TTTTGCTAGC GTTTGCCGTT GCATGTTGTG ACAGCGGTTG CAATTTCCAG
GGCGTTTAGA ATAACGCACT ATGCATAAAG CATCGCAACT CCAGGATTTT CCCCCGACTT
GTTTTTGGTT TATAAGTCAA CATTTACCCT AAAGGAAGGA GATACCTGTT TTTTTACATT
TCATTCATAG ATTGGCTCTT CACAGATTGG CACTTGGTTG CATATTCTCA ATTGACGGAA
AAGTAGGTAG AGAATGTCAG ACAATAGTTT AGCAGCCAAG AAAGACGAGG ACGTCTGTTC
AGTCGCTACT TTCCAATTGC AAGCTGAAAC CAAGGAATCC AGTCACCCCA CTCCTACTCA
CAAGACATCC AAGTTCAAGA AATTCATTGC TGATTTCAAA ACTGTCGACG CAACCAAGGA
TTTGGAAGTA GGAAAGGAAA TCCAAATAAC TACATTAGAT GACAATCTTA GAGACTACCA
AGTCAAATTA ATTGCATTAG GTTCATGTAT TGGTAGTGGT TTATTTATTT CCAGTGCTTC
TGCTCTTTCT ACCGCTGGAC CAGCAGGTTG TATCATCGGT TATGTTATTG TTGCTTTCCT
CATTTTCTTC ATTGTTCAAG CATTGGGAGA ATTGACCAGT TCTTACCCAG TTAGAGGTAA
CTTTTTAGCC TACAGTACCA AATTCATTGA CGAATCTTGG GGGTTCGCAA TGAACTGGAA
CTATTGTTTG CAATGGTTGG TAACTACTCC TCTTTCATTA GTTTCTGCAT CTTTAACAAT
TGAATTCTGG CACACGGAGG TAAACACAGC AGTCTTCGTT GCTATCTTCT ATGTCATCAT
TTGCATCATC AACATCTTTG GGGTTAGAGG CTACGGCTAT GGAGAATCTT TCTTTTCTGT
TATCAAAGTA GTCGGTATTC TTGGCTTCTG TGTTTTGGCA ATTGTTCTCA TTGCTGGAGG
TGGAGAACAA GGATACATTG GTGCAAAGTT TTGGCACAAT CCAGGTGCAT TTGCTAACTC
GTTCCATGGT GTTTGCAACA CATTGGTCAA TGCAACTTTT AGTTTCGCTG GTACAGAGTT
AGTAGCTATT GCCGCTGCTT CTTCTCCTAA CCCTCGTAGA GCCTTGAATA AGGCACTCAA
ACAAATTTTT TGGAGAATCA CTGTGTTCTA TATGCTTGCC ATCATTTTAA TTTGTTTCTT
GGTTCCATAC AACGATCCGT CGTTGATGGG TAACTCAGGC TCTTCTGCTT CTCCTTTTGT
GATTGCAATT AAAAACGGTG GCATCAAAGC ATTACCGTCT ATCTTTAATG TCATCATTTT
GCTTGCAGTA CTTTCAGTTG CTAATGCGAG CGTCTTTGCT TCTTATAGAC CCTTGGTTGC
TTTAGCTGAA GCAGGCCATG GTCCAAAGTT TTTGGCTTAT GTAGACAAAA AAGGAAGACC
AATCTACTCT ATCATGATTG CTTTAGCGTT TGGGTTGATT GGGTTTGTGT GTGCCTCGTC
GCAACAGGCT ACTGTATTCA ATTGGCTTTT GGCTCTCAGT GGTCTTTCTA CCATTTTCAT
TTGGTTTTCT ATCTCATTGG CTCAGGTCAG AGTCAATTAT GCTTGCAAAG TGCAAGGTAT
TTCCACCGAT AATGTTCCGT TCAAAGCCAT GGGAGGCGAC TACGGTGCTT ACTTTTCGAT
GTTTGTTAAC ATTTTGATCT TGATTGCCCA ATTTTACGTT GCCTTGTACC CAGTCGGAGG
AGAAAAGTTG AACGCCAACA CATTTTTCCA AGCTTATCTT GCGGCTCCAG TGGTTCTATT
CTTTTACATT ATCCACAAGG TGTGGACTAG AAACTGGAGC TTGTACATAA AGGCTGACCA
AATTGATGTT ACTACCGGAA GAAACATTAT TGATATGGAT CTTGTAACCC AAGAATTGTA
CGAGGAAAAG ACTCGTATGA AAGAACAGCC GTGGTACTCT CGTTTCTTCA ACTTTTGGTG
TTAA
 
Protein sequence
MILNGTTSDL EVGKEIQITT LDDNLRDYQV KLIALGSCIG SGLFISSASA LSTAGPAGCI 
IGYVIVAFLI FFIVQALGEL TSSYPVRGNF LAYSTKFIDE SWGFAMNWNY CLQWLVTTPL
SLVSASLTIE FWHTEVNTAV FVAIFYVIIC IINIFGVRGY GYGESFFSVI KVVGILGFCV
LAIVLIAGGG EQGYIGAKFW HNPGAFANSF HGVCNTLVNA TFSFAGTELV AIAAASSPNP
RRALNKALKQ IFWRITVFYM LAIILICFLV PYNDPSLMGN SGSSASPFVI AIKNGGIKAL
PSIFNVIILL AVLSVANASV FASYRPLVAL AEAGHGPKFL AYVDKKGRPI YSIMIALAFG
LIGFVCASSQ QATVFNWLLA LSGLSTIFIW FSISLAQVRV NYACKVQGIS TDNVPFKAMG
GDYGAYFSMF VNILILIAQF YVALYPVGGE KLNANTFFQA YLAAPVVLFF YIIHKVWTRN
WSLYIKADQI DVTTGRNIID MDLVTQELYE EKTRMKEQPW YSRFFNFWC