Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_29107 |
Symbol | HIP1 |
ID | 4851843 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2985971 |
End bp | 2991734 |
Gene Length | 5764 bp |
Protein Length | 529 aa |
Translation table | |
GC content | 40% |
IMG OID | 640393551 |
Product | histidine permease |
Protein accession | XP_001387137 |
Protein GI | 126275764 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0833] Amino acid transporters |
TIGRFAM ID | [TIGR00913] amino acid permease (yeast) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0754466 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCTTA ATGGCACCAC TTCGTGGCAA AAGATATAAG AACAATATTA CTTGCTGGTT CCTCCATTTA TATTTCAATT TCGGTTTTTC CTTCACAATC GATATATTCT ACTAATTTCC ATACTAATAT TCTACAATGA TCGCTCAATT AGGTTTGAAC ACCAAGATTC CTTACCATTT CTTGTTCTGG GGAGTTGCAT TTGGTGGTTC GTCTTTCTAT TCATTTATCG TTTCTCCACT TGTTTTCAAA AAGTTGCCTA GAGAAGAATT CAGTAACTTG CAAACCCAGG TGTTCCCCAC TTATTTTTCT GGTCAAATCG TAGCTCCTAT TATCTTGGGG TTGACTTCTC CATTGAAGCT TTGTCCATTC ACTACTGGTT TACTTGTCGC TTCTTCTGTC GGAGGATTGT TGAACTTCTA CTGGTTGATG CCAGTTTGCC GCAACCTCAA GGAGAAGAAG AACAAGTTGA TTGCTGATAA ATTGGATACT ACTGAATCTG GTGAACCAAC TGAAGAGTAC ACTGCTACCA TCAAGAAGTT TGGTGCCTAC CACGGTCTTT CCTCTTTGGC TAATGCCTTG TCCATTGTCT CTCTCGGTTT CTACGGTGTC TTGTTGGCCA AGAAACTTGT CTAAGCAGAG GACATTTATA TAAAAACAAT ATGTTTGGGA GTTGAAATCT GAAGAAGCAG AATTTCAACA ATGGAAAATT TATGTACATA CTACCTCGAA TTTGATCAGG GAGCCTATTC CCAAATATAT ATATGAATAT TATGTAGCAG ACGTGCCCGT AACATTATCA AGCTATATTA TTAAGCTAAC GTGAACTGAT TAAACTATGG TACAAAATAT GAGACGGTGG CTATTACTGG CTTATTTATT CTTCTTAGCT GCAATTGCGG CTTCAGCCTT TGCAACTTCA GCTTTGTCTT TGGCGATTTC TTCAGCATAG ACCTTTGGAT AGTAGACGAA AAGAAGAAGG TATCCGAGAC CAAAAGTACC TCCAATGAAC AAAACAGCAT GTGGAGCGAT TTTTGTCATT GGCATATAGG GGTCAAGGTA CAATTGTCTC ATTATATATG CGTAATGGAA GAGAATTGCA CACATCACCA AGAAACTCAC AGCAAAGAAC TTGACATCCT TGATGAGCTT GGGGGTGACG TATTTTTCAT AGAGAGTCTT GGTTGGTCTC TTTGTAGAGC CTACGGCAGC AGATTCGGAC ATGGCTATTT TCTGTATTGA CCTTTTACCC TTGGAGGAGA TTTCGGCACG GAATCGAAAG AAGAATCTGG AGATATTGGT GCCGAACTCT TTTTTGAAGA TGAGCCAAAC ATGTAAGAGG ACAGCGACAT TTGAATTGAT ATTTTTCTGG TAGCAGCTCG CGCGACATTT TTGCTATTCC ACACTGCAGG CTGACTTGAC TTCAAGTACA TTTTACAGTG TGCTAACTTT GCTTACGTCT GTGTGTATGG CAGGAACGAC GTTTATGCAC TCAAAACTCC GATCCTCGAT GCATAACATA ACACCATATC CGTGGCTTTA GGCAGATTGA TTTGATGTTG TAGCAAAATA TCGATAAAAA TACTACTGCT AGTAATAATA ATCCCATGGC CAGGTCATTC TATGGTATAA CTTACTAGTT GTTTCCTGCC TACTGTTCCC CTCCATGTGG ATCTAAAAAA GCCACTGGAA CCTCACTATA TTGTCAGGAA GACGATTTCG GTTGGAGTTC TGCTGTATTT TAGTCTCAGC TGATAAATAT ATTCGTATAC GAGAACTATC CAATACAGGA ATGCACCTTC CACTAAGTTT GCACCCCTTT AGTCCGCGAT CTTTGCTTCT GGAGTTAGGA TCTAGAATGA ATGATAATGA ATATGATAAT GAAAATTAAC AGGTCATAGA GAACGTGCAT TGAAAGTATC CATGCTGCTG AAGTACGTCC TCGAGGCAGA CCAAATCTCG ATTTTAGTAT TTATAAGCAC AGTAGGATAG AGCTGAAGGA TCCTTCTACG GCAATCATTC AGAACCAAAA AGAGTCTCAT CGCCTTCGGA GTGGTGATAT CACCAAATAC TGATACAATT CCGACGTGGT CTTCCGGCAA TAAATTTTTA TTGAATGTTA AATTTCCGTG GTTGATTTCC TTCAACGTAG TTTTTCCGGT TACTCCCTAG TAGGCCCACA CTGCATGTTA GCAGACGCCG GCTAAAAATA AAAACACACC ACTAAACTCA AAATTACGAT CTTTAAGAAT AATTACTGCA TGTCGCGGTA AATGACCTGA TCATAGTATT AGCGAGAACG GTTATCTACA GAATAGGGAG CCAAGAGTAA GGTGATTCAG TTCATAGTTA CACATCATGG CAGCGATGCC TGTGGCTGTT TTCGGAAACG TTTCCTGTGT CCACTGGCTC CAAACTATAT TTAAGCAATC TTCTTTTCCC TGTACCATTC CCGTACTCCG AACAGAAAAA TATTTCAATG TTCCATAGTT AAACAAAATA GGTTTCATTA ATCGACTGCA AATTATATAT ATTTACAATG AGCCAGCAAG GAGTTTTCAC AGAAGAATTC TTCCCGGTCG ATCCCAAAGT CTTATCCCCC AATTTCGACG GTCAGGCGGA GGACCAGACC TGGTTGAATG ATTTGCTAGC CAAATCAGCA CCTGTCACTA CCAGCAATAG CTTCACAGGG TTGTCGAACT ATGACAACCC CAAAAGTACA CTACAAATAC TAAACCAACC TATAGACTTT GCTAGACTTT TCGAGAGCGA CAGCTTGTTC AGCACCGACG GATTTGTCTC TAGCTCGGAC GTTTCTTCAA GTAACAGTCC CAACGTCAAA GTTGAACCTG AAGAAAACGA AGGAGTTCCA GATATGGCCG TTATAGACAA TAAATTAAGA AAGATCCAAT CCATGCCAGA TCACACTCCT AATATGCTCG AAACTATTTT TGACACTAAA GACATTCTCA AACAAGAACA GTCACTCGTT AACTCCCCTA AGAAGAACCC TGCAGAAGAT ATTAATCCTG TTTTAAAGAA GTCTCACTCC TTCATTGGCA TTGGCGAAGT CAAGAAGCCT ACCAGAAGAA GAACCCCAAG AAAGAGATTG ACCGAGACCC AGAAGGAGGC TCACAACAAG ATTGAGAAGA AGTATAGAAT TAATATCAAT GCTAAGATTG CAGGGTTGCA AAAGATAATA CCCTGGGTTG CTCTTGACAA GACTGCATTT GAAACTGGAA GAAAAGAGGA CGACGACGAG ACTTCCAACT GTAGCAGATT GAACAAGTCC ATTATCTTAG AGAAGGCTAC TGACTATATC TTGTACATGC AGCAGAACGA AAATAGACTT TTGGAAGAGA ATAGATTGCT CAAAAGAGAA TTAGAGCAGT TGAGAGCCAG CTACAATGCT CTCACGAGAT AATTTGCACT TTCTTTAAAC AATTATTGCC CTGGAGGAGA AAATTTTAGA TAGATACATT TTAGTTCTTT TGTATGATAG AATGACTTAC GATATTAATG AATTGGATAG ACCTGACAAT TATAAAACGT ATTATGATGG CGTATACAAT TATGCGGTAA GGACAGAGCT GCAAGGCTGT TCCTGAAAGC TCGCCCCTGG AGTTGCAACG CCGTTCCCAT CAAGGCATTT CACGGATGTA CATTGCCGCC ATATAGAAGG TTGCACTCTT TTCTATTGTG AAAAAATTTT TTGATGCGGT CTTCGCGAGT TAATCCGAAT TTCGCACCCA TACCGAATGT TCACAAGCAT GACACATTTT CGCCGTTTGA TTTTGCTAGC GTTTGCCGTT GCATGTTGTG ACAGCGGTTG CAATTTCCAG GGCGTTTAGA ATAACGCACT ATGCATAAAG CATCGCAACT CCAGGATTTT CCCCCGACTT GTTTTTGGTT TATAAGTCAA CATTTACCCT AAAGGAAGGA GATACCTGTT TTTTTACATT TCATTCATAG ATTGGCTCTT CACAGATTGG CACTTGGTTG CATATTCTCA ATTGACGGAA AAGTAGGTAG AGAATGTCAG ACAATAGTTT AGCAGCCAAG AAAGACGAGG ACGTCTGTTC AGTCGCTACT TTCCAATTGC AAGCTGAAAC CAAGGAATCC AGTCACCCCA CTCCTACTCA CAAGACATCC AAGTTCAAGA AATTCATTGC TGATTTCAAA ACTGTCGACG CAACCAAGGA TTTGGAAGTA GGAAAGGAAA TCCAAATAAC TACATTAGAT GACAATCTTA GAGACTACCA AGTCAAATTA ATTGCATTAG GTTCATGTAT TGGTAGTGGT TTATTTATTT CCAGTGCTTC TGCTCTTTCT ACCGCTGGAC CAGCAGGTTG TATCATCGGT TATGTTATTG TTGCTTTCCT CATTTTCTTC ATTGTTCAAG CATTGGGAGA ATTGACCAGT TCTTACCCAG TTAGAGGTAA CTTTTTAGCC TACAGTACCA AATTCATTGA CGAATCTTGG GGGTTCGCAA TGAACTGGAA CTATTGTTTG CAATGGTTGG TAACTACTCC TCTTTCATTA GTTTCTGCAT CTTTAACAAT TGAATTCTGG CACACGGAGG TAAACACAGC AGTCTTCGTT GCTATCTTCT ATGTCATCAT TTGCATCATC AACATCTTTG GGGTTAGAGG CTACGGCTAT GGAGAATCTT TCTTTTCTGT TATCAAAGTA GTCGGTATTC TTGGCTTCTG TGTTTTGGCA ATTGTTCTCA TTGCTGGAGG TGGAGAACAA GGATACATTG GTGCAAAGTT TTGGCACAAT CCAGGTGCAT TTGCTAACTC GTTCCATGGT GTTTGCAACA CATTGGTCAA TGCAACTTTT AGTTTCGCTG GTACAGAGTT AGTAGCTATT GCCGCTGCTT CTTCTCCTAA CCCTCGTAGA GCCTTGAATA AGGCACTCAA ACAAATTTTT TGGAGAATCA CTGTGTTCTA TATGCTTGCC ATCATTTTAA TTTGTTTCTT GGTTCCATAC AACGATCCGT CGTTGATGGG TAACTCAGGC TCTTCTGCTT CTCCTTTTGT GATTGCAATT AAAAACGGTG GCATCAAAGC ATTACCGTCT ATCTTTAATG TCATCATTTT GCTTGCAGTA CTTTCAGTTG CTAATGCGAG CGTCTTTGCT TCTTATAGAC CCTTGGTTGC TTTAGCTGAA GCAGGCCATG GTCCAAAGTT TTTGGCTTAT GTAGACAAAA AAGGAAGACC AATCTACTCT ATCATGATTG CTTTAGCGTT TGGGTTGATT GGGTTTGTGT GTGCCTCGTC GCAACAGGCT ACTGTATTCA ATTGGCTTTT GGCTCTCAGT GGTCTTTCTA CCATTTTCAT TTGGTTTTCT ATCTCATTGG CTCAGGTCAG AGTCAATTAT GCTTGCAAAG TGCAAGGTAT TTCCACCGAT AATGTTCCGT TCAAAGCCAT GGGAGGCGAC TACGGTGCTT ACTTTTCGAT GTTTGTTAAC ATTTTGATCT TGATTGCCCA ATTTTACGTT GCCTTGTACC CAGTCGGAGG AGAAAAGTTG AACGCCAACA CATTTTTCCA AGCTTATCTT GCGGCTCCAG TGGTTCTATT CTTTTACATT ATCCACAAGG TGTGGACTAG AAACTGGAGC TTGTACATAA AGGCTGACCA AATTGATGTT ACTACCGGAA GAAACATTAT TGATATGGAT CTTGTAACCC AAGAATTGTA CGAGGAAAAG ACTCGTATGA AAGAACAGCC GTGGTACTCT CGTTTCTTCA ACTTTTGGTG TTAA
|
Protein sequence | MILNGTTSDL EVGKEIQITT LDDNLRDYQV KLIALGSCIG SGLFISSASA LSTAGPAGCI IGYVIVAFLI FFIVQALGEL TSSYPVRGNF LAYSTKFIDE SWGFAMNWNY CLQWLVTTPL SLVSASLTIE FWHTEVNTAV FVAIFYVIIC IINIFGVRGY GYGESFFSVI KVVGILGFCV LAIVLIAGGG EQGYIGAKFW HNPGAFANSF HGVCNTLVNA TFSFAGTELV AIAAASSPNP RRALNKALKQ IFWRITVFYM LAIILICFLV PYNDPSLMGN SGSSASPFVI AIKNGGIKAL PSIFNVIILL AVLSVANASV FASYRPLVAL AEAGHGPKFL AYVDKKGRPI YSIMIALAFG LIGFVCASSQ QATVFNWLLA LSGLSTIFIW FSISLAQVRV NYACKVQGIS TDNVPFKAMG GDYGAYFSMF VNILILIAQF YVALYPVGGE KLNANTFFQA YLAAPVVLFF YIIHKVWTRN WSLYIKADQI DVTTGRNIID MDLVTQELYE EKTRMKEQPW YSRFFNFWC
|
| |