Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_87014 |
Symbol | |
ID | 4851637 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 2408036 |
End bp | 2410354 |
Gene Length | 2319 bp |
Protein Length | 621 aa |
Translation table | |
GC content | 42% |
IMG OID | 640393345 |
Product | predicted protein |
Protein accession | XP_001386797 |
Protein GI | 126275117 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0814] Amino acid permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.728691 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.601954 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GAGCAGTATA CAGACAGCAT TTTAGTCAGC GTTTTCAACA AGTGTTTTGT CAACTTTTTA GACATTTTTC ATGAGATCCC CAGCTTGAGT CTAGACTTTA TCCAGAGAAG ATCTCATTAC AGATAGAAGA CAATCCGATT TCTAAGACTA ATAGGAAAGC TTTCAGTTTC ACAATACAAA TTTGAGACGG TTTTCTACGG ATCGATACAA AGTTGAATAC TGAATAACAT TTACACACCG GTGACCGGAA TCTAAGATAT CATCACGAAA TCACCATAGT CGGATACACC ATTCATAGTA TAGCATTATG TCTTCGTTCA GACAGAATAC CAACAGTTCG TTGTCCAACT CACCTTTTCA ACATTCGCCG CCTATAAGGC TGCCTACGCC GACGGTGTCG CTTCATTCGA GCAATTCTCA TTCTAATCTT CGACGTGAGA GTAGCAGTAT GAGCTACCAT GACCAGATTC TGAACATGAA CAACCTTGCA TCCAAAGACT TAGACTACAA GACTGTCGGT AAGCATTTGG TGAATCTGGA AGATGCATTG AAGTTGCAAG GAGGCGATAT AGCTAGGCAC TTCTACCACC AGATCGAGAA CAAGGCAGAT GATTCGCATC CTGGAGGTAT CTCCATAGAG AACCCCAAAC GTAGAACTAG AAGTTCGTCC TTCTCTGCTT ACTTAACTGA AACAAGAAGA GAATCTACAG CCAGCGACAT CAACGTTCCT GGCGGCTTCA GACGAGAATT TTTGATTAAC CGTAGCATTC GTAACAACGA GGAGCCTCCA AACTTCTTAA CTAAGAACTT TGTCGAGTTT CTCAGCATAT ACGGCCATTT TGCTGGTGAA GATTTCACTG ATGACGAAGG TGGCGAAACT GACGATTCCG AGAGCGCAAA CTATGAAGAT GTATTTGACG AAGAGTCTTC GTTGTTGACC ACGGAACGTC GTAACCATTC GTATACTCAC CCCTCGCTAC CAAAACAGCT TGCCATTCCT AAACCTAAGT CTAGAGTCCA GCCTAAGGGT ACTGCCTCTG TCTTGAAGAC TTTCTTCTTG GTATTTAAGT CATTGGTAGG ATCGGGAGTT CTTTTCTTGC CCAGGGCTTT CTACAATGGA GGGTTAACCT TCTCTATCTT TGCGTTGAGT GGATTTGGAT TGTTGACTTA TTTCTGTTAC GTAGTTTTGA TCAAGTCTAA GAAAGTCTTG AACTTGACCT CGTTCGGTGA ATTGGGATAC AAGACATATG GCAGACCATT GAAGATATGT ATTCTCATCT CCATTATCAT TTCGCAGATC GGTTTCGTCG CTACTTACAT CCTCTTCACC GCTGAGAACA TGTTATCTTT CGTAAGTCAT ATCTTACCAA CCACTCCTGC ATTCTTGACT ACTGCCAACA TTGTTGCTGT ACAGTGCGTC TTCTTGATCC CGTTGGTTTT GATAAGAAAT TTGGCCAAGT TGTCGCTTGT CTCGTTGATA TCTTCGCTTT TCATCATGAT CGGCTTGTTT ATCATTTTCT ACTTCTCTGG TCTCAACCTC TTGAATAATG GAATGGGTCC TAACATCCAC CAATTCAACG CAAACAGTTG GTCCATGTTG ATTGGTGTTG CAGTAACTTC GTTTGAAGGC ATTGGATTGA TTTTACCTAT AGAGGCTTCC ATGGCTCAAC CAGAGAAGTT TTCCATGGTT CTCTCGGTCA GCATGCTTTT GATCACTATC TTATTTGTTG GTGTAGGAAC CATCGGTTAC ACTTCGTTTG GTGAAGATGT CAAATCTATC ATCATCTTGA ACTTACCCCA AGGAAACTTG GCCGTTCAAT CGATTTTGAT TCTTTATTCT TTGGCTGTGT TCTTGACAGC TCCTTTGCAG TTGTTCCCTG CCATTAAGAT AGGTGAGTCT TTGATCTTCA ACCGTAATCT GTCCAAGAGA AGTGGTAAGG ACGAAGAAGG AAGACTCTAT CATCAATCAG GAAAGTACAA CCCTCAGGTT AAGTGGCTGA AAAACTTGTT CAGAGCGCTT GCCGTTGCGG GGATCTGTAC TATTGCCTAT TTGAATGCAA ACAACATTGA CAAGTTTGTT TCTTTCAATG GATGTTTTGC TTGCATACCC TTGGTGTATA TTTATCCACC AATGATCCAC TTGAAGACTT TAAAACAGAA AAAGGAACGT TTCACCGCTT CCGACTGGGC CTTGTACATT GCCGACTATG CATTGATTGC CGTTGGTCTC TTGGCTGTTG TATACACCAC CTACCAAATA TTGGTACTCA ACTAGATAGA TTAGCAGTGG ACTTTAACAT AATAGAACAA TCATAGCTG
|
Protein sequence | MSSFRQNTNS SLSNSPFQHS PPIRLPTPTV SLHSSNSHSN LRRESSSMSY HDQILNMNNL ASKDLDYKTV GKHLVNLEDA LKLQGGDIAR HFYHQIENKA DDSHPGGISI ENPKRRTRSS SFSAYLTETR RESTASDINV PGGFRREFLI NRSIRNNEEP PNFLTKNFVE FLSIYGHFAG EDFTDDEGGE TDDSESANYE DVFDEESSLL TTERLQPKGT ASVLKTFFLV FKSLVGSGVL FLPRAFYNGG LTFSIFALSG FGLLTYFCYV VLIKSKKVLN LTSFGELGYK TYGRPLKICI LISIIISQIG FVATYILFTA ENMLSFVSHI LPTTPAFLTT ANIVAVQCVF LIPLVLIRNL AKLSLVSLIS SLFIMIGLFI IFYFSGLNLL NNGMGPNIHQ FNANSWSMLI GVAVTSFEGI GLILPIEASM AQPEKFSMVL SVSMLLITIL FVGVGTIGYT SFGEDVKSII ILNLPQGNLA VQSILILYSL AVFLTAPLQL FPAIKIGESL IFNRRLYHQS GKYNPQVKWL KNLFRALAVA GICTIAYLNA NNIDKFVSFN GCFACIPLVY IYPPMIHLKT LKQKKERFTA SDWALYIADY ALIAVGLLAV VYTTYQILVL N
|
| |