Gene Information |
Locus tag | PICST_46289 |
Symbol | |
ID | 4839480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 685245 |
End bp | 686564 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640390795 |
Product | predicted protein |
Protein accession | XP_001384796 |
Protein GI | 150865538 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
| |
Plasmid Coverage Information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.862046 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCCCG TACGGTCCTT CTTTCAAGCA ACGAGAGATA TAAGAATATT ATGGGGTTCA GTCTTTTTTA GAATGGCAGG TTATGGACTT ACCAACCAAG TTTTGACATT GCATTTGGAA GCTCTTGGTA TATCTGAGTC AAATATTGGT CTATTTATGT CGCTTACATT AGTGGGAGAT ACCATGATAT CATATTTTTT GACCTGGAAT GCTGATAGAA TAGGAAGGAA GCGTGTTATG ATGTTTGGTA CGCTTTTGAT GTTTACATCT GGATTCACAT TTGCATTCAG TTCCAACTTT TTGGTATTGT TAACAGCAGC AATCTTGGGT GTCATTTCTC CTTCTGGTGA TGAGACTGGT CCGTTTAAAT CTGTCGAAGA GGCTTCAATT GCCCATTTGA CTCCAGAGAA CCATAGACCC GAAGTGTATG CATTTTATGG AGTATTTGCA ACTGCCGGGG CTGCTTTTGG ATCCCTAATT TGCGGATTCT TGGTAGATTA CATGAATGCT TCTGTGGGGT TCCCGATTGA AAAATGCTAC AGAATTATCT TTTTGGTATA CACTGGGATT TCAATTATCA AATTCATCTT GGTGTGCTTT CTCTCTCCAA AATGTGAAAT ACATAACCAC GACATGGACA ACATAGCAAG TGAAGAAAAT TCTTTATTAG AGGCAGTCGA ACAAGATACT ACAAAGCTGA ATTTCGTCAG CTTATCAGAC AGAACATTTT ACTTACTACC AAGATTGCTC GCCATTTTCA TGTTGGATTC TTTGGGGTAC GGATTTATGA CATCTACGTG GATCGTGTAT TACTTGAAGA AGACATTTGA AGCTACTGCT ACCGGGCTTG GGTTGTTATT CTTTTTAACT AATACTGTTA ATTCCATTTC TTCATTGCCC TCGGCTTATT TAGCCAAATT ATTGGGCCCT GTGAGGGCTA TCTTGTTCAC CCAAGCTCCT TCGGGGGCAT TCTTCATTGT TGTTGCGTTC CTTTCTAACT TTTATTCAGC TTCATTCTTT TTGCTCTTAT ACTACATTAC CACTAGTATG GATGTCGTTC CTCGACAGAT TTTGCTAACT TCTCTTATGC CAAGGGAAGA GTTAACTAAG GTCATGGGAA TTGTGAACAT CGGTAAGACA TTCGCTAGAT GCATTGGGCC AATATTTACA GGTAAGTTCG CTGCACATGG CGTTCTACAC TATGGATTTA TAATCAACGG TGGTTGTGTA CTTTTGGCAG ACTTAATATT GGCTACAAAC TTTTTGCATG TTGATGCTGA AATATTACAT AAACAAAGCA TTGATGCTGG GTTTGATTGA
Protein sequence | MHPVRSFFQA TRDIRILWGS VFFRMAGYGL TNQVLTLHLE ALGISESNIG LFMSLTLVGD TMISYFLTWN ADRIGRKRVM MFGTLLMFTS GFTFAFSSNF LVLLTAAILG VISPSGDETG PFKSVEEASI AHLTPENHRP EVYAFYGVFA TAGAAFGSLI CGFLVDYMNA SVGFPIEKCY RIIFLVYTGI SIIKFILVCF LSPKCEIHNH DMDNIASEEN SLLEAVEQDT TKSNFVSLSD RTFYLLPRLL AIFMLDSLGY GFMTSTWIVY YLKKTFEATA TGLGLLFFLT NTVNSISSLP SAYLAKLLGP VRAILFTQAP SGAFFIVVAF LSNFYSASFF LLLYYITTSM DVVPRQILLT SLMPREELTK VMGIVNIGKT FARCIGPIFT GKFAAHGVLH YGFIINGGCV LLADLILATN FLHVDAEILH KQSIDAGFD
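
The coordinate, length, and translation-table fields above can be cross-checked with a short script. The following is a minimal sketch, not part of the database record: the names `STANDARD`, `CODE_12`, and `translate` are hypothetical, and it assumes that translation table 12 (the alternative yeast nuclear code) differs from the standard code only in reading CTG as serine instead of leucine. Start-codon handling is ignored. The two excerpts are copied from the gene sequence in this record.

```python
# Illustrative consistency check of the record's fields; helper names are hypothetical.
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
STANDARD = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: table 12 differs from the standard code only at CTG (Ser, not Leu).
CODE_12 = {**STANDARD, "CTG": "S"}


def translate(dna, code=CODE_12):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # the record prints bases in blocks of 10
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = code[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Coordinates are inclusive, so length = end - start + 1.
start_bp, end_bp = 685245, 686564
gene_length = end_bp - start_bp + 1
# One codon is the stop codon, which is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)                        # 1320 439
print(translate("ATGCATCCCG TACGGTCCTT CTTTCAAGCA"))      # MHPVRSFFQA
print(translate("ACAAAGCTGA ATTTCGTC"))                   # TKSNFV
print(translate("ACAAAGCTGA ATTTCGTC", STANDARD))         # TKLNFV
```

The second excerpt starts at base 691 of the gene and is in frame. Under the standard code its CTG codon would give L, whereas the protein sequence in this record shows S at that position (TKSNFVSLSD), which agrees with the Translation table value of 12.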