Gene PICST_46289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_46289 
Symbol 
ID4839480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp685245 
End bp686564 
Gene Length1320 bp 
Protein Length439 aa 
Translation table12 
GC content39% 
IMG OID640390795 
Productpredicted protein 
Protein accessionXP_001384796 
Protein GI150865538 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.862046 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCCCG TACGGTCCTT CTTTCAAGCA ACGAGAGATA TAAGAATATT ATGGGGTTCA 
GTCTTTTTTA GAATGGCAGG TTATGGACTT ACCAACCAAG TTTTGACATT GCATTTGGAA
GCTCTTGGTA TATCTGAGTC AAATATTGGT CTATTTATGT CGCTTACATT AGTGGGAGAT
ACCATGATAT CATATTTTTT GACCTGGAAT GCTGATAGAA TAGGAAGGAA GCGTGTTATG
ATGTTTGGTA CGCTTTTGAT GTTTACATCT GGATTCACAT TTGCATTCAG TTCCAACTTT
TTGGTATTGT TAACAGCAGC AATCTTGGGT GTCATTTCTC CTTCTGGTGA TGAGACTGGT
CCGTTTAAAT CTGTCGAAGA GGCTTCAATT GCCCATTTGA CTCCAGAGAA CCATAGACCC
GAAGTGTATG CATTTTATGG AGTATTTGCA ACTGCCGGGG CTGCTTTTGG ATCCCTAATT
TGCGGATTCT TGGTAGATTA CATGAATGCT TCTGTGGGGT TCCCGATTGA AAAATGCTAC
AGAATTATCT TTTTGGTATA CACTGGGATT TCAATTATCA AATTCATCTT GGTGTGCTTT
CTCTCTCCAA AATGTGAAAT ACATAACCAC GACATGGACA ACATAGCAAG TGAAGAAAAT
TCTTTATTAG AGGCAGTCGA ACAAGATACT ACAAAGCTGA ATTTCGTCAG CTTATCAGAC
AGAACATTTT ACTTACTACC AAGATTGCTC GCCATTTTCA TGTTGGATTC TTTGGGGTAC
GGATTTATGA CATCTACGTG GATCGTGTAT TACTTGAAGA AGACATTTGA AGCTACTGCT
ACCGGGCTTG GGTTGTTATT CTTTTTAACT AATACTGTTA ATTCCATTTC TTCATTGCCC
TCGGCTTATT TAGCCAAATT ATTGGGCCCT GTGAGGGCTA TCTTGTTCAC CCAAGCTCCT
TCGGGGGCAT TCTTCATTGT TGTTGCGTTC CTTTCTAACT TTTATTCAGC TTCATTCTTT
TTGCTCTTAT ACTACATTAC CACTAGTATG GATGTCGTTC CTCGACAGAT TTTGCTAACT
TCTCTTATGC CAAGGGAAGA GTTAACTAAG GTCATGGGAA TTGTGAACAT CGGTAAGACA
TTCGCTAGAT GCATTGGGCC AATATTTACA GGTAAGTTCG CTGCACATGG CGTTCTACAC
TATGGATTTA TAATCAACGG TGGTTGTGTA CTTTTGGCAG ACTTAATATT GGCTACAAAC
TTTTTGCATG TTGATGCTGA AATATTACAT AAACAAAGCA TTGATGCTGG GTTTGATTGA
 
Protein sequence
MHPVRSFFQA TRDIRILWGS VFFRMAGYGL TNQVLTLHLE ALGISESNIG LFMSLTLVGD 
TMISYFLTWN ADRIGRKRVM MFGTLLMFTS GFTFAFSSNF LVLLTAAILG VISPSGDETG
PFKSVEEASI AHLTPENHRP EVYAFYGVFA TAGAAFGSLI CGFLVDYMNA SVGFPIEKCY
RIIFLVYTGI SIIKFILVCF LSPKCEIHNH DMDNIASEEN SLLEAVEQDT TKSNFVSLSD
RTFYLLPRLL AIFMLDSLGY GFMTSTWIVY YLKKTFEATA TGLGLLFFLT NTVNSISSLP
SAYLAKLLGP VRAILFTQAP SGAFFIVVAF LSNFYSASFF LLLYYITTSM DVVPRQILLT
SLMPREELTK VMGIVNIGKT FARCIGPIFT GKFAAHGVLH YGFIINGGCV LLADLILATN
FLHVDAEILH KQSIDAGFD