Gene PICST_63832 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_63832 
SymbolHOL31 
ID4840970 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp750355 
End bp752123 
Gene Length1769 bp 
Protein Length570 aa 
Translation table12 
GC content41% 
IMG OID640392285 
Productputative ion transporter 
Protein accessionXP_001386551 
Protein GI150866826 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0755002 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0915689 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCTGA CAATTGATAT AGAAAAGACA ACAACAGCCA AGTCTCTCAA CTACGACTTT 
GTACCTGGGA CCGTCCACCT AGTGGATGTT ATTGGGAACT TGAGCGTCAA GAAAGATGAT
GGTGGCGAAG ATATCATACT TCAGCCCCAA CCCACTTCAA ACATCAATGA TCCCCTTAGA
TGGTCCAAGG GGAAGAAGAG GTTTCAATTT TTTCTATTAT GGTTATGGTA TGTCTCGATA
CATTCTTTCT TACAACTAAT CAATACTAAC TTTTTGTTTT TAGGAGTTTT CTTCTAGCAG
TTTCTCTTAA TTTCTCTGGT CCCTTGTTTG TTATTTGGCT GGTGGAATTG AAAACGACTT
TCTTCAAATT GAATGTCCAC ATGGCCCTTG GTTTCTTATT CCTTGGTCTT GGGTGTCTCT
TCTTACAACC CACTGCTCTC AAGTTGAGTA GAAGATTTAT TTATTTGTGC TGCACTATCA
TAGCAATTGT AGGTAATGCA GTTGGTTCTC AAGCTACCAG TATTAACTTC TTGTATGTGG
TGAAAATTTT AGTTGGTTTG GCAGCTGCTC CAGTTGACTC GTTGGTTGAG ATTTCTTCCA
CGGATGTTTT CTTCTTACAC GAAAGATCGA GAGCATTCAG TTTGATATTA TTGGCATTGT
ATGGTGGAAG TAATTTGGGT CCTGTTGCCT GTGGGTACAT TGTGCAAACT TTGAGCTGGA
GATGGTGTTT CTATATCCAA ATCATAATTT TAGGAGTTAT GTTTTTCATT CTTCTCTTTT
TGTTGGAAGA CACTTCTTTC AGAAGGGACT TCGACGAAGG TGAGTTAGAG AACAAGATCT
TGGAGCAGAT TAAGTCTAAT GATTCAATGG CTCAAAGAGA TAAGCCTCAG ACTGAAAAGG
GCCATAAATC AGGTGGAATC ATAACCAACT TGAATGAAGT TGTTGATTCT TCTAGTGATG
ATGTCTCTCT AGACAATACC ATTCCTCCTA GAACATACTG GCAAAGAATG CAACTCATTC
AGACGCATTA TAACGATACA AGATCATGGA TCACTATTTT CTATAGACCT TTCTTGTTAG
CCAGCTTTCC AGCTATTATC TGGGGTAGTG TTCTCTATGG TGCACAAATG ATGTGGTTGT
CCTTCTTAGG AGTCACACAA GCACAGATTT ATTCAGCCCC TCCTTACAAC TTCAGTCCAT
CTCTGACTGG ATTGACAAGT GTAGGTGCCT TTGTTGGAAA TTTAATTGGT ATGGTTTACG
GTGGCAACTT CGTCGATTGG ATGACTTTGA AGATGGCCAA GAGGAACCAT GGTATCTTGG
AGCCGGAATT CAGATTGTAC GCTATGATTC TTCCAACCAT CTGCAATGCG GCTGGCCTCT
TGGCCTACGG GTTAGGTTCA TACTATGGTG CACACTGGGC TATTTCAGTT GTCATCGGGC
AAGTCTTTCT TGGATTTGCC ATGAGTGCAG CCGGATCCAT CTGTTTGACA TATGCTGTTG
ATTCATACCA TAATGTTGCA AGTGAAAGTC TTGTGTTGAT GCTTTTCATC AGGAACATGA
TTGGTATGGG ATTCACCTTT GCTATTCAGC CTTGGTTGGT GAGTAACGGA TTGAAGACTG
TGACATGGTT GATGTTCATG CTCTCAATTG TCATCAATGG TTCTTTCATC TTTATGATTA
AGTACGGAAA GAGCATGAGA AGGTGGACAG CCACAAGATA CGAGAAGTAT TCTGATCTTA
ACTACGGAGA ACTCTTCCCT AGAAAGTAA
 
Protein sequence
MTSTIDIEKT TTAKSLNYDF VPGTVHLVDV IGNLSVKKDD GGEDIILQPQ PTSNINDPLR 
WSKGKKRFQF FLLWLWSFLL AVSLNFSGPL FVIWSVELKT TFFKLNVHMA LGFLFLGLGC
LFLQPTALKL SRRFIYLCCT IIAIVGNAVG SQATSINFLY VVKILVGLAA APVDSLVEIS
STDVFFLHER SRAFSLILLA LYGGSNLGPV ACGYIVQTLS WRWCFYIQII ILGVMFFILL
FLLEDTSFRR DFDEGELENK ILEQIKSNDS MAQRDKPQTE KGHKSGGIIT NLNEVVDSSS
DDVSLDNTIP PRTYWQRMQL IQTHYNDTRS WITIFYRPFL LASFPAIIWG SVLYGAQMMW
LSFLGVTQAQ IYSAPPYNFS PSSTGLTSVG AFVGNLIGMV YGGNFVDWMT LKMAKRNHGI
LEPEFRLYAM ILPTICNAAG LLAYGLGSYY GAHWAISVVI GQVFLGFAMS AAGSICLTYA
VDSYHNVASE SLVLMLFIRN MIGMGFTFAI QPWLVSNGLK TVTWLMFMLS IVINGSFIFM
IKYGKSMRRW TATRYEKYSD LNYGELFPRK