Gene PICST_78175 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_78175 
Symbol 
ID4839399 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp332594 
End bp334581 
Gene Length1988 bp 
Protein Length584 aa 
Translation table12 
GC content43% 
IMG OID640390714 
Productpredicted protein 
Protein accessionXP_001384723 
Protein GI150865487 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.586636 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCCAT ATCAATATGG CTGGTAAACA CAAGAAGCTC ACCTTCAAGG AGCAGATGCA 
GGGGTTCCCT GTGAGACAAA TGTTCATTAT CAGTATCATC CGTTTCTCGG AACCCTTGGC
TTTCACGTCT CTTTTCCCAT ACTTGTATTT CATGATTCGT GATTTCCACA TCGCAGAACG
TGAGGAGGAT ATCTCCAAGT ACAGTGGGTA CCTTGCATCA TCGTTCGCCT TCTGCCAGTT
CCTCTTTGCT GTCAGATGGG GGAAACTCAG TGATCGTATA GGTAGAAAGA TCATCTTGTT
GATAGGCTTG TTTGGTACTT CACTCTCGTT GATTTGTTTT GGATTCAGTC AGACTTATTG
GACAGCATTA ATATCGCGTA GTTTGGCTGG TGTTTTGAAC GGAAACGTCG CGGTTTTGAG
AACGATGATT GGGGAAATCG CTACGGAAAG AAGACACCAA GCCTTGGCTT TCTCGACACT
TCCACTCTTG TTCAATTTCG GTAGTATCAT AGGTCCAGCC ATTGGAGGTT CCAAGTACTT
GACCCATCCT CGCAAGGAAA ACCCGTACCA CCCAGGGGAA ACTATGGCAT TGTTAGAAGA
AGGACTTTCG TTTTACGAGC GTTTCATCAG GAAGTATCCT TATGCCTTGT CCAACATTGT
AGTGAGTTGC TTCTTATGGT TCTCTTTAAT TTGTGGATTT TTGTTTCTTG AAGAGACCCA
TGAAGTTTGC AAGTACAGAA GGGATTACGG AGTAGATTTG GGGGACTGGT TGTTGTTGAA
ATTGGGCTTC CATCCACCGG TCAGACCATG GCACCATCAT AAACACACTA GTGCTGAAGC
ACAAATAGCT TCTGAAAGCT CTCCTTTATT AGGAAATTCC AGTCTGGATA ACAACCAAGA
TAATGCCTCT ATTTCTTCAG AAACTGTTGA TGAAGACGAT GATGATGATA TCCAAAGTGT
CCGTCCTTAT ATCTCTAGAA GAATGTCACA GGCTATTGTA AAGACTTACT CCATGAGTGA
ACACGAAATT GAGGTACATA GACCTTCTTA CGCTAATGCC TTCACATCCC GAGTAATCAC
GGTCATCACA GGAAATTTCA TCATCTCTTT ACATAACGTC ACCTACAACG AATTCTTGCC
AGTTTTTTTG GCATCTAGAT TCCGTAAAGA CGGGCTCAAA TTTCCGTTCA GAATAGAAGG
TGGATTTGGT CTTGATGTCA GTTACATTGG TACTTTGTTT TCTTCTACAG GTATCATGGG
AATGTTAATT GTCCTTCTAA TCTTCCCCAT GATCGACCGT AACCTTGGAA CCATTAACGG
GTATAGATTG TCAGTATCAA TCTTCCCATT CGTCTACTTC ATGGTACCCT TAGCCATCTT
TACTTTGCAC GACTACAACC CAGCCTTCCC CAAGTGGGTT ACCCCAGTTA TACTATACAC
GTTCACGTCA TTGAAGACGT TGGCTAGTGC AACAGGGATG CCTCAGGTTA TGCTCTTGAA
CCACCGTGCT GCTGCCAAGG AACACCGTGC CTATGTGAAC AGTGCTACTA TGAGTATTAT
TGCTCTTGCC AGGTGTACTG GTCCGATTGT GTTTGGTTAC TTGATGTCTC TAGGTGACAA
GCTTTCTACT GGGGAATTGA TCTGGTGGGT TATGTCTTTG CTTGCCGGAT TTGGAATGAT
CCAGTCTTAC TGGATGGAAG ATTACGACGA CGATGAAGAT GAAAAAGTTC AACCTGAGCC
GGCTGCCGAT CTCTTCGAAG CAGAAGCCTT GGGTGAAGAA GACATGCAAC AAGACTTCAC
TACAATAGAG GTTTCTGATA ACGAGCAGTC TAGCTTGTCT ACCAATTAGA GAACATATCT
AAAATGATAA TTCAAAAAAC TTAATTGACA TATTGAATAT GTGCATCGAA AGGAACTCTT
CAAATATCAT AGAATTGTCT ATATCCATAT ATATTAGAAA CATATAGAAA CGTACGGAGT
TACCCTTC
 
Protein sequence
MAGKHKKLTF KEQMQGFPVR QMFIISIIRF SEPLAFTSLF PYLYFMIRDF HIAEREEDIS 
KYSGYLASSF AFCQFLFAVR WGKLSDRIGR KIILLIGLFG TSLSLICFGF SQTYWTALIS
RSLAGVLNGN VAVLRTMIGE IATERRHQAL AFSTLPLLFN FGSIIGPAIG GSKYLTHPRK
ENPYHPGETM ALLEEGLSFY ERFIRKYPYA LSNIVVSCFL WFSLICGFLF LEETHEVCKY
RRDYGVDLGD WLLLKLGFHP PVRPWHHHKH TRNSSSDNNQ DNASISSETV DEDDDDDIQS
VRPYISRRMS QAIVHRPSYA NAFTSRVITV ITGNFIISLH NVTYNEFLPV FLASRFRKDG
LKFPFRIEGG FGLDVSYIGT LFSSTGIMGM LIVLLIFPMI DRNLGTINGY RLSVSIFPFV
YFMVPLAIFT LHDYNPAFPK WVTPVILYTF TSLKTLASAT GMPQVMLLNH RAAAKEHRAY
VNSATMSIIA LARCTGPIVF GYLMSLGDKL STGELIWWVM SLLAGFGMIQ SYWMEDYDDD
EDEKVQPEPA ADLFEAEALG EEDMQQDFTT IEVSDNEQSS LSTN