Gene PICST_31966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31966 
Symbol 
ID4839631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp335340 
End bp336950 
Gene Length1611 bp 
Protein Length536 aa 
Translation table12 
GC content44% 
IMG OID640390946 
Productpredicted protein 
Protein accessionXP_001384724 
Protein GI150865488 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGGGT TGACAGTACG AGAACAGTTG GAAGATTTCC CCGTCTTCCA GATGTCCATC 
ATCTCCATCA TCCGGATAGC CGAACCAGTT GCTTTTACCT CCATGTTCCC CTATATATAC
TTTATGATTA AGCAATTCGG AGTAGCAGCC AAGGAAGCCG ATATTTCGCG GTACAGTGGG
TACTTGGCAT CGGCTTTTTC TTTCAGTCAG TTTCTCTGTT CGGTACTGTG GGGTAGGGCG
TCGGAACGTT TTGGGCGCAA ACCAGTTCTC CTAGTGGGTC TAATGGGTAC AGCAATATCG
ATGATTGTGT TTGGATTCAG CACCAATTTC TATATTGCGT TTTTAGCCCG GTTGATGATG
GGATCCTTGA ATGGAAATGT CTCAGTTATC AGAACAACCA TTGGTGAAAT TGCTGTAGAA
AGAAGACACC AGTCTATAGC CTTTAGCTCT CTTACTTTAT TATGGAGTAC AGGAGCAATT
GTTGGATCTT GGTTGGGTGG AGTTTTGACA GACACAGAAA ACTTACCAGA GCAAATTGGA
CAGGGTCCCA AGGGCTCTAG TCTATTAGAA AGATACCCGT TTGCACTTTC CAACATCGTA
GTAGCAGGAG TTTTGTGCAC TAGCATTGTA ATTGGGTGGC TTTTCTTCGA AGAAACCCAC
GAACATAAAC GATTTGATAG AGACAGAGGT CTTGAAGTGG GGGATTATAT TAGATCGAAA
CTTGGTCTAG AACAGCCTTT AAGACCATGG AGGAAATATT CCAATGTCTA CGACCAACGT
CGCCGTCCAG AAAGACTAAT GAGTGATTAT GATGGAAATG AAATGGACAG AAGTAGCAAT
TCTTCGGAAA GTATAGAGCT ACAATTGTAT TCGCTGATAG ATCCAGACGC AGAAAGAGCA
GGAGCAATTC CTGGCTCAAA GCCTACTCAT GTAGATTATG TCGGCGCCTT TACCTGGCCA
GTCATAAATA CCATCCTCAG TCATTTCATA CTATCTTTCC ATAACTTGGT GTATTCCGAA
TTCCTTCCTG TACTTCTTGC AGGCAAGATT CAGCTTAAGG ATTTGCAATT TCCATTCAAG
ATCAAGGGTG GCTTTGGGTT CTCCTCTGAT ACAATTGGGA TGATCCTTTC GCTCACTGGG
ATAGTTGGGA TCTTGGTGGT AATATTTGTC TTCCCCATTA TAAACACCTA CTTCAGTACT
ATAAATGGGT ACCGAGTGGC ACTTATATCT TTCCCCATTT CACTTGTAAT TCTACCACTA
TTGGTGTTTA CACTTCCAGA ATACAACTCT CATATTCCGA ACAAGTTCTT TACAGGAGTT
TGTTTGTATA TGATTACAGG TTTGAATACG TTTTCTGGTG CTACAGCATT CTCCCAGATC
ATCATCTTAA TCCATAGAGC TCTGCCCAAG AAGTACCGTG CACTCATCAA CGGCTACACG
TTGAGCATCA CAGCACTAGC TAGATGTCTT GCACCCATTA TCTGGGGCTG GATCATCTCC
AAGTTTGACC AGATGGGCTA CAGCGGAGTA TCGTGGTGGC TCTTGTCGTG TATAGCCATA
GGTGGCTTCT TCCATTCGTT CGTACTAGAG GACTACCAGG AGGAGATTTA G
 
Protein sequence
MQGLTVREQL EDFPVFQMSI ISIIRIAEPV AFTSMFPYIY FMIKQFGVAA KEADISRYSG 
YLASAFSFSQ FLCSVSWGRA SERFGRKPVL LVGLMGTAIS MIVFGFSTNF YIAFLARLMM
GSLNGNVSVI RTTIGEIAVE RRHQSIAFSS LTLLWSTGAI VGSWLGGVLT DTENLPEQIG
QGPKGSSLLE RYPFALSNIV VAGVLCTSIV IGWLFFEETH EHKRFDRDRG LEVGDYIRSK
LGLEQPLRPW RKYSNVYDQR RRPERLMSDY DGNEMDRSSN SSESIELQLY SSIDPDAERA
GAIPGSKPTH VDYVGAFTWP VINTILSHFI LSFHNLVYSE FLPVLLAGKI QLKDLQFPFK
IKGGFGFSSD TIGMILSLTG IVGILVVIFV FPIINTYFST INGYRVALIS FPISLVILPL
LVFTLPEYNS HIPNKFFTGV CLYMITGLNT FSGATAFSQI IILIHRASPK KYRALINGYT
LSITALARCL APIIWGWIIS KFDQMGYSGV SWWLLSCIAI GGFFHSFVLE DYQEEI