Gene PICST_44044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_44044 
SymbolSGE1.1 
ID4837958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp675801 
End bp677120 
Gene Length1320 bp 
Protein Length440 aa 
Translation table12 
GC content42% 
IMG OID640389273 
Productsuppressor of gal11 null 
Protein accessionXP_001383765 
Protein GI150864791 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.333138 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATAG CTGGACTAAC AGACCAGTCG CAGACTTTGG ATAGAAAGAA GCTCTTATTG 
ACATTATCTT GTTTGACAGC TTCTTTGTTC GTGTCTTTCT TTGACCAGAC TGCTGTCTCC
ACGGCTATCC CTTCCATTTC GAAGGATCTT CACCATGCGT TTCTCATTAA TAGTTGGACA
TCGACTTCGT ACTTGATAGC TAACACAAAT TTCCAATTGC TTTATGGTCG ATTTTCGGAC
ATCTTTGGAA GAAAACAAAT ATTTGTCTTC AGTTTGGTTT GTCTCATGGT AGGTGACTTA
GGCTGTGGAT TCGCCAAAAA CCCAACCATG CTTTTCATCT TCAGAGGCTT GTCGGGTATA
GGTGGAGGAG GTGTAAATTG CTTGGTGATG ATCACATTTT CTGATTTGCT TTCCCCTCGA
CAAAGAGGAA AATATTTTGG AATAGTAGCT GCAGCCACCT CTGCTGGTAA TGGCATTGGA
CCATTCATTG GAGGATTGTT GTCAGAACAC GCTTCCTGGA GGTGGGCTTT TTGGTTAAGT
TGTCCCATTT GCTTGGTTTG TGGTCTCTTG TTGATATTAT TTGTGCCCTT GAAACCAGTC
GAAGGCTCTT TCAAAAAGAA AATCAAGTTG ATCGATTGGT TTGGTTTCAT TACGAGCATG
ATCTTCTCTG TGTTGTTTCT TGTGGCAATT TCTGGTGGAA ATGAGTCGTG GCCCTGGAAG
TCAGCAACAT TTATCTCGCT CATAATAATC AGCTCCATCG CCTTTTTCTG TTTCATTGGC
GTTGAACAAT ACTATGCTGA AATCCCCTTG ATCCCTTTGC GTCTCTTCAC AGACTTACAA
AGATTCTTAT TATTTTTGCT GTGTTTCTCG ATGGGATTGG CATATTTTGT GGATATATAT
TATTTGCCCT TGTACTTGCA AAACTACAGA GGCTGGCAAC CTATGATAGC TGGTGTCATC
CAGTTACCTG CGACTTGCAC AAGTAGTATT TTTGGAGTTG TGGTAGGACA AATCAATAGT
AGAACTGGCA GGTACGTTCA ATGTTTATGG GCTGGTGGTG CATTATGGGC TCTTGGAAGT
GGGTTGAAAT TGATGTACGA TTCAAACACC TCAATTGGCT ACATTGTGGG AACCAACATT
ATCCAAGGTT GTGGCATTGG CTTTACTTTT CAACCAACAT TACTTGCCCT TTTGGCTAAT
TCAGATTCAG CAGACCGTGC TGTTGTTACA GGGTTACGGA ACTTCTTCAG GTGCTTTGGA
GGCTCCGTTG GTCTCGTTAT CAGTGGAATT GCCTTCAATG CTACTCTTAG AAGCCAACTA
 
Protein sequence
MTIAGLTDQS QTLDRKKLLL TLSCLTASLF VSFFDQTAVS TAIPSISKDL HHAFLINSWT 
STSYLIANTN FQLLYGRFSD IFGRKQIFVF SLVCLMVGDL GCGFAKNPTM LFIFRGLSGI
GGGGVNCLVM ITFSDLLSPR QRGKYFGIVA AATSAGNGIG PFIGGLLSEH ASWRWAFWLS
CPICLVCGLL LILFVPLKPV EGSFKKKIKL IDWFGFITSM IFSVLFLVAI SGGNESWPWK
SATFISLIII SSIAFFCFIG VEQYYAEIPL IPLRLFTDLQ RFLLFLSCFS MGLAYFVDIY
YLPLYLQNYR GWQPMIAGVI QLPATCTSSI FGVVVGQINS RTGRYVQCLW AGGALWALGS
GLKLMYDSNT SIGYIVGTNI IQGCGIGFTF QPTLLALLAN SDSADRAVVT GLRNFFRCFG
GSVGLVISGI AFNATLRSQL