Gene PICST_58666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_58666 
SymbolMFS42 
ID4838416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp880203 
End bp881399 
Gene Length1197 bp 
Protein Length398 aa 
Translation table12 
GC content45% 
IMG OID640389731 
ProductMFS transporter 
Protein accessionXP_001384121 
Protein GI150865064 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.210164 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ACCATCACTA TGTCAGACTT GATCCCCTTG AGAGACCGAG GCTTCTACCA GGGCTTGGCG 
AATGTATTCT TTGGATTAGG CTCTGCTTCC GGAGGTATCA TCGGTGGATT GGTAGCTGAC
CATTTGGGCT GGAAATATGT TTTCATCTTA CAAGTGCCTT TGGCTGTGAT AGTAGGTCTT
GCGCTCTACT GGAACCTCAA CTTACCCCCT GGCTCTCCCG GTTTGGGAGC ACACGGTGAG
GATATCAAAC AGAAGTTAAA GAGAGTCGAC TTCTTGGGCT CATTCTTTTT GGTCAGCTCC
CTTATGTGTG TGTTGATAGC GGCTTCCTTA GGAGGTAAAG AGATTTCCTA TTCGTCCAAG
TCGTTTATCG GTTTGTGCAC AGCTTCGGCT CTCTTGTTGG GTGGCTTTAT ATATACCGAA
GCTTATATTT CTGCTGAGCC TATCATCCCT ATTGAGTTGT TAGGAAACAG AACTGTGTTA
TCATCTTCTT TGGCAAACTG GTTCTACACA ATGGGAGTGT TTTCCTACCT CTTCTACGTT
CCAGTCTACT ATACTTCGGT TATGGACTTG ACAGCTACGC AAAACGGGCT GAGATTGATC
CCAAATTTCT TTGGTGTATC CCTTGGTTCT ATTGGAGCTG GTATCTACAT GAAGAGAACT
GGACGTTACT ACAAGTTGAC TGTTCTCGTT GGTATAATAT CGATCTACGG TGTTCTCAAG
ATTTTCTTCA TCAATCCAAA CATCTCCTTA TTTGAACAGT TTACATTGTT GTTGCCTTCT
GGCTTGGGAT ACTCGTGTGT TTTGACAGTG ACACTTTTGT CGTTGATTGC AGCTGTGCCT
TCGAAGTACC AGGCATGTAC TACTTCCATC CAGTACACTT TCAGATCGAC TGGTTCGACT
TTGGGAGTAG CTGTTGCCTC TGCTTTGTTC CAGAATGTCT TGAGATCCAA CTTAACATCT
AAGATCCATG CGTTGATCCC AGATGTCAAC GAGGCAAACG AGATCATCAC CAAGGCTTTG
GCCAACACAA ACTACACACA TGAAGCCCCC GAAATTGTCA GAGCAGCCAT CAGAGAATCG
TATGCCTTGG GCTGTAAGGG TGCCTTTGCC GTAAGTGTGG TAACTGTATC TGTCGGATAC
TTTTCTTCGT TGTTCATGAG AGAACACAAG CTCCATACCA GTGTGAATAG AGATTAA
 
Protein sequence
TITMSDLIPL RDRGFYQGLA NVFFGLGSAS GGIIGGLVAD HLGWKYVFIL QVPLAVIVGL 
ALYWNLNLPP GSPGLGAHGE DIKQKLKRVD FLGSFFLVSS LMCVLIAASL GGKEISYSSK
SFIGLCTASA LLLGGFIYTE AYISAEPIIP IELLGNRTVL SSSLANWFYT MGVFSYLFYV
PVYYTSVMDL TATQNGSRLI PNFFGVSLGS IGAGIYMKRT GRYYKLTVLV GIISIYGVLK
IFFINPNISL FEQFTLLLPS GLGYSCVLTV TLLSLIAAVP SKYQACTTSI QYTFRSTGST
LGVAVASALF QNVLRSNLTS KIHALIPDVN EANEIITKAL ANTNYTHEAP EIVRAAIRES
YALGCKGAFA VSVVTVSVGY FSSLFMREHK LHTSVNRD