Gene PICST_31239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31239 
Symbol 
ID4838585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp297600 
End bp299123 
Gene Length1524 bp 
Protein Length507 aa 
Translation table12 
GC content41% 
IMG OID640389900 
Productpredicted protein 
Protein accessionXP_001384355 
Protein GI126135662 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0738] Fucose permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.144771 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.813098 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAATTA CTAAGCTAGA TAAGGCTCAT GTAGCCATAA CTATGAAGGC CATGAATGTA 
GGGGACAATA CATCTCATCG TCCAGGCTTC GAAAATCGAA ACTCTGATGA AGACATCAAT
GAATACGACG GAAGACATTC TTCTAATGTA GAACCCATTC CAGAACAAGA AGGCACTATT
CCAAAGTATA TGACTATTAA CAATCCTCCT AGAAATAAAT GGAGGTATTT ATCGTGTATA
TTTCTAGGTC TCTGTGTGGG TTTCAATGAT GCTGCCCCAG GTGCCTTGTT ACCTCATATG
GAGGTTTATT ATGGTATAAG TTACTCTGTA GCTTCTTTGA TTTGGGTAGC CTCAGCGACA
GGGTTTATAG TCGTTGCCTG TCTTGCTCAT AAAATCCAGC CGTGGATGGG AAAGAGGTAT
TCTTTAACTT TGGGATGTGC ACTTGGTGTT CTCAATTACC TAATTGTTGG TACCGGTACC
AAATACCCTG CAATAGTGGC CTCTTTTTTC TTCGGAGGGG CTGGTAGTGC TCTTCAAATA
GCCCAAACTA ATATCTTCGT TTCACGATTG GACAAAGCTT CCACTTATCT TTCCTTTTTG
CACGGTGCAT ATGGAATTGG TGCAACAATT TCTCCCTTAT TAGCTACTTC GATTGTTGCA
AGAGGAGTGA CATGGCATTA CTACTATTTT GTTCTCATGG GTATAATGTT CCCTAATATG
ATTGCAATCT TCTATGCGTT TAAAAATTCT GATGAAGATT TGAAACCCTG GGACGAAGAT
CCTAAAGATT TAGCTGTCTC TTATACAGCA GATGAGAGAT TGCGTGGTGA AGATGTTATA
GAAATGACGG ACATTAATGC TACTCAAGAA TATCCCCAAT CTCACCAAAA CTTGATGCTC
CTTGCACTTA AAACTCCTAC TACTTGGCTC ATCTGCTTTT TTGTTTTATT CTATCAAGGT
GCAGAGGTCG CTATGGGTGG TTGGATAGTT TCATTCTTCC TTGATTATCG CCATGGAAAC
CCGAAGTACG TTGGATATAC AGCTTCAGGT TACTGGGGAG GCTTGACTAT TGGAAGGCTA
TGTTTAACCA AGCCTTTACA CAAAACTCTT GGGGCTAGAA GGTCTGTTCT AGTTGTCTCT
TGCGGAGCTA TGATACTTGT AGCCCTTGTA TGGGCTGTCC CCAACCTCAT AGCTGAAGCA
GTCTTGGTAG CATTTGCAGG TATGATGACT GGGCCAAATT ACCCATTGCT TGTAGTGTAC
ACTGGACATG ATGGCTTAAT TCCTAGAAAA ATTCAAGTTA TCACTCTTAC TATTATGTCA
GCTTTTGGCT CTTCTGGGGG TGCAATTTTC CCTTTCATTG TGGGTTTGAT TTCTCAAACG
ACTGGAACAT TTGTTGTTTT GCCTATATTT ATCATATTGT ATTGCTTGGT TATTGTGATG
TGGGCTTGTC TCCCAAATTT GGAAAGAAGA AATTCTACTC CGTCTAAGAG TTCGATATTG
AGATTGTGGG AAAGAATTTG GTAG
 
Protein sequence
MTITKLDKAH VAITMKAMNV GDNTSHRPGF ENRNSDEDIN EYDGRHSSNV EPIPEQEGTI 
PKYMTINNPP RNKWRYLSCI FLGLCVGFND AAPGALLPHM EVYYGISYSV ASLIWVASAT
GFIVVACLAH KIQPWMGKRY SLTLGCALGV LNYLIVGTGT KYPAIVASFF FGGAGSALQI
AQTNIFVSRL DKASTYLSFL HGAYGIGATI SPLLATSIVA RGVTWHYYYF VLMGIMFPNM
IAIFYAFKNS DEDLKPWDED PKDLAVSYTA DERLRGEDVI EMTDINATQE YPQSHQNLML
LALKTPTTWL ICFFVLFYQG AEVAMGGWIV SFFLDYRHGN PKYVGYTASG YWGGLTIGRL
CLTKPLHKTL GARRSVLVVS CGAMILVALV WAVPNLIAEA VLVAFAGMMT GPNYPLLVVY
TGHDGLIPRK IQVITLTIMS AFGSSGGAIF PFIVGLISQT TGTFVVLPIF IILYCLVIVM
WACLPNLERR NSTPSKSSIL RLWERIW