Gene PICST_50085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_50085 
Symbol 
ID4840436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp563841 
End bp566576 
Gene Length2736 bp 
Protein Length911 aa 
Translation table12 
GC content39% 
IMG OID640391751 
Productpredicted protein 
Protein accessionXP_001386124 
Protein GI126139203 
COG category 
COG ID 
TIGRFAM ID[TIGR00727] small oligopeptide transporter, OPT family
[TIGR00728] oligopeptide transporters, OPT superfamily 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.100109 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACAG AAGAAAAGAA TACTACAGAT AAAAAGGAAA AACTTGACAA TATCACCAGC 
ATTCGAAGTG TCGGAAGTTA TTTGCAACTT GAGGATCATG AAATTGATTT GAAAGCCGTG
ACTTCGAATC CAATTTCACT TGGCGAAGTT GGTGCCTCGC TTACAGATGA TCAGAAATTG
CTTATTTTGA GGAGAGTCCA TCTTGATAAG TTAACTTCAT TTGAAGAATT GCCTCCCCAA
GCTGCTTTCT ATATACAAAA GGTTGAACAC TTGACCACCA CTGAAGCCCT CACAATCTTG
AACCAAGCAG TGATTGATCT TGATGCAGAT GCTAATTTCC CAACCAAGGA CTACAATTTG
TTGACAAGCT TGGTGCAGAG TTCCGAATCG AAACAATTTA ATGCCAAAGA AAAGTTGGCG
TCTGCCTTGG ATGGAAGCTC ATCAGAACAA TCATTAGAAA AGGATTATGA TACCCATTCA
ATAGTTGATT GGGACTTGCA AGTTAGATTG GAAGCTGTTT TGATTGCTTA CCACTCCCCA
TACCCTCAAG TGAGAGCAGT CACTGATCCG TATGATGACC CAACCATTCC TGTTGAAACC
ATTAGAGTTT ATATTCTCGG CATAATTTGG ACTGCCATTG GAGCTGTTAT CAATCAATTC
TTTTCAGACA GATTGCCAAG TATATCTTTA GATCCTGCAG TTGTTCAAGT GTTTCTCTAT
CCTTGCGGTA TCCTATTGGA ATATATCTTA CCAAAGAAGA AGATCAAGAT TTGGAGGTAT
ACAATTGATC TCAACCCTGG ACCATGGAAC TACAAAGAGC AGATGTTAGC GACTGTATTC
TATTCCGTAT CCTGTCCTAT TGGTACGAGT TACGTTTCTT CCAACATTAC TGTTCAAAAG
ATGGAAATGT TCTACAACAA TAAATGGGTT GATTTTGGAT ATCAAGTTTT GTTGATATTA
TCGAATAACT TTTTGGGGTT TGGATTCGCG GGTATCTTTA GAAGATTTGC TGTGTACCCC
CCCGAAGCCA TCTGGCCCAG TGTGTTGCCA ACTCTTGCTC TAAATAGAGC ATTGATGGTC
CCTGAAAAAA AAGAAATCAT TAACGGATGG AGAATATCTA AATATAATTT CTTTTTCATT
ACATTTGCTG CCAGTTTTGT TTACTTCTGG ATTCCTACCT ATCTTTTCGC TGCTTTGTCA
ACTTTCAATT GGATGACTTG GATCAAACCT TATAATTTCA ACTTGGCCGC CATAACTGGA
ACTAATTTTG GATTGGGATT GAATCCAATC CCTACTTTTG ACTGGAATGT GATTAATACT
AATTCACCTT TGGTTCTTCC ATTTTTCACC CAAATTAACA ACTATATCGG AGTCTTGATT
GGCTTTATTG CGATTGTAGG TGTCTACTGG TCCAACTACA AATGGACAGG CTTCCTACCA
ATCAATTCAA ATGCTGTTTT CACTAATACT GGTGAACCGT ATGCTGTCAC AGAAGTAGTG
GACGGAAATA GTTTGCTTGA TAATGAGAAG TACCAACAAT ACAGTCCACC ATTTTATACT
GCTGGAAACT TAGTGGTTTA TGGTGCCTTC TTTGCTATCT ACCCGTTTTC AATTGTCTAT
GAAATTGGGA GTAGATACAA ACAGACATGG AAAGCACTCA AGAGTGTTTA CAGTAGTGTT
AGAGATTTCA AGAGAGGCGC ATACGAAGGA TTTGATGATC CTCACTCTAA GATGATGACT
GCTTATAAGG AGGTTCCCGA TTGGCCCTTC TTTGTGGTCT TGGTAATATC TCTCGTTTTA
GCAATCATCT GTGTAAAGAT TTATCCTGCT GAAACTCCTG TCTGGGGGTT GTTCTTTGCT
TTGGGAATCA ACTTTGTCTT TTTGATTCCA ATCACTGCCA TCTACTCCAG AACTGGTATC
GGGTTCGGTC TTAATGTCTT AGTTGAATTG ATCGTTGGGT ATGCTATTCC TGGGAATGGT
CTTGCGTTGA ACTTCATCAA GGCATTTGGT TACAATATTG ATGGTCAAGC TCAAACCTAT
ATTACTGATC AGAAGATGGC CCACTATGCA AAAATTCCTC CTAGAGCACT TTTTAGAGTT
CAGATTCTTG GAGTTTTCAT CGCGTCCTTT GTTCAACTTG GTATCTTGAA TTTTGTTCTT
ACTAATATTG ATAATTATTG TGATCCTCAC AATAAGCAAA AGTTTACTTG TGCAGGGTCT
AGGACTTTTT ACAGCGCTTC TATTCTTTGG GGTGTTATTG GTCCGAAGAA GGTGTTCAAT
GGTCTTTACC CTATCTTACA ATACTGCTTC TTAATTGGAT TCTTAATTGC AATCCCAGCT
GTCGTATTTA AGTTTTATGC TCCAAGAAAG TACACTAAGT CTTTCGAACC TTCAGTTGTA
ATTTTAGGAG TTATGAGCTT TGCTCCTACC AACCTCACAT ATTATACCGG AGGCTTATAT
GCATCTATTG CTTTTATGTA CTATGTGAAG ACTAGATATG AGGCATGGTG GCAGAAGTAC
AACTACCTCT TGTCTGCTGC TTTGACGGCT GGTGTTGCTT TTTCCGCTAT CATCATTTTC
TTTGCTGTGC AATACCATGA TAAGAGTATT AACTGGTGGG GTAACATTGT TCCTTACGAG
GGGATCGATG GTGGCTACGG ACAGCAATCC AGGTTAAACG TTACAGAGCT AGCACCAGAT
GGATATTTTG GTCCAAGAAT TGGAAATTTC CCTTGA
 
Protein sequence
MSTEEKNTTD KKEKLDNITS IRSVGSYLQL EDHEIDLKAV TSNPISLGEV GASLTDDQKL 
LILRRVHLDK LTSFEELPPQ AAFYIQKVEH LTTTEALTIL NQAVIDLDAD ANFPTKDYNL
LTSLVQSSES KQFNAKEKLA SALDGSSSEQ SLEKDYDTHS IVDWDLQVRL EAVLIAYHSP
YPQVRAVTDP YDDPTIPVET IRVYILGIIW TAIGAVINQF FSDRLPSISL DPAVVQVFLY
PCGILLEYIL PKKKIKIWRY TIDLNPGPWN YKEQMLATVF YSVSCPIGTS YVSSNITVQK
MEMFYNNKWV DFGYQVLLIL SNNFLGFGFA GIFRRFAVYP PEAIWPSVLP TLALNRALMV
PEKKEIINGW RISKYNFFFI TFAASFVYFW IPTYLFAALS TFNWMTWIKP YNFNLAAITG
TNFGLGLNPI PTFDWNVINT NSPLVLPFFT QINNYIGVLI GFIAIVGVYW SNYKWTGFLP
INSNAVFTNT GEPYAVTEVV DGNSLLDNEK YQQYSPPFYT AGNLVVYGAF FAIYPFSIVY
EIGSRYKQTW KALKSVYSSV RDFKRGAYEG FDDPHSKMMT AYKEVPDWPF FVVLVISLVL
AIICVKIYPA ETPVWGLFFA LGINFVFLIP ITAIYSRTGI GFGLNVLVEL IVGYAIPGNG
LALNFIKAFG YNIDGQAQTY ITDQKMAHYA KIPPRALFRV QILGVFIASF VQLGILNFVL
TNIDNYCDPH NKQKFTCAGS RTFYSASILW GVIGPKKVFN GLYPILQYCF LIGFLIAIPA
VVFKFYAPRK YTKSFEPSVV ILGVMSFAPT NLTYYTGGLY ASIAFMYYVK TRYEAWWQKY
NYLLSAALTA GVAFSAIIIF FAVQYHDKSI NWWGNIVPYE GIDGGYGQQS RLNVTELAPD
GYFGPRIGNF P