Gene PICST_31037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31037 
Symbol 
ID4838217 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1581063 
End bp1583810 
Gene Length2748 bp 
Protein Length891 aa 
Translation table12 
GC content39% 
IMG OID640389532 
Productpredicted protein 
Protein accessionXP_001383926 
Protein GI126134803 
COG category 
COG ID 
TIGRFAM ID[TIGR00727] small oligopeptide transporter, OPT family
[TIGR00728] oligopeptide transporters, OPT superfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.629932 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCAGG TCGAAAAGAA GGAAGAACTT AATAATATCG TTAGCATTCG AAGTGTTACT 
TCTCACTTGC AGTTAGAGGA TCATGAAGTC GACTTGAGAG CAATTACTTC TAACCCAGTC
TCTATTGGTG AGATTGGTGT TTCGCTTACA GATGAGCAGA AGCATTTCAT CTTGAAGAGA
CTTCATCTCA ACGGTCTAGA GTCCTTTGAA CAATTACCTC CTCAGGCTGC CTTTTATATC
GACAAGATTG AGAAAATGGG AGAAGATGAA GCATTGACAA TCGTGAAAGA AGCTCTTGTT
GAGCATCATG ACGACGCCAA TATCCCAGTC GAAGACATAG AATTGTGGAC TAACCTTGTT
GAAATTGGAA ATTCTAAGAC TTCTGGTACA AAGGAAAAGT TGGCTTCTAC TTTTGATGAG
AAGAATGACA GAGATGGTGA GTCCTCTTCT CAGGAAGTGT CTGAAGGTGG TAAATACGAA
GATTTCACCC ACAATGTGGT CGATTGGGAC TTGCAAGTTA GATTGGAGGC CGTTTTGGTT
GCTTACCACT CACCTTACCC TCAAGTAAGA GCGGTTACCG ATCCTTATGA CGATCATACT
ATTCCAGTCG AAACTTTCAG GGTCTATCTT CTTGGAATTA TTTGGACTGC CATTGGTGCA
GTAATTAATC AGTTCTTTGC TGAAAGGCAA CCTGGTATTT ATTTGGACCC AACTGTTGTT
CAAGTGTTGC TTTATCCTAG TGGTATGTTG TTGGAATATA TCTTGCCAAA GTTCAAGTTC
AAGATTTGGA AATACAGTAT TGACCTCAAC CCAGGTCCTT GGAATTACAA GGAGCAGATG
TTAGCAACAC TCTTCTATTC TGTTGCAGGA GGTGGTGCCA GTTACGTTTC ATACAACATT
CATGTTCAAA AGATGAAGGT GTTCTACGAT AACAAATGGG TTAATTTTGG TTATGAAACC
TTATTAATTT TGTCTAATAA CTTCTTGGGA TTTGGGTTCG CTGGTGTTTT TAGAAGATTT
GCTGTATATC CAACTGAAGC TATCTGGCCA ACAGTGTTAC CTACTCTCGC CTTGAACAGA
GCTTTGATGG TGCCAGAAAA GAAAGAGATT ATTAACGGTT GGAAGATTCC AAGATACACA
TACTTTTTCA TTCTCTTTGC GGCATCCTTT GTCTATTTTT GGGTTCCTGA TTACCTTTTC
TATGCCTTGT CTGTCTTCAA CTGGATGACA TGGATTAAGC CTTACAATTT CAATTTAGCT
GCCATCACTG GAAGTAACTT TGGTTTGGGA TTAAACCCAA TTCCTACCTT CGACTGGAAT
ATGATTAGTT TTAACGCACC ATTGATTTTT CCATTCTACA CCCAACTTAA CACTTATATC
GGTGCTTTAA TAGGGTTTTT TGCAATCGTT GGTGTATACT GGACCAATTA CAAATGGACA
GGCTTCCTCC CAATCAACTC TTCGTCCATA TTCACAAATA CTGGTGACTA CTATGCTGTT
ACGGAGATCC TCAATGAGAA AAGTTTGCTT GATGAAAAAA AGTATCAGGA GTACGGACCA
CCATTCTACT CCGCAGGAAA CTTAGTTCTT TATGGTGCCT TCTTTGCTAT CTACCCTTTT
TCTATTGTTT ATGAAATCGG TACCAGATAT AAACAAACTT GGAGAGCACT CAAAAGTCTT
TACCAAAGTT TCAGAAACTT TAAGAGATCT ACATATGAAG GTTACACTGA CCCACACTCC
ACTATGATGA GAGCTTATAA GGAAGTCCCA GATTGGGTAT TCTTGGTAGT TTTAGTGATT
TCTCTTGTGT TGGCTATTAT TTGTGTCGAA ATCTACCCTG CTGAAACTCC TGTTTGGGGT
ATTTTCTTTG CCTTGGGTAT CAACTTTGTT TTTTTAATTC CAATAACTGC AGTTTACTCC
AGAACTGGTT TTAGTTTTGG ACTCAATGTC TTGGTTGAAT TGATTGTTGG TTATGCTCTT
CCCGGTAACG GTCTTGCTTT GAACTTTATC AAGGCGTTTG GTTACAACAT TGACGGTCAG
GCACAAAACT ATATCACTGA CCAAAAGATG GCTCACTACT CCAAGGTTCC TCCAAGAGCT
TTGTTCAGAG TTCAAATTAT TGGTGTCTTT ATTGCCTCGT TCGTTCAACT TGGTATAATT
AATTTCGTTA TTGACAATAT TAAAGACTAT TGTGAGCCAT ACAACACACA GAGATTCACT
TGTCCAAACT CTAGAGTCTT TTACAGTGCT TCTATTCTTT GGGGTGTTAT CGGACCCAAG
AAGGTCTTCA ACGGATTATA CCCAATTTTG CAATACTGTT TCTTAATTGG ATTTTTGTTG
GCTATTCCTG CAATCGCGTT CAAAAAGTAC GCTCCAATTA AGTACACCAA GTACTTTGAA
CCTACAGTAG TAATCGGTGG TATGTTGAAT TATGCTCCTT ACAATCTTTC CTACTTGACA
GGTTCATTTT ATGCTTCCTT TGCCTTTATG TATTACATTA AGAACAAGTA CCAGGCATGG
TGGTATAAAT ACAACTATCT CACAACTGCA GGATTGACTG CTGGTGTGGC TTTTTCTTCT
ATCATCATTT TCTTTGCCGT TGAGTACCAC GACAAGAGTA TTTCATGGTG GGGTAACAAC
GTTATATATG GTGGTATCGA AGGTGGTTTG GGTCAACAGT CAAGATTGAA TGCTACCGCA
GAAGCTCCAG ATGGTTATTT CGGTCCAAGA AGAGGAAACT TCCCATAA
 
Protein sequence
MSQVEKKEEL NNIVSIRSVT SHLQLEDHEV DLRAITSNPV SIGEIGVSLT DEQKHFILKR 
LHLNGLESFE QLPPQAAFYI DKIEKMGEDE ALTIVKEALV EHHDDANIPV EDIELWTNLV
EIGNSKTSVS EGGKYEDFTH NVVDWDLQVR LEAVLVAYHS PYPQVRAVTD PYDDHTIPVE
TFRVYLLGII WTAIGAVINQ FFAERQPGIY LDPTVVQVLL YPSGMLLEYI LPKFKFKIWK
YSIDLNPGPW NYKEQMLATL FYSVAGGGAS YVSYNIHVQK MKVFYDNKWV NFGYETLLIL
SNNFLGFGFA GVFRRFAVYP TEAIWPTVLP TLALNRALMV PEKKEIINGW KIPRYTYFFI
LFAASFVYFW VPDYLFYALS VFNWMTWIKP YNFNLAAITG SNFGLGLNPI PTFDWNMISF
NAPLIFPFYT QLNTYIGALI GFFAIVGVYW TNYKWTGFLP INSSSIFTNT GDYYAVTEIL
NEKSLLDEKK YQEYGPPFYS AGNLVLYGAF FAIYPFSIVY EIGTRYKQTW RALKSLYQSF
RNFKRSTYEG YTDPHSTMMR AYKEVPDWVF LVVLVISLVL AIICVEIYPA ETPVWGIFFA
LGINFVFLIP ITAVYSRTGF SFGLNVLVEL IVGYALPGNG LALNFIKAFG YNIDGQAQNY
ITDQKMAHYS KVPPRALFRV QIIGVFIASF VQLGIINFVI DNIKDYCEPY NTQRFTCPNS
RVFYSASILW GVIGPKKVFN GLYPILQYCF LIGFLLAIPA IAFKKYAPIK YTKYFEPTVV
IGGMLNYAPY NLSYLTGSFY ASFAFMYYIK NKYQAWWYKY NYLTTAGLTA GVAFSSIIIF
FAVEYHDKSI SWWGNNVIYG GIEGGLGQQS RLNATAEAPD GYFGPRRGNF P