Gene PICST_30203 details

Gene Information

Locus tag: PICST_30203
Symbol:
ID: 4837627
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 2221181
End bp: 2223496
Gene Length: 2316 bp
Protein Length: 771 aa
Translation table: 12
GC content: 42%
IMG OID: 640388942
Product: predicted protein
Protein accession: XP_001383187
Protein GI: 150864397
COG category:
COG ID:
TIGRFAM ID: [TIGR00727] small oligopeptide transporter, OPT family
            [TIGR00728] oligopeptide transporters, OPT superfamily
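
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only and is not part of the database record. It assumes the start and end coordinates are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is excluded from the protein length.
    """
    gene_length = end_bp - start_bp + 1  # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the "Start bp" and "End bp" fields of this record.
gene_length, protein_length = cds_lengths(2221181, 2223496)
print(gene_length, protein_length)  # 2316 771
```

Under these assumptions the result agrees with the "Gene Length" (2316 bp) and "Protein Length" (771 aa) fields.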


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGATC TTGAGCCGTT AAAGTCTCCA ATCGAGAAGA ATGTCCTGGA TGTGGAGGTC 
GAGGTCTCGG AACCGCTACT TCCTCGAAGA AGTAATGCTG AAACCATAAA ATCGTACAAA
TCCTATGGTT CGGTTGAGGT GGTCAGTTCT CCGAGTTCTT CAGACGACTC CGAGGATAAC
GAGGATCTTG ATCCGGACGT CTTAGAATTG CCTAAGATAA TTAGGGAAGC GGTTCCCTTA
GTGGATGATC CTTCCATTCC AGTTTTGACT TTCAGATACT TTCTTTTGTC GACAGTTTTC
ATAATCCCTG GAGCCTTTAT AGACACTATG AATTCATACA GAACGACTTC AGCAGCATAC
TCCATATTTT TTGTACAAAT AGTGTCCCAT TGGGCCGGGA AGTATCTTGC AAGAACACTT
CCAAGGAAAC AAATCAAGTT CTTCGGATTC AAAATTGACC TAAACCCTGG ACCATGGTCT
ATCAAGGAAA CAGTCATGGT AACCATCACG GCTAATAGTG GAGCCACCGG TAATTTAGCC
ACAAATGCTA TCTCTTTAGC TGATTTATAC TTTGGAGAAA AAGTTCCTGC CATTGTGGCT
GTCGGGTTTA TGTGGGCGAT TGTCTTTGTA GGCTATTCGT ACGCTGCAAT TGCTAAGAAC
TTTTTGCTCT ACGATCCTCA GTTCTCTTGG CCCCAGGCAC TTATGCAGAC GACGTTGTTG
CAGTCTCAGG CAAAATCTGA CAAGTCAAGT AGAGGTGGTT CTAAGCAGAT GAGAGTATTC
TTCACAGTAC TTTTAGGGGT GACGGCATGG CAGTTCTTTC CAGAATTTCT TTTCCCAATG
ACGTCTTCGT TGGCTATTTT ATGTTGGATA GCTCCATATA ATGAAACGGT GAACTTCATA
GGCTCTGGTT TGGGTGGAAT GGGAGTACTC AACTTTTCTT TGGATTGGGC CAATATTACC
TCGTCGATTA TGTTGTATCC TTACTGGATA CAGGTCATTC AGTTCATAGC CTTTGTCATT
GGAGCCTGGA TTCTTATTCC TTTGGTTAAA TGGGGTGGAA TTGGATCTTT CAAAGGAGGT
TTAATGTCGA ACAGCCTTTT CCAAGGGAAT GGCTTACCTT ACCCCACAAA TGAGCTTTTG
ACCCAAGATT TGAAGTTGAA CTTAACTGCC TATGAACAAT TTGGTCCCAT CCATTTAGGA
GCCCAAAGAG CATGGAATAT GTTCTTTGAC TACGCTGCTT ACGTTAGTGG TACAACATGG
GTTGTACTCT TTGGATACGA CAAATTCAAA TCGTCTTTCA AGCATTTAAT CACCAGAGAT
AAAGATACCA AAGTTCAGTA CACAGATAGA TTAAATAAAT TGCGAGCTAG ATATGAAGAA
GTTCCAATCT ACTGGTATTT GGTGCTATTC CTTATTTCCT TCACCGTCTT GATGTCTATC
TTCCTCAATG GATACATGTT CATGCCATGG TGGGCTGCCA TCGTAGCCTT AGTCATGGGT
TCTATCATCG TTACCCCATT GGCCTGGCTT TATGCCTTGT CCAACTTCCA GTTGGCTATA
GGTACATTCA ACGAGCTTGT ATACGGATAC ATGGTTCAGA ACTTGGAACT GAAGCATCCT
GCTGGAGCCT TAGTTTTCGG CTCAATTGCT GGCAATGCCT GGTATAGAGC CCAATACCAT
CTTGAATGTA TGAGATTGGG ATTCTACAAC CACTTACCAC CTCGGGCCGT GTTTTTCTCT
CAATTGTATG GTGAAATGAT TGGGGTTCCC ATCAACTATT TGGCTGTTAG ATGGGTGTTG
AGTACCAAGA GAGAATTTCT CAATGGCTCT AAGATTGATC CTTTGCATCA ATGGACTGGC
CAAACAATAA CCTCGAATCA TACCAATGCT ATTCAGTATG TGGTTTTGGG CCCTTCCAGA
TTATTTGAAA ATTACCCTCT TTTACCCTAT GGTTTTGTTT TGGGATTAGT GGCTCCATTC
ATCTTCTTCA AGTTGCATCA AAGGTACCCC AACCTGAACT GGAACCTCTG GAACACCACT
GTGTTCTTTT CTAGCATGAG TAGATTTTAT GGAAATATTT CCACAGGATA CTTCTCCAGA
TTTATAGGAG GCACTATTTC AATGTACTGG GGAGTCAGGT ATAAGCACGC CTTGTGGAAG
AAGTATAACT ATCTTTTGGC AGCTGCTTTT GACACTGGTT ATAATTTGGC AATCTTGCTC
ATATTCTTGA TCTTTTCCGT GGGAACAAGT TACAACATGC CCAACTGGTG GGGCAACAAT
GCCACCAGCA TCGAAAGATG TTTTGCATTA TTTTAG
 
Protein sequence
MADLEPLKSP IEKNVSDVEV EVSEPLLPRR SNAETIKSYK SYGSVEVVSS PSSSDDSEDN 
EDLDPDVLEL PKIIREAVPL VDDPSIPVLT FRYFLLSTVF IIPGAFIDTM NSYRTTSAAY
SIFFVQIVSH WAGKYLARTL PRKQIKFFGF KIDLNPGPWS IKETVMVTIT ANSGATGNLA
TNAISLADLY FGEKVPAIVA VGFMWAIVFV GYSYAAIAKN FLLYDPQFSW PQALMQTTLL
QSQAKSDKSS RGGSKQMRVF FTVLLGVTAW QFFPEFLFPM TSSLAILCWI APYNETVNFI
GSGLGGMGVL NFSLDWANIT SSIMLYPYWI QVIQFIAFVI GAWILIPLVK WGGIGSFKGG
LMSNSLFQGN GLPYPTNELL TQDLKLNLTA YEQFGPIHLG AQRAWNMFFD YAAYVSGTTW
VVLFGYDKFK SSFKHLITRD KDTKVQYTDR LNKLRARYEE VPIYWYLVLF LISFTVLMSI
FLNGYMFMPW WAAIVALVMG SIIVTPLAWL YALSNFQLAI GTFNELVYGY MVQNLESKHP
AGALVFGSIA GNAWYRAQYH LECMRLGFYN HLPPRAVFFS QLYGEMIGVP INYLAVRWVL
STKREFLNGS KIDPLHQWTG QTITSNHTNA IQYVVLGPSR LFENYPLLPY GFVLGLVAPF
IFFKLHQRYP NSNWNLWNTT VFFSSMSRFY GNISTGYFSR FIGGTISMYW GVRYKHALWK
KYNYLLAAAF DTGYNLAILL IFLIFSVGTS YNMPNWWGNN ATSIERCFAL F
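
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below is illustrative and not part of the database record. It translates the first 60 nucleotides of the gene sequence above with a hand-built codon table, to show why residue 16 of the protein is S. The names `STANDARD_TABLE`, `TABLE_12`, and `translate` are hypothetical helpers, and the table is constructed here rather than taken from any library.

```python
from itertools import product

# Standard genetic code, with codons enumerated in the order T, C, A, G.
BASES = "TCAG"
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
STANDARD_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), STANDARD_AA)
}

# Translation table 12 differs from the standard code in reading CTG as Ser.
TABLE_12 = dict(STANDARD_TABLE, CTG="S")


def translate(dna: str, table: dict[str, str]) -> str:
    """Translate a DNA string codon by codon, ignoring spaces and newlines."""
    dna = "".join(dna.split()).upper()
    codons = (dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3))
    return "".join(table[c] for c in codons)


# First line of the gene sequence in this record (60 nt).
FIRST_60_NT = "ATGGCTGATC TTGAGCCGTT AAAGTCTCCA ATCGAGAAGA ATGTCCTGGA TGTGGAGGTC"

print(translate(FIRST_60_NT, TABLE_12))        # MADLEPLKSPIEKNVSDVEV
print(translate(FIRST_60_NT, STANDARD_TABLE))  # MADLEPLKSPIEKNVLDVEV
```

The table 12 output matches the first 20 residues of the protein sequence above. The standard code would give L instead of S at position 16, where the codon is CTG.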