Gene PICST_79777 details


Gene Information

Locus tag: PICST_79777
Symbol:
ID: 4841132
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009048
Strand:
Start bp: 875415
End bp: 877565
Gene Length: 2151 bp
Protein Length: 598 aa
Translation table: 12
GC content: 44%
IMG OID: 640392447
Product: predicted protein
Protein accession: XP_001386584
Protein GI: 150866854
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0814] Amino acid permeases
TIGRFAM ID:
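
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the stated gene length); the function name `feature_length` is hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_len = feature_length(875415, 877565)

# A 598-residue protein needs 598 codons plus one stop codon.
cds_len = (598 + 1) * 3

print(gene_len)            # 2151
print(cds_len)             # 1797
print(gene_len - cds_len)  # 354
```

The 354 bp difference suggests that the listed gene sequence includes some sequence on either side of the open reading frame, rather than the coding region alone.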


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.405101
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTCATGCATC TGTAGATTTT GAACACTTTT GTTTTTCGTT TAACGATCTT TAGCATTTAC 
TCCAGGACCA AAAGACCGCT TTCTACTTCT GGGAAGGGCC GTTAGCTTTG TGTAGGCTGA
ACTTTTCTTG TTTCTCATTG ATTTTGGCAA TTTACAACTA GACTACTCTA TCTTGTTCCC
CATTTTTCAC AGTTTATCTG CCCATTTTTC ACATCTCATC CCCCAGTATT ATGTCTGGTT
CCGACGACGA AAACGCGGTC GATGAGTACA CCTCGTTGAT AGCCTCTAGA CGGCCCTCGA
TCATGTTTCT CGATACGCCT CTTGGCTCAT TCAAAGGCCC TAACTCTCTC CACAATTTTG
CTTCGTCCTT CACCAGAGCC CAGTCGTTTG CTGTGTCTAA AATCGACAAT GATATCCACA
GAGCGAGATC GTTTTTTGTC GAGAATATTG ACACTGACGC TGATGACGAG TTGTTTGATC
CGGAATTAAT GATTCCATCA CAAAAAGGTG AACGTCTTTC CGTAGTAATC CACGACATCT
CGTCTAGAAA CCAGCTTTTC ATGAATAATG TCAACGAGTT GGACCAGAAC ATCTCGCCCA
ACAACGACGT CTTCTACCAC GATGACATCT TGTCCGCCTT GAACGAGTCC AGGTCGAGGC
ACAACTCGAC CTACAATACA CCAGGTGCCA TCCCCATTTC GAAAAAGCGT GTATTACCTT
CTCCCTCGTT CTCTTCTATC CGTTCTGCGC TCTCATTGGC CACAACATCA GACCATATCA
ACTTGAAGAA AATAGAGGAC AAAGATGGAA ACGTCGTCAC GGTATTGGCC GGTCAATCAA
CAGCACCCCA GACCATCTTC AATTCGATTA ACGTTCTCAT TGGAGTGGGC CTCTTGGCTC
TTCCTGTAGG TATCTTAAAA GCAGGTTGGT ACTTCGGAAT TCCCATCTTG GTCATCTGTG
GCTTGGCAAC TTTCTGGACT GCTGGATTAT TGTCTAAATG TATGGACACT GATCCCACGA
TTATGACATA TGCTGATTTG GGTTATGCTG CCTATGGTTC CACAGCCAAG TTGTTAATCT
CGTTGTTGTT CTCTATCGAT CTCTTGGGAG CCGGAGTTGC GCTTATAGTC TTGTTCAGCG
ATTCCTTGTA CGCTCTTTTA GGCGACGAAG AGGTGTGGAC AAGAACGCGG TTCAAGTTCC
TCAGTTTTGT CGTCTTGACT CCATTCACAT TCGTCCCTTT ACCAGTATTG TCAATCTTCT
CGTTGTTTGG CATTCTCTCG ACAATTTCCA TCACTATTCT CGTAGCATTT TGTGGTATCT
TGAAAACCGA TTCTCCAGGC TCGTTGTTAG CGGTAATGCC CACCAACATC TGGCCGCAAT
CGTTACCTGA TCTTCTTTTG GCCATTGGAA TCTTAATGGC TCCCTTTGGT GGTCACGCCA
TCTTCCCTAA CTTGAAAACT GATATGAGAC ACCCATACAA GTTTGAAAAG ACTTTAAGGT
ACACCTACAG CATTACCATG ATCACAGATA TGGCAATGGG TGTTTTGGGC TTCTTGATGT
TTGGCCACAA ATGTAGCAAC GAAATAACAA ACACCTTGTT GTTAACTCTG GGATACCCCG
CATGGTGCTA TCCTTTGATC AGTGGGTTGA TCTGTTTGAT TCCCTTGGCC AAAACTCCGT
TGAATGCCAA ACCAATTATC TCTACCTTGG ACGTCTTGTT CAACGTGCAA GTTCCCAGCG
AGCATTTATC GTTGAACTTG CTTAAGGATG TCGGTAAGTT TTTCATCAGA GTCGGAGTCA
ATGCCGTCTT CGTACTCTTG GCCATTTTGT TCCCTGAATT CGACAAGATC ATTGGCATTC
TTGGAGCTTC CATCTGCTTC GTTATCTGCA TCGTCTTGCC ATGCTTGTTC TATTTGAAGT
TGTGTTCATC CAAGATGGGA GCTTTGGAAA GAGTACTCAT TCAATTTGTT GTATTTTTCA
CCTCCATCTT GGCCGTTGTC GCCACTTGGG CTGTCGTTCA GTTCTAGGCT GGCTAATCTT
TTTGCATCTT TATCTACTTA GTTAATTTAC AGTATAGACC ATGGGATCCG GGAAGCCAGG
TCTTTAAACA ATTAGAGTCT ATAGAATAAT ATCAAAACTC GTGCTCTCTA C
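
The sequence above is printed in blocks of 10 bases, 60 per line. As a minimal sketch of how such a listing might be read back into a plain string and summarised, the hypothetical helpers below (`parse_grouped` and `gc_fraction` are illustrative names, not part of any tool named in this record) strip the whitespace and compute the G+C fraction. Only the first line of the listing is used here, so the printed GC value applies to that line alone, not to the whole gene.

```python
def parse_grouped(text: str) -> str:
    """Join a whitespace-grouped sequence listing into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing, copied verbatim.
first_line = "TTCATGCATC TGTAGATTTT GAACACTTTT GTTTTTCGTT TAACGATCTT TAGCATTTAC"

seq = parse_grouped(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full listing through `parse_grouped` should yield a string whose length can be compared with the Gene Length field, and whose GC fraction can be compared with the GC content field.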
 
Protein sequence
MSGSDDENAV DEYTSLIASR RPSIMFLDTP LGSFKGPNSL HNFASSFTRA QSFAVSKIDN 
DIHRARSFFV ENIDTDADDE LFDPELMIPS QKGERLSVVI HDISSRNQLF MNNVNELDQN
ISPNNDVFYH DDILSALNES RSRHNSTYNT PGAIPISKKR VLPSPSFSSI RSALSLATTS
DHINLKKIED KDGNVVTVLA GQSTAPQTIF NSINVLIGVG LLALPVGILK AGWYFGIPIL
VICGLATFWT AGLLSKCMDT DPTIMTYADL GYAAYGSTAK LLISLLFSID LLGAGVALIV
LFSDSLYALL GDEEVWTRTR FKFLSFVVLT PFTFVPLPVL SIFSLFGILS TISITILVAF
CGILKTDSPG SLLAVMPTNI WPQSLPDLLL AIGILMAPFG GHAIFPNLKT DMRHPYKFEK
TLRYTYSITM ITDMAMGVLG FLMFGHKCSN EITNTLLLTS GYPAWCYPLI SGLICLIPLA
KTPLNAKPII STLDVLFNVQ VPSEHLSLNL LKDVGKFFIR VGVNAVFVLL AILFPEFDKI
IGILGASICF VICIVLPCLF YLKLCSSKMG ALERVLIQFV VFFTSILAVV ATWAVVQF
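
The record lists translation table 12, the NCBI "Alternative Yeast Nuclear Code", which differs from the standard code in reading the codon CTG as serine rather than leucine. The sketch below builds both tables from the standard NCBI amino acid string and translates the first 30 bases of the open reading frame, copied from the gene sequence listing starting at its first `ATG` that matches the protein. The names `translate`, `TABLE_1` and `TABLE_12` are hypothetical helpers for illustration, not a library API.

```python
BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

codons = [a + b + c for a in BASES for b in BASES for c in BASES]
TABLE_1 = dict(zip(codons, STANDARD))
# Table 12 is the standard code with one reassignment: CTG -> Ser.
TABLE_12 = dict(TABLE_1, CTG="S")


def translate(dna: str, table: dict = TABLE_12) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the open reading frame, as they appear in the listing.
cds_start = "ATGTCTGGTT CCGACGACGA AAACGCGGTC"
print(translate(cds_start))       # MSGSDDENAV
print(translate("CTG"))           # S
print(translate("CTG", TABLE_1))  # L
```

The output for `cds_start` matches the first ten residues of the protein sequence above. Any in-frame CTG codon would come out as leucine under the standard table, which is why the table number matters when re-deriving this protein from the nucleotide sequence.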