Gene PICST_72879 details


Gene Information

Locus tag: PICST_72879
Symbol:
ID: 4839783
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 640318
End bp: 643299
Gene Length: 2982 bp
Protein Length: 967 aa
Translation table: 12
GC content: 43%
IMG OID: 640391098
Product: predicted protein
Protein accession: XP_001385471
Protein GI: 150866013
COG category:
COG ID:
TIGRFAM ID:
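
The start, end, and gene length fields above are consistent with each other if the coordinates are read as 1-based and inclusive. That reading is an assumption, not something the record states, and the helper name `span_length` is purely illustrative. A minimal sketch of the check:

```python
# Coordinates copied from the record above.
# Assumption: both positions are 1-based and inclusive.
START_BP = 640318
END_BP = 643299


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end positions are both counted."""
    return end - start + 1


print(span_length(START_BP, END_BP))  # 2982, matching the Gene Length field
```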


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.828998
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTGGCAC TTGGTCTAAT CCTTCTATTA CTGACGGCAA TGCTTTTCAC GAGCCCGATG 
ACTGGAGCAG ATTCCCCAGG ATGTCGTCCC GTGTACATGT ATCCATCGTA TGCTCGGATT
ACGTCGTTTG ACGAGTCCCA TACCAAATTT GCATCTAAAT ACTCGCTATA TCTCTATAGA
GAACAGGGTA AGGATCCAAT TCCTGATGAA AATAATGGAT ACGAGCGTTT GGACGGAATA
CCCGTCCTCT TCATTCCAGG TAATGCTGGG AGCTACCGTC AGGCTCGTTC TATAGCAGCA
CAGCTGTCAA ATCTCTACTT TGACAAAAAA GTACATAACA GTGGAGCACG TAACTTTGAT
TTTTTCACAG CTGACTTCAA CGAGGACTTC ACGGCGTTTC ATGGCCGTAC CATGCTCGAC
CAGGCTGAAT TTCTAAACGA AGCCATACAT TTCATACTAG GTTTGTATTC CAATACGGAC
AACCCTCCGA AATCGGTTAT AATAGTTGGC CACTCGATGG GTGGTGTAGT AGCCCGAGTC
ATGCTCACTT TACCCAACTA TACCGACGGA ACTGTGAACA CCATCATCAC GTTGGCTTCG
CCTCACGCAG CCGCTCCGCT AACGTTTGAT GGGGATATCT TAAAAATATA CTCTGCTGTA
GATCGGTTCT GGTTCGATGG ATTCCATCCG ACTACGCTGG ATATTTCTCA GACAGCACAT
CGTAGATTGC ACGATGTTTC ATTGATTTCC ATAACTGGAG GTTTAACTGA TACAACGTTG
CCAGCTGACT ATACTACTTT GAGCTTTTTG GTGCCTCCTT CAAATGGATT CACCGTGTAC
AGCACAGGAA TCCCTAATGT TTGGACCAAT GTGGACCACT TGGCCATTGT TTGGTGCCAT
CAATTGCGGA CCCAAATAGC CAAAGCCATG CTAGAAATCG CGGACTTCTC CTCTCCTTCA
AGAACGTATC TGTTAGAAAG ACGAATGCAG GTGTTTAGAA ATAACTTCTT GACTGGATAT
GAAGACTACG CCAGCCAGGA TCTTGTTCCT AACATAGAAC AGCAACAGAA CTTGGCTATG
AAATTGGACT TGGCTCAAAT CAAGTCATTC AACATAAACG GCGATAAAAA GCTCAGGATC
ACGAAAGATA ATCAAAACAG TAAATGGAAC ATATTCCAAC TAGGCAACGA TGCAAAATTG
CAATTGGATG TGTTAAGCTC GCTTGAACCA ACTGAATGGG AATCTTGGTT AGCTGACGAA
GAATCCAATA GACCAGTCAT TTTACTATGT TCCAATTTGG GTGAAAACAA TCAGGAGCTT
GATTTGACGG ATTTCACTAA TGACCAGACT AACGAATTCG TAGAGTTAAA ATGTATTGAC
ACCGCCAGAG ACGTCCATTT AGTACCTCGG TCTATGCTGG ATTCAAAATC ATTAGGCGAG
TCTTCGTTTG GCGGAGAAAA GACTCCATTC TATTCGTTGC AATACAACGA CACGATACTA
ACTCGCTATG ATCTTGTTGT AGTTGCTCAA AGAAATATAG CAACAGATAG CGACTTTGTT
ATAGCAGAAC TTGCTGATCA GAAATCCACT ATTTTCGAGC TTGGAAAACA CATGTCTACT
CTTTTCCACA GCAAAGCCGA TCTCTCACTA TCGTCTAACC GCCCATTGTC TGTCAACATC
AAATTGCCTG CTGCTTGGAG CAGTTTGCTA TCGTATAAAT TGAAGCTAAA CCTCCCTGAC
AAAACAGAAG GAAAGTTTGC CTCCTTCATT CGGCAATGGA TAGACGAACC ATACGAAAGC
AAGTGGCATA TCAATGTGGA AAAAAACAAT GTAATCACAT TGAGAATGCA CGGAATCTCT
CCATTCGTAC CTTTCAAAGT AAAGGATGAC TACGGGTTGA ATATCCAACT ATGGTCTGAC
ACAAACACCA AGGAAGAGAT ACCATTGGAT ATTGAGCTCT CTGTGGATTT GATCGACAGT
TTCAGATTGT TTGTCATGAG ATATCGATTA TCGATTGTAG CCACATGTGT TCTGATTTCA
TTGTTGGCCA TGCTTGTACA ATTTCAATTG TACTTTAGAA CCGGCAAGTT CCCCAACTTT
ATCTTTGTGC TTTCATACCT CAACACCGGC TGGCCACTCG CGATGATCAT ATCTATCCTC
ATAGGCCTTA ATCCCATAGT CAAGTTGGGT TGGGTGCAAT ATTTGCTCAA CTTCATTGAT
CCCGTGGTAT TACTGGATGC CAATGAAGTC AACCTTTCGC TCAAACAAGA GTTCCGGCTC
AACTCGTTCT ACTTGGGTCT AGAAGAGTCC AGTCTCTGGT TCATTGGCAT CATGCTTTAC
TTCATTGGAA CCTTCTTGGT CGTAGCCACC TACTATTTAT TATCAGCTAT CAGCGTCTTG
GGATACGCTT TGGTGAATCA TTTGCCTAAA ACTACTTCTC CCACAAAAAC TCGCATCGTC
GTCACCTTCT TGCTAATCAT GATGATACCA ATTTATATCC CGTACCAGAT CGTATACATT
ATCAGTTGTG TTATCCAGGC TATCAACGTT TTGAAATCGG TTAACAACTC TCAGTTGTTC
AACTACCAGA TCTCGCTCTT GATTCTCATG TTGTGGATTC TCCCGGTGAA CATCCCCATT
GTTATTGTGT TTGTCCACAA CTTGTCGGTC AACTGGAAGA CGCCGTTCTC GTCGCACCAC
AACTTGCTTT CCATAGTTCC GGTGTTATTG TTAACAGAGA GGAACGGGCT CTTGTCTCGT
TTGCCAAAGA AAAAAGACCA GTTCTTCTTC AAGTGGTATA TGGGGTACTT TATCTTCTAC
TGTTTGATCT ACGGCAGTCG GCACACGTAC TGGCTCCATC ATTTGTTCAA CTTGATGAGT
TGTTTGGTGT TGGTGCTTAC GTTTGGCGAT GAAGAGAAGA AGGAAGAAGT CAGCATTACA
TAGCATCACT AGCATGTACA TAATATAAGC ATTAACATAG AT
 
Protein sequence
MLFTSPMTGA DSPGCRPVYM YPSYARITSF DESHTKFASK YSLYLYREQG KDPIPDENNG 
YERLDGIPVL FIPGNAGSYR QARSIAAQSS NLYFDKKVHN SGARNFDFFT ADFNEDFTAF
HGRTMLDQAE FLNEAIHFIL GLYSNTDNPP KSVIIVGHSM GGVVARVMLT LPNYTDGTVN
TIITLASPHA AAPLTFDGDI LKIYSAVDRF WFDGFHPTTS DISQTAHRRL HDVSLISITG
GLTDTTLPAD YTTLSFLVPP SNGFTVYSTG IPNVWTNVDH LAIVWCHQLR TQIAKAMLEI
ADFSSPSRTY SLERRMQVFR NNFLTGYEDY ASQDLVPNIE QQQNLAMKLD LAQIKSFNIN
GDKKLRITKD NQNSKWNIFQ LGNDAKLQLD VLSSLEPTEW ESWLADEESN RPVILLCSNL
GENNQELDLT DFTNDQTNEF VELKCIDTAR DVHLVPRSMS DSKSLGESSF GGEKTPFYSL
QYNDTILTRY DLVVVAQRNI ATDSDFVIAE LADQKSTIFE LGKHMSTLFH SKADLSLSSN
RPLSVNIKLP AAWSSLLSYK LKLNLPDKTE GKFASFIRQW IDEPYESKWH INVEKNNVIT
LRMHGISPFV PFKVKDDYGL NIQLWSDTNT KEEIPLDIEL SVDLIDSFRL FVMRYRLSIV
ATCVSISLLA MLVQFQLYFR TGKFPNFIFV LSYLNTGWPL AMIISILIGL NPIVKLGWVQ
YLLNFIDPVV LSDANEVNLS LKQEFRLNSF YLGLEESSLW FIGIMLYFIG TFLVVATYYL
LSAISVLGYA LVNHLPKTTS PTKTRIVVTF LLIMMIPIYI PYQIVYIISC VIQAINVLKS
VNNSQLFNYQ ISLLILMLWI LPVNIPIVIV FVHNLSVNWK TPFSSHHNLL SIVPVLLLTE
RNGLLSRLPK KKDQFFFKWY MGYFIFYCLI YGSRHTYWLH HLFNLMSCLV LVLTFGDEEK
KEEVSIT
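
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below is one way to see that effect on two short fragments copied from the gene sequence above. The names `build_table`, `translate`, `STANDARD`, and `TABLE_12` are illustrative and are not part of any database tooling. The choice of reading frame for each fragment is an assumption, made so that the output lines up with the protein sequence shown.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the standard genetic code, listed in NCBI codon order
# (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop codon.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(aas: str) -> dict:
    """Map each of the 64 codons to an amino acid, in NCBI order."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, aas))


STANDARD = build_table(STANDARD_AAS)
# Table 12 differs from the standard code only in CTG -> Ser.
TABLE_12 = dict(STANDARD, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace
    and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# Fragment starting at the first ATG in the gene sequence (no CTG codons).
print(translate("ATGCTTTTCA CGAGCCCGAT G", TABLE_12))  # MLFTSPM

# Fragment containing a CTG codon: the two codes disagree here.
print(translate("CAGCTGTCAAAT", TABLE_12))  # QSSN
print(translate("CAGCTGTCAAAT", STANDARD))  # QLSN
```

Under table 12 the second fragment gives QSSN, which agrees with the protein sequence above; the standard code would give QLSN instead.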