Gene PICST_80251 details


Gene Information

Locus tag: PICST_80251
Symbol:
ID: 4851486
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1952288
End bp: 1956407
Gene Length: 4120 bp
Protein Length: 1030 aa
Translation table:
GC content: 41%
IMG OID: 640393194
Product: predicted protein
Protein accession: XP_001387613
Protein GI: 126274615
COG category:
COG ID:
TIGRFAM ID:
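
The gene length listed above is consistent with the start and end coordinates if both endpoints are counted. A minimal sketch of that arithmetic follows; the function name is hypothetical, and treating both coordinates as inclusive is an assumption that this record does not state explicitly.

```python
def inclusive_span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive.

    Assumption: both endpoints are counted, which is why 1 is added.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = inclusive_span_length(1952288, 1956407)
print(gene_length)  # 4120, matching the listed gene length
```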


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGGTATATCA CTATAGTGGA GATAGTCATA AGTTGTAGTA ATAGCTTCGA TCGAGCCATC 
CATCGCCGAG ACTGCTAATA TATAACTATA CGTTTGTTTG CCCGAAGCTC GAATCTTTCT
CATTTCACAA GACAGACAAT CAGAACGAAA GAATCGAATA CAACAGGACC AGAACATCTA
CAATATCATC TACCAGACGT TACATCCACT ATACTCTACT CTAACTACCA CTCATAAATA
CAGACAAACA GTGCTGTCCA ATTGTTGATT ACTCATCGTT ATCATATTTT TCACTTTCAT
AAATTTCATA AATTTCATTT CACTTCAGTT CGTTCATGAC AAATAAAGAG GAAGAGCAGC
TTCTTCAGGA TGCGCTGACG CTTTTGATGT TCGCCAACGT AGCGGCAAAG CAACAACAGC
AACATATCAA GAACACTTCT GCTAGCCATA GTCCTCTTCA AACTGCTACG TCACCGGCAG
CCTCCTCCGT GACCCCACCT CCGGCCAATC CTTCGACTTC GTATCAGCAA ATCCAACAAC
TTCAAAACCA ACAAGCACAG CTTCAAACTC AGTTTCAAAA CCATATTCAA CATGGTGCGA
TAAAAGTAGA ACCAATTCAA AAGAATCAAA CTCAACAAAG TCAGCTTCAG CCACAACTTC
AACAGAGTCA TCAACCTCAG AAACAACTTC CACCGTTTGA TTACTATGGA GCTCAGAATC
GCCCAGCAGT TTCTTCCAAT ATTGTCCCAA ACCCTGAAAA GTTCTCGACT TCGTCGTCTC
TGTCTACGGT TTCGCCTCCT ACGGCTTCGT TTGTGCATAA TCAAGACCTC AGAGAGGCAT
CTAGACCTCA CCTTCCACAT AAAAGTTCGT TGAGCATTCT TATGAACAGT CCAGAACCAC
AATCCATGCC ATTCTCGCCT TCTAATGTGT CAACTCTTAC AAAAGATTCT CAGATTTCAC
AAACTTCCCA GGTTTCACAG CAGAAACGCT CGGCTTCGCT TGAGGATTCT AATCTTCCCC
AGAAAGGCAG TTTTGTCCAA CAACACAAAC GCTCTAAATC GACCCCAGAG ACCGACAAAA
AGACACCGGC TTATAGACAC ATTGTTCTTT CTCCTGGCCC AGCAAACGTA GCTCTCGCCA
GAGGCATCAA CTTGGAAACA GGAGAAAGAA ATAATAACAA TGCGGTGATA GCTGCTGCTG
CATTAGCAGC TGCTGCAGAT ATACCGTTGC CGTTGAAACA TGTAAAAAGA AACGTAGAAC
CAACAGTTTC TAAGTCTACA ACAGTAAAAT TTGCACACAT AGTATCAGAA CAAGAAGCTG
AACCGGCTAA AACTGTTCAA GTAGCACAAC CTGTTGCAAA AGTTGAGCCG GAACAAGTTC
CACTAGCAGT TCCAAAACCA GTAGTGGTTC CTACACTTCC TGTGACATTA CCGGTCATGG
CTACAAATCT TCGAGAAAAG GAAATCAAAA TAACCAAAGA TGAAGATCAG CTTACCGAAG
CAGAGGTAGA CGAGAAAACA GACGATGAAC GCACAGACGA TGAATATTCC TCAGCGAGAG
GAAGTAAAAC AGAAACAGAA ACAGCCTCGG AAATCTTCGA AGTAAAATCA GAAGTTGAGG
GCCCAAGACA AGAACAAGTA GTTCAACTTC AACTTCAACT TCAACAAGAA GCACAAGAAC
AGGAAACTCA TGCACATCAA GAACAGGCCC AAGTTCAACA AGAACAAGAA CAAGAACAAA
TTCAAATTCA ACGAGAACCA GAAGCAAAAC AAGTTGAACA AGAACAAGAA TCACAGGAAG
TTCAACAACT TGTAGAAAGT GATGATCAGG ACTATGAACG GGAAGAAAAA GTGGACACAG
AGATTCAACA TGAAGAACAA AAGATCGATT TTAAGGCACC TCCTCTTTCT TCATATCAGG
TTGATCCTGA TTCTGGTTTA ATCGGATGTA TCTGTGGAAT TTCTGACGAT GACGGGTTTA
CTATCCAGTG TGATGTGTGT TATAGATGGC AGCATTGTGT CTGTATGGGT TTCAAGACAA
GTGAAGAAGT GCCCGAAGAC GAGTATACTT GTTATTACTG TGACAGAGCC AAGTGGGGAA
AATTTGACCC ACTAGAATGT CGCAGAGAAA CGTTGGATCG TCTAGACAAC GATAGTCGAC
AGCCTCAATT ACTGCAAGAA TTGAACGAAA GGCAACAGCA AGAACTTGCG GAATTTCAGG
AGAAGCAGAA GCAAAATGGC AAGAGAAAGC CGCTGAACTC TGATAAAAAC GACAAGCGAC
GTAAAGTGGA AAGTCAAGTA GAAAAGAGAA AGGACTCCTC AGCAACTGAA GTACAGCCAA
AGAGTCAAAC TCCTAGCGAT ACCCTTCCAA ACAAAGACAA TGAGTTGTTA GAAGATGGAG
TCACAGCAGA GTCGTATCAA TCTGTGTATT ATAAGCTTAG AAAGAACGAC TACAAGAGAC
AGTCTATAAG CGATTTCTTT TCTAGAATAG GTACTGAGTT CTTCAACGAA TACCTTTCTT
TGGACCCAAG CACAAAAGCC TCCAAAGAAT TGAGAGGGAT CAAAGTGATG TCTATGCCTG
AGTTTAAAGC TATTAGATTG AGCAAATTGA ATTTGCCAAA CCATCTTAAT TATATTAGCG
AGCACAAGAA CAATTTGTCC AAGAAGAAGT TGTTCAACGA CACTTCGATA CAGGTTAGAC
AGTATTCCGA TAATCAGAAG CAGAAGTTTA ATGGAATCTC CAAGTTGTCA CTTTTTATCA
CGTCCACTAA TAGCGATAGT TTGACGATCC CAGCCAACAC TGCGATTATC GAGTATTTAG
GTGAAATTGA TCTTTTCAAG AACTACGTCA GGGATCCTAT CAACCAATAT TCCAGCTGGG
GTACCCCAAA GCCCAAGGTT CTTCGTACAA GTCTCAAAGT AGCACAGGAC AACAATTTGG
AAGTTGTTCT TGACTCCAGA TTTGTGGGTA ATGAATCTAG ATTCATTAGA AAAGCTTGTC
CTGCTTCCAC AAATTGTAGA ATAGAGCCAA TTTACATTCC TGAAGACAAT TCGTTTCGCT
TCTTAGTCGT GACAACAAAG CCCATTAATT TGAAGTCAGA ATCTGCTGAT GAAGAGTTGC
GCCTTGAATG GGAATGGGAT CCACAACATC CCATCCTCAA GCTTTATGAA AATAACAACT
CTGAGAAGTT TGAGCAATTG GTGAATGCTG ATAAATCAGC ATTAATCACC TACATCGACA
ATATTTTGCA TTTTGCCGAG TGCGGCTGCT CAACAGCAAA CGCTTTTTCC TCTTGCGCTA
TTTTTAAGAT CAAGAAGGCT ACTTCGTATT TGTTGCGTTC AACTAGAAAA GCATCATCTA
TCAGCAATTC CAATCTAGCC AAATCAAAAG AGGAATTGAT ACTTCCTAGA AAGGAAAAGG
AGTACATTTC TTGGGAAGAA AGATTGTTGG AAAGAGACAA CATCATACAG ATGAATCTTT
CAGTAACTAC TGAACAGATT AGCGAAGAAT CAAAAGAAGA GTTGAAAAAC GATAATGAAG
TGGAAGTCGA TAACGAAGAA GTTAAGGGAG AAATTAAGCC AAATTATCTT TTCAAGTTGC
CTTACAAGCA GCAATTGCTC TCCAAGAACA GGGGCCGTAA GATCGTAGTT CGTCCTGGAT
CTTCTTCTGT AGAAGGAGAT GTAAATAGCG GAACAGCCCA CGACGAGTTA CCATTCCCAA
TTGTGTCTGA CTTAGTAGTC AAGATCGAGA AAAGTATTGA CGAAAAGCTT AAGCCAATGG
TCAAGGAGGT GGAAGAGAAG ATAACTACTG TCCTTGAGTC ACTTCCAAAG CCTTCGGCCA
CGCAAGAAGA AACGTCAAAG AAGGATGAAG TTATTGCTGC TGTGGAAGAA ATACAACATG
TTCTCAAGAA GGAGACAGAA GATCAAGTCA GCTCTCTTCC ACCACTTTCA ACTGAAACTT
CTTCAGAAAC TAAAAAGGTG GAAACTTCTG ATTCGACAAC AGCACAGGCA CCACCTCCGG
TTGTCAAGAA ATTATCATTC GCTGACTATA AGAAGAAATT GAAATAATAG AATAGGTTAT
ATATTTATAA TGAGTAATAA TGTACTATAT CATATTTTCA
 
Protein sequence
MTNKEEEQLL QDALTLLMFA NVAAKQQQQH IKNTSASHSP LQTATSPAAS SVTPPPANPS 
TSYQQIQQLQ NQQAQLQTQF QNHIQHGAIK NRPAVSSNIV PNPEKFSTSS SLSTVSPPTA
SFVHNQDLRE ASRPHLPHKS SLSILMNSPE PQSMPFSPSN ISQTSQVSQQ KRSASLEDSN
LPQKGSFVQQ HKRSKSTPET DKKTPAYRHI VLSPGPANVA LARGINLETG ERNNNNAVIA
AAALAAAADI PLPLKHVKRN VEPTVSKSTT VKFAHIVSEQ EAEPAKTVQV AQPVAKVEPE
QVPLAVPKPV EVQQLVESDD QDYEREEKVD TEIQHEEQKI DFKAPPLSSY QVDPDSGLIG
CICGISDDDG FTIQCDVCYR WQHCVCMGFK TSEEVPEDEY TCYYCDRAKW GKFDPLECRR
ETLDRLDNDS RQPQLLQELN ERQQQELAEF QEKQKQNGKR KPLNSDKNDK RRKVESQVEK
RKDSSATEVQ PKSQTPSDTL PNKDNELLED GVTAESYQSV YYKLRKNDYK RQSISDFFSR
IGTEFFNEYL SLDPSTKASK ELRGIKVMSM PEFKAIRLSK LNLPNHLNYI SEHKNNLSKK
KLFNDTSIQV RQYSDNQKQK FNGISKLSLF ITSTNSDSLT IPANTAIIEY LGEIDLFKNY
VRDPINQYSS WGTPKPKVLR TSLKVAQDNN LEVVLDSRFV GNESRFIRKA CPASTNCRIE
PIYIPEDNSF RFLVVTTKPI NLKSESADEE LRLEWEWDPQ HPILKLYENN NSEKFEQLVN
ADKSALITYI DNILHFAECG CSTANAFSSC AIFKIKKATS YLLRSTRKAS SISNSNLAKS
KEELILPRKE KEYISWEERL LERDNIIQMN LSVTTEQISE ESKEELKNDN EVEVDNEEVK
GEIKPNYLFK LPYKQQLLSK NRGRKIVVRP GSSSVEGDVN SGTAHDELPF PIVSDLVVKI
EKSIDEKLKP MVKEVEEKIT TVLESLPKPS ATQEETSKKD EVIAAVEEIQ HAPPPVVKKL
SFADYKKKLK