Gene PICST_53989 details

Gene Information

Locus tag: PICST_53989
Symbol:
ID: 4851913
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 3162870
End bp: 3165311
Gene Length: 2442 bp
Protein Length: 687 aa
Translation table:
GC content: 43%
IMG OID: 640393621
Product: predicted protein
Protein accession: XP_001387175
Protein GI: 126276003
COG category:
COG ID:
TIGRFAM ID:
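
As a quick consistency check on the coordinates above, the following minimal Python sketch recomputes the gene length from Start bp and End bp. It assumes the coordinates are 1-based and inclusive, which the record does not state but which matches the listed Gene Length of 2442 bp. The function name `gene_length` is illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the Gene Information table.
start_bp, end_bp = 3162870, 3165311
length = gene_length(start_bp, end_bp)
print(length)  # 2442
```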


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.78983
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTCCT TCTTCAAGAG ACATTCCCCC CCAAACTCGT CCACCGTCTT TGATGATATC 
TTCTGGTCAT GGTATCTCGA GAGCTACAGC TTGGTACGAG ATCCTGTTGG CGAGCCGCTC
CCTATAAATC CCAAAAAGTA CTCGTCTATT TCATTGGCTG TTATCACTCT TCAAAAATAC
TTCGATAGTC TTAAATTGCC TGTACCCTCA TCGAAGCAAA CTATAGCTCT TCTCAAGTCT
CCGTTTACCC AGGGGGATAT AATCAAGACA TTCCATCTCG TGCGTTTCTT CCAGCTTTCA
CGCGAGGGCT TGTTTCTCAC AAACTCATAT ACTGATAAGC TCAATGGCAC TATTCTCTAC
AAGGGAGCTG AGAACTGGGA GAATGTCATG TGTTACCTAG ATGCGCTTCT ATTCCTGATG
TTTGCCAAAC TCGAGTCGTT CGAGCCGATT TTGTTTATTT CCAACAGAGC ACCCAATACT
CTAGTCAGCC AGTTGGCAGC TTTGTTACGT CTTTACATCA GTATGCTTCG TTCGGGCAAT
TTGATTACAA CGGACATCAC CATCCGCTTG TGTGAGACAT TGCTGAAATT GGGATTCAAA
GAAGCAATGT CTCATAAACA ACAAGATTCT GCTGCGTTGT TTGAATTCTT GACGGAAACG
CTTTCTATGC CACTTCTCAC GTTCAAAATT GATATCAAGC ATGGTGGTAA ATTCAACAAG
GAAGACGACG AAAAAATCTC CAAGGAACGT ATTCTCTTTG TCAGTCTTCC AGAAGAAGAA
ACACCGTTTC CTGAAAAGCC AGTACGAGAA GAAACTGGTA CCGTGGAAGT AGTTTCTTTG
AACGAAGCAG ATGTAGACAA TTCTGCGAGA GACTCGAGTG GAGCGGAGAA CCCCGTGAAA
TCTCTAAATG CAGATAATTC AATCGCTGGA AATGTAGAAA AAGTGAATTC AGACAAAGGT
GTGAATGACA AACCAGAAAA ACTAGAAGAA AATGACTCCA AATCGTTGCA GACGTCCGAT
ACCCTGGTTC TCGAATTCAA AGAGTTCAAG CCTGCTTCCA AGGAAGAAGA GAATGATGAA
GGGATTCTTC TTGAAGAGTG TCTCGAACAT TATTTCAACA ACTCCATAAG TGTTAAACGA
GAATTGGAAA GAAGAGCAAC CATGGAAAGC TTGAAACCTG CGGCGGGGGA AGTTGTTTTC
TACGACAACG CAATCCCCGA AAACCAGGAA GTCACAACTC CCACTTCCGA GGGAAATCCT
GTAGTGAGAT CCAATTCAAA ACGTGAAGCC AACCACGAAT TTATAGAGAA TATGGATGAA
ACGATAAGTC TCCAGACTCT GGGACTGGCA TCTAGACTTC ACAAGAGCAA CCCATTCAGA
TACGCTTCTA GGACTCGTTC CTCCACGCTT TCAATCTGGT CAATGAACGA TCTGGAAACT
GGGGGAAAGT CCAAGGAAGT CAATTTGCCA GCCTGGATGT TTTTGCGCCT CTTGCCATTT
TACACTGATG ACAATGATGT CACCAATAGC CAGAACCAGA GCATAGCCAA GAACTCCAAA
GAGTTTGTTA ACAGAAGACC AATTCTTCCC ATTTGTCTCA AAAGATATTC CTTTGATGCT
ACTAAATCTC TGGCAAGTCG TTCTCAAAAA AGAATCATTA TTCCACCATT TATAGACTTG
CCCCAGTTTG TTGCTGACGA TGTAGATGAC GAAACGGGAA ACTACAGGTT GATATTGGAA
AGTGCTGTTT GTCACCGTGG CCATTCCATT GCCCTGGGTC ATTTTATTTC AGTCATCCGA
AAAAATACAG ACAACATCTC GGAGACAGAA GAAGAGGCTC AAAACTCCAC CTGGTATTTG
TACGATGATA TGAAGAAGAA ATCTAGAATT GTAGAGAAGA CCTTTAGGGA GATATTCAAC
AAAGAATGGC CCTATTTGTT ATTTTACAGA TTGGTGACTA CTGACGAAAT AGCATCTTCA
ACAAATTCAA GTAAAGCTAA TCTTGCACAA CAGAGTTCTT CCAACCCGTT TATAGCTCCA
GCTGGTTCCA AAAATTCATA CTGGTCGGAC GATTCTGACA CACCTTCTGG AGTACCATCT
TCTGGACTTC CTCTTCCTCC ACCGGCACTA TCACCGATTC TTTCTGCTTC CAATAGCGAC
GCTCCACCTT TTGCAGTAGA AGAATATGTT CCTTTAAAGA AGGTAGATTC TGCCCATAGT
AGTACTTCCA GTATTCCTAT TCCAGATATT TCACCTACCG ATGCCAGGTT CGTAGATATC
CGTAACAAAT ACTACTGGTA CATGATAGAC AAGAACAAGA ACTACATCAA AGAGCTCCCT
TCGATAAAGA CGTCTAGTGG CAGAGATGCC AGCGTAAGTT TCAATCCACA ATTCCGTCGT
AATAGTCAGT GGAGTGAAAG ATCTAACGTC AGTAGCATAG AT
 
Protein sequence
MSSFFKRHSP PNSSTVFDDI FWSWYLESYS LVRDPVGEPL PINPKKYSSI SLAVITLQKY 
FDSLKLPVPS SKQTIALLKS PFTQGDIIKT FHLVRFFQLS REGLFLTNSY TDKLNGTILY
KGAENWENVM CYLDALLFLM FAKLESFEPI LFISNRAPNT LVSQLAALLR LYISMLRSGN
LITTDITIRL CETLLKLGFK EAMSHKQQDS AALFEFLTET LSMPLLTFKI DIKHGGKFNK
EDDEKISKER ILFVSLPEEE TPFPEKPPAS KEEENDEGIL LEECLEHYFN NSISVKRELE
RRATMESLKP AAGEVVFYDN AIPENQEVTT PTSEGNPVVR SNSKQNMDET ISLQTLGLAS
RLHKSNPFRY ASRTRSSTLS IWSMNDLETG GKSKEVNLPA WMFLRLLPFY TDDNDVTNSQ
NQSIAKNSKE FVNRRPILPI CLKRYSFDAT KSLASRSQKR IIIPPFIDLP QFVADDVDDE
TGNYRLILES AVCHRGHSIA LGHFISVIRK NTDNISETEE EAQNSTWYLY DDMKKKSRIV
EKTFREIFNK EWPYLLFYRL VTTDEIASST NSSKANLAQQ SSSNPFIAPA GSKNSYWSDD
SDTPSGKVDS AHSSTSSIPI PDISPTDARF VDIRNKYYWY MIDKNKNYIK ELPSIKTSSG
RDASVSFNPQ FRRNSQWSER SNVSSID
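
The sequences above are displayed in blocks of 10 residues, 60 per line. The sketch below shows one way to strip that display grouping and compute a length and GC fraction in Python. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample uses only the first and last lines of the gene sequence, so its GC value is not expected to match the GC content reported for the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence above, used as a short sample.
sample = """
ATGTCGTCCT TCTTCAAGAG ACATTCCCCC CCAAACTCGT CCACCGTCTT TGATGATATC
AATAGTCAGT GGAGTGAAAG ATCTAACGTC AGTAGCATAG AT
"""

dna = clean_sequence(sample)
print(len(dna))  # 102 (60 bases from the first line, 42 from the last)
print(f"GC fraction of sample: {gc_fraction(dna):.2f}")
```

Pasting the full gene sequence into `sample` would give the whole-gene values for comparison with the Gene Length and GC content fields.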