Gene PICST_29047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_29047 
Symbol 
ID4851783 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2817620 
End bp2819452 
Gene Length1833 bp 
Protein Length610 aa 
Translation table 
GC content38% 
IMG OID640393491 
Productpredicted protein 
Protein accessionXP_001386877 
Protein GI126275589 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCTA GGACTGGCAA CGTGGGTATT GCCTTGCTTC CTGAACACAT CGTTGATTTC 
ATTCTCAGTT TGCTTCATGA TGATGATTTG GAAGAACTAG CGTTGTCCAA TGGACCATTC
CAAAAGCTCG CAAGAAAAAG ATATGTTAAG GCTCGTGAAC ATATATACAT ATCTTCTGGA
TGGATTGACC AGGTTGATGA GATTAGAAAT GCTCGTTTGG ATGGAAGTAT GTACTGGATG
ACAGAGAAAG AGTTTACGAG ATATATTGAA GATTTCCCAG AAGCAAAATT CTTCCTAAGG
TCAGCGGCAC AAGATTTCTT GAAGATAGTT GGTATCATAG ATTTTGATTG TGTTCAGAAA
ATCGAAATAT TCTTCTCCAC TCATTTTCCA ATAGACGAAC TACCAGACTA CTTGGAACTA
ATTAGGAGCT TACCTTTCTT AGTTGTAATT GGTTTGGATA TTTATTCAGA TTTCCATGAG
GTATACCACG ATATGGAGTT TCCGCCGAAC GTGGTAGGCA TATACACTAG TCATGATCGC
AGTGACTGGT GCACTTCTTT TGATACATTG TTCGTACTTT CAATTCAAGA CGGTGAGTTT
CAAGGGAGCA ATTTTCCAAG CAACCTTGTC ATCTTACATT TAGGTTCAAT GAGCAACATC
AATTCGTTGG CTCTTCCGAG TACGTTAAAA GAGTTAGGTC TTGCCGATGT AGAGAATCTA
GATATTTCGC ACTTGATAAA TTTACGAAGC TTGGTGATAA AGTTTACAGA CTTGACTGGC
TTCGATTCTT GGAAATTTCC TGTTAATATT AAATCTTTGC AAATGAAGTT CTGTGAGAAT
ATAAATAGTT GTTCGAAGAT CCATGAACTT ATAGAGTTGG TCAATTTTGA CATCGAGGGA
GGGTCAAGAG GGAAACCGTT ATTAGACATC AAGTTCCCAA AGACTTTGTT AAAGCTAAGG
ATAAGAGGAT ACGAAGTCAA AGAAGGCTTT GTATTCCCGC CTCGGCTTAA ATCTTTATCC
CTCGAATCGG CTGGCATTTT ACAAAACGTG CAATTCCCAG AGAATTTAAT TGAGTTGGAT
ATTTTCAGTA CAAGATTCCT GAATTTACCT GAAGATAATA ATCCTATATA CCCATTTTCT
CAAAAACGAT TATTGAAATT GTCGTTTGGG TATGCACCTG GTTCTTTCTT GTTGGGAATA
AACGACAACT TGAAAGACCT AAAAATTAAT TGCTCGAATA TTCAAGTATT AGATCAAATA
GAACGCTTCA AATCTTTGGA GTGCTTGACA GTTGAAGGAA GTTCAACTGG TCAGTTTACC
AGAAAACTAT TCCACGAGTT AAACTTTCCG CCCAGACTTA AAGAGTTGAA TCTTTTAGTA
ATGCAATCTA GCGATAATGA CTTCGAAACT TATCCTGAAG ATTATTCTTG TGTGGAAGAT
GATATGAAGT TCGTGATAGA CTATAGATTC AAGTTACCAT CAGAGCTAAA AAGAATTCAT
ATTCGGGGAC ATGGATTGGC AATTGGTAGT GGATATGAAT TTCCAATTGG TTTACGAGCA
CTTGGTCTAA GAGAGTTGGC TATTTTGGAC AAGCTGATTA CATTTGACTA CATGACAGAT
CTTAGGAAAC TAGACTTGTA CGGCACCAAT ATCGAACTTC TGGATGAATC AAAGTATCCG
AAGAGCTTGG ATGAGTTAAT TGTGTCTTCG AAGCAGTTCG TATCCCTTAC CAACACCCAA
TATGAGGAAA TAGGACCCTT GTTTTGGTTT TTAGAAATCG AAAAAGGAAA TATTCTTATT
AGGGCCCTTC ATATGATTCC GTGCGACTAC TGA
 
Protein sequence
MTARTGNVGI ALLPEHIVDF ILSLLHDDDL EELALSNGPF QKLARKRYVK AREHIYISSG 
WIDQVDEIRN ARLDGSMYWM TEKEFTRYIE DFPEAKFFLR SAAQDFLKIV GIIDFDCVQK
IEIFFSTHFP IDELPDYLEL IRSLPFLVVI GLDIYSDFHE VYHDMEFPPN VVGIYTSHDR
SDWCTSFDTL FVLSIQDGEF QGSNFPSNLV ILHLGSMSNI NSLALPSTLK ELGLADVENL
DISHLINLRS LVIKFTDLTG FDSWKFPVNI KSLQMKFCEN INSCSKIHEL IELVNFDIEG
GSRGKPLLDI KFPKTLLKLR IRGYEVKEGF VFPPRLKSLS LESAGILQNV QFPENLIELD
IFSTRFLNLP EDNNPIYPFS QKRLLKLSFG YAPGSFLLGI NDNLKDLKIN CSNIQVLDQI
ERFKSLECLT VEGSSTGQFT RKLFHELNFP PRLKELNLLV MQSSDNDFET YPEDYSCVED
DMKFVIDYRF KLPSELKRIH IRGHGLAIGS GYEFPIGLRA LGLRELAILD KLITFDYMTD
LRKLDLYGTN IELLDESKYP KSLDELIVSS KQFVSLTNTQ YEEIGPLFWF LEIEKGNILI
RALHMIPCDY