Gene PICST_33685 details


Gene Information

Locus tag: PICST_33685
Symbol:
ID: 4840987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009048
Strand:
Start bp: 213361
End bp: 215370
Gene Length: 2010 bp
Protein Length: 669 aa
Translation table: 12
GC content: 39%
IMG OID: 640392302
Product: predicted protein
Protein accession: XP_001386636
Protein GI: 150866892
COG category:
COG ID:
TIGRFAM ID:
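
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single unspliced open reading frame ending in one stop codon; the page does not state either convention explicitly, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


length = gene_length(213361, 215370)
print(length)                  # 2010
print(protein_length(length))  # 669
```

Both results agree with the Gene Length and Protein Length values listed in the record.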


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.872469
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGTTT ACGGCAAGAA CTGGAGTTCG TTTCGAAAGA GAACACGTCT TCTGGCGGAT 
GCACCAGTTT TTTCCAGCGA CGAAGAAAAT GAAGAATTCA CTGATATAAC CGAGCCCACT
TCAGTTTCAG ACAAACTAAT GTCAGTGATT CAAGATTCTC ACGTAGTTAC CAAGATAGGA
GAGCAAGAAT TGCTTCCCTC GGCTGAGATT AATTCGAATT GGAAAACATA CAAATCTGTG
CAGCATCACA AACGCAGCCT TTCAGATTTC AATCTTTCAG ATTCTCTCAC TACTTCTCCA
CAGAAGCTAG TAACTGTAGT CAACGCGCTC AGCAACTCAC CAAGCCCAGT GAAAAGTCGG
GCTAAACGAA GACTTGAAGA CGAATTAAAG GATTTGGCAA AAACTCCACC GAGAAACAAA
TCCAAGAACA ATAATAGAGA GCAAGTCACT CCTGCTAAGA GTACGAATTC AACTGCAAAG
AGAACTCCTA CATTCACACC CAAAGAAGCA CGCGACTGGG ATTCCTTGTT CGAAAGCATA
GATGACGAAC TGGTTGGTCG AAATACCTTT CTAACGTCTC AAAATGATAG TGACAACCAT
GGTGAAGAAA CGGACAACGA CGATGGAGAA ATAAACGTAG ATTTGTCAGT ATTTTCTTCC
TATGTAGAAA ACTCAAATAC TTCCCCAAGT CGAAATTCTA GCGAAAGGAT ACTGAAAGGA
ACCGCAAAAT CGAAGCTTAG AACATATGGG GATGAGCGAA GCTTTCTTCT AGAAGGAAAT
GAAGAAGAAA ACTCGAAAGT ATCTATTGGC GATGAAATAC CAGTTGTGGA AGATGTGTTA
AGCATCAATG ATCTCAGAAG TATCAGCAAA GAAAACCAAC GAAAGGAAGC TCTAGACTAT
ATTTTGGAAG GATTGCAATT TACTGATTGC AAGAACTTGG CCACTGGAAA TGCTGTATTG
GTATCATTGC TAGTAGACTT AGCCATCGAG TCGATCAAGA ATGGTTCAAA TGCACTTGAA
AGAAACGGTG AAATCATCGC AACTAGGTTG CTAGCCATTT ATGAAAATAT TATTAATTCT
AAGGGCAATG GAAAAGATGT TTTGTGTTGG CTAGTTAGTG TGAACTTTCT CTTTTTAGCA
GCATCAGCAA CTCGGACTGA ACAAGTTGCT ATCAGTCAGA CTTTCAGACA TTGCCTATTA
TCAATTCTCA AACTGCTTGG GGTCACTTAC AGTTCCAATG GTTTGCCTAT TTTGGTAAAG
AAATCTCTTT TGCAATTGCT GGACATATTG CAAATTGAAA CTCCTCTCCA AGTGCAATTG
ATAGAAGTGA TGAGCAATGT TCCTGATTTC CACAGAGCAG ACATATTTGA ACATGTCATT
CAATTGTTTG CAACCGAGAC TAGATTGACC AACAAGATGA AGCTATTAAG CTATATCCAA
TCGTATGTTG AAAGATCGCC TGAATTGGAT AGTCTTTACG ATCTCGAAAT TTGTATATTA
GAATCTATGA ACAAGATAGA TTTTGTACAA CTTGACGATC TAGATGTTCA AGTGTTAAAG
TTGGTCGTTG TGCTATCTAC ATCTTATGAC AATAATGAGC GAGTTGCCGA ATTGCTATTT
GATCCCAAAT ATGTTTCGCC CATGATAAGG TATATCAATA GCAGCTACAG TTGCTTAAAC
GACGAATATC GGCTCAACAT AGCATTGTTT CTCTTGGGAT TCTTGATAAA CTTTGTTGAG
TCTGACCGGT TCGAATTGCA AAAGTTTGAT GATGTTAGAG ATAACATAGC TATATTTGAA
GCAATTGATG CTAGTGCAAA GGACGAGGCT TCCGGTCACC TAATAGGCTA CAACAGTATA
GTTTTGACAT ACTTACTGTT GAAGTATAGT GAAGATCTCG ACATAGATAT TCAACATCTA
AAACACAAGT TGGACCATTT CAAGGAGAAG ATAGCTAATA CCAGAATCAA GACCAAGATT
GATAGTTTGC TTACAGAATT GGCTAAATAA
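
The GC content field in the gene record is the percentage of G and C bases in this sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as text in the 10-base blocks shown above; the helper names are hypothetical and not part of the source database.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks between 10-base blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGTCTGTTT ACGGCAAGAA CTGGAGTTCG TTTCGAAAGA GAACACGTCT TCTGGCGGAT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Running `gc_percent` on the full cleaned sequence, rather than on the first line alone, is what would be compared against the reported value.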
 
Protein sequence
MSVYGKNWSS FRKRTRLSAD APVFSSDEEN EEFTDITEPT SVSDKLMSVI QDSHVVTKIG 
EQELLPSAEI NSNWKTYKSV QHHKRSLSDF NLSDSLTTSP QKLVTVVNAL SNSPSPVKSR
AKRRLEDELK DLAKTPPRNK SKNNNREQVT PAKSTNSTAK RTPTFTPKEA RDWDSLFESI
DDESVGRNTF LTSQNDSDNH GEETDNDDGE INVDLSVFSS YVENSNTSPS RNSSERISKG
TAKSKLRTYG DERSFLLEGN EEENSKVSIG DEIPVVEDVL SINDLRSISK ENQRKEALDY
ILEGLQFTDC KNLATGNAVL VSLLVDLAIE SIKNGSNALE RNGEIIATRL LAIYENIINS
KGNGKDVLCW LVSVNFLFLA ASATRTEQVA ISQTFRHCLL SILKSLGVTY SSNGLPILVK
KSLLQLSDIL QIETPLQVQL IEVMSNVPDF HRADIFEHVI QLFATETRLT NKMKLLSYIQ
SYVERSPELD SLYDLEICIL ESMNKIDFVQ LDDLDVQVLK LVVVLSTSYD NNERVAELLF
DPKYVSPMIR YINSSYSCLN DEYRLNIALF LLGFLINFVE SDRFELQKFD DVRDNIAIFE
AIDASAKDEA SGHLIGYNSI VLTYLSLKYS EDLDIDIQHL KHKLDHFKEK IANTRIKTKI
DSLLTELAK
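
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on the first 60 bases of the gene sequence. The two 64-character amino acid strings follow the NCBI genetic code layout (bases ordered T, C, A, G); the function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout).
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
TABLE12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(aas: str) -> dict:
    """Map each codon to its amino acid for the given 64-letter string."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, aas))


def translate(seq: str, table: dict) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_60 = "ATGTCTGTTT ACGGCAAGAA CTGGAGTTCG TTTCGAAAGA GAACACGTCT TCTGGCGGAT"
print(translate(first_60, build_table(TABLE12_AAS)))   # MSVYGKNWSSFRKRTRLSAD
print(translate(first_60, build_table(STANDARD_AAS)))  # MSVYGKNWSSFRKRTRLLAD
```

Only the table 12 output matches the first 20 residues of the protein sequence above; the difference at residue 18 comes from the CTG codon.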