Gene PICST_33809 details

Gene Information

Locus tag: PICST_33809
Symbol:
ID: 4840812
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009048
Strand:
Start bp: 533206
End bp: 534525
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 12
GC content: 44%
IMG OID: 640392127
Product: predicted protein
Protein accession: XP_001386695
Protein GI: 150866933
COG category:
COG ID:
TIGRFAM ID:
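
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS includes its stop codon; neither convention is stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 533206
end_bp = 534525

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with a stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1320
print(protein_length)  # 439
```

Under those assumptions the results agree with the reported Gene Length (1320 bp) and Protein Length (439 aa).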


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0973496
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAGACC TGGAAAAGTT ACAGAATCAC TTCAGGGGCC ACGACGAGTT GAGTGAAATT 
CTTCACCAGG ACGAACTCCG CAAGAGTCTT ATCCGCTTGG CCGTGCTTCG TGTAGTCACG
CTCAACAGAA ACAAGCCCAA GAGAGCAAAG ACCAACGAAC CCGATGAGCT TGTCCAAATG
AAAAAACAGC TCAATGAATA CGAATTATAC ACAAACAAGT TGAAACAGGA GAGCCTGACC
AAGTTGAAAT TGCAGGAGGC TAAACTTGAA AGTGCACAAG ACGAAGTGGC CAAACTCAAA
CTACAGCTAG CAGCAGCCAA AGAAGCAGCC AAAGATACAT CTAAACCAAT GTTCAGTGTG
CCTAGTTCAA GTTCGTTTAG ACCGCGAATC AACCTCAACG GGCTCTCAAG ACCGAAATCA
GCAACATTAT CTTCTATCAA AAAGTTTCCA TTAGCCAGAC CTACGTCTGT GTCCAGTGAA
AGAAACTATC TATCGCCCAC TTTCAACTCT ATTAACAAGT CAATATATTC GTCAGATGTT
TCCACAGTTT TGACACCCAT ACTGAATAGG ACTATTAGTA AACCACGTGG AAAATACATC
ACAGCCAGGA ATTTACACGA ATTGGGCAAT TCTCCAGTGA CGTCTAAGTT TGGAATTTCG
AAGCCACCAA CAAAATTGAC GACACTAACC CAAAAAATAG AAGCACAGAA AGACGAAAAC
GAGCCTCCAA AAACTGCAGA AGCAGAAACC GAAGCAGAGA TCGGAGCGAA ATCTGCTACT
ACTTCTGTCT TACGAAGTTC ACCTGCAAAA ACTCCTTCAC GTAAATCTTT TATAGAGAAC
TTCGACAAAT CATCAGGTTC AAGCTCGCCT TCTCCGGAAT TCACCCCAAT GAGGGTTTCT
TCAGACAAAA CTTCGGCTCC AGTGTCACGA GACATTGAAG AGACCGGATT CGGCAACGGA
AAAGTCACGA GAATCGAAAA ATTCGACAAT ACTTTACAGA CTGACGAGGA TACTTTTGCT
AGTGCCAACT CGACACTTGT AGGAAATGTT TCAGGAGACG TATTGCCTGA AAAGAAGAAG
ACGAAGAAGT TGCAATTGTG GAAATCAGGA GCTACTAAAG TGCCCTTAAC GGCTCCAGGA
AAGAAACCGC ATAGTCTTGG GCTCGAAGAT GAGAATCTCA ACTCTTTGAA TTACTATGAA
GATGGAAACT TTGCAACGGA CGAAAGTCCA CCCAAGCCGC AGCATAAGAG ACAATTAGAG
TTGTCCCCCG TTCCAGAGCC TGCTAAGCGT CGGAAACATA ATACGTTCAG GATAGACTAA
 
Protein sequence
MIDSEKLQNH FRGHDELSEI LHQDELRKSL IRLAVLRVVT LNRNKPKRAK TNEPDELVQM 
KKQLNEYELY TNKLKQESST KLKLQEAKLE SAQDEVAKLK LQLAAAKEAA KDTSKPMFSV
PSSSSFRPRI NLNGLSRPKS ATLSSIKKFP LARPTSVSSE RNYLSPTFNS INKSIYSSDV
STVLTPISNR TISKPRGKYI TARNLHELGN SPVTSKFGIS KPPTKLTTLT QKIEAQKDEN
EPPKTAEAET EAEIGAKSAT TSVLRSSPAK TPSRKSFIEN FDKSSGSSSP SPEFTPMRVS
SDKTSAPVSR DIEETGFGNG KVTRIEKFDN TLQTDEDTFA SANSTLVGNV SGDVLPEKKK
TKKLQLWKSG ATKVPLTAPG KKPHSLGLED ENLNSLNYYE DGNFATDESP PKPQHKRQLE
LSPVPEPAKR RKHNTFRID
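
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. This explains why the fourth residue of the protein is S although the fourth codon of the gene is CTG. The following is a minimal sketch of that difference, using only the first 30 bases of the gene sequence above; the helper names (`build_table`, `translate`) are hypothetical and are not part of any database tool.

```python
# Codon tables written in the conventional TCAG ordering.
BASES = "TCAG"
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

def build_table(aas):
    """Map each of the 64 codons to its one-letter amino acid."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, aas))

STANDARD = build_table(STANDARD_AAS)
# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
TABLE_12 = dict(STANDARD, CTG="S")

def translate(dna, table):
    """Translate a DNA string codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 bases of the gene sequence, as grouped in the record.
prefix = "ATGATAGACC TGGAAAAGTT ACAGAATCAC"

print(translate(prefix, TABLE_12))  # MIDSEKLQNH
print(translate(prefix, STANDARD))  # MIDLEKLQNH
```

Only the table 12 result matches the first ten residues of the listed protein sequence.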