Gene PICST_31485 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31485 
Symbol 
ID4838659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp920069 
End bp922000 
Gene Length1932 bp 
Protein Length643 aa 
Translation table12 
GC content38% 
IMG OID640389974 
Productpredicted protein 
Protein accessionXP_001384481 
Protein GI150865318 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.607407 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0352179 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGAAG AAGACTTTAA GGATGAAGTA GCGGCTGAAA AAAACACCTA CTACTTCCAA 
GATAATAAGT ATTACCAAAA TGCTCTAGAT GTCTCTTATC GGCTTTTACC TTTCAAAGAA
TCGATGTCTA AGAAGTGGGT CCTATCCATC TTACTTAGTA TTATTCTATC ATTGCTTTCA
TTTTGTTTAT TCTTGTTTGT TCGATACCGC TATTCGAACG ATTCCGTCAG AAACTTTGGC
TATTTCGAAG TTGAGTGGGT GTCATTGCCT CCAATCGATG GAGAACCTGT GCATATATTC
GAGAGCCAAT ATGAAAGGAT TCCCTGGGAG GCAAAAACAA TGAATACTAC AGTCTTTGAG
AACCAGAAAT TAGAAAACGG ATTGAGTGCT ACTTTCACTA AATTTGAAGA CTTCGAATTT
GACTCTGTTT TCTTGACATT AAACATCACC AACAATGAAG ACGACGACAG TGAAAGTGTC
AATGTATTTG AAATATCCAT CGATGGTCAT CCAGTATGGA GAAGCTCCAC ACCTTTAACT
AAGACAGACA CTACAACCTA CTCAAGCACT AGTAAAGAAA TTTCCAAGTA TATTTCACTC
TTTGGTAAAA ATAGTAACCA ATTCAAAGTA CAATTCTTGG AAGGAAGCCA TGACAAAATT
AGTTTTTCGC TTTCTCTTAC ATTAGCAAAC TGCGAAAAAG AAAAACCTGC TATAGATTCT
CCAATCACAG TGAGTTCGCT TTTCAATTCA ACCGTGCCTG CAAACGAAAT AATAGCATTG
ACCAAAGAAA ATGGAGAAGT ATTTGACTTG TCGAAGAACG ACAAGTTCTT AATTGAATTA
CCAAAGTTCA GCGGAAAGAC ATTTGCTGCC GACCTAGAAT TATTTGTCTC CGCTGGAAAG
TCTGACGACC TTTTCAAAAG TAGAAATTCT CCGTTTCGTC GTTTGAACAT TTTCATTAAT
GAAGAACTAA TTGCTACTAT CGAACCAAAG CCAGATTTGT TTCACTCTAA CTCCATCATT
CCTAGTGGTT ACTCAGGACC TATTGCCCCT TATGCTAGCT TCACCGGACT TACGTATGAA
GTAAATTTGG CTGCCTATTT GCCACTTTTG TGGGATCAGA AATCCACCTT AGAAGTCCAA
CTTGTCTCAC CAGTTAACGA TGTGTTTCCC TTAGAGGTTG AAAAAGTATT TTCCCTGATG
GACGATTATA GTGTTAAGGA ATTTACATCT GTTTCTTCTG AAAAGTCAAT ACTGAAGGGC
GATAATCAAG TCAATACAGA ATGGTACATG TCTGGCAATA TCTTGCTTTG GGAAAACGAA
GACATTTTGA CTTCTCAAGG CGAGATATTG AGGTCTGGAA CCAACGAAAC TCATAAAGCT
GCTTTACAAT ATAGAATTTA TTCCGACAAA GTAGCTTCTA GTTCACAACA TGTTAATTCT
TACCATCAGT CGATTCTTCA ATTTTCTTTG AAGAACGAGT CCCTACTAAC TTTCACTGTG
AATCAAACGA GTTCAAATGG AGGCGAGTTT AGAGAAGATC GGCGTAATAG AAACATTTCG
CTCGATGAGT ACACCTTGCA AAGGTATAGC TCGCAGTTTA TACTGTTAGA CGTCTTTGAC
GGAAGTGCCA TTGATCTGTT ATTCCACGTA CAACGTAATC TTGGATGGAA CACAGATAAA
GTGGTCTCTA TATCTGAAGA AAATCGATGC TTAATTCACG CTTCAAATGG TTATCACGAC
AATATTTTTA GCTCAAAGGC TACTGAAATC AGAAATTTAC ATGACCATGT ATTTGAGATG
ATTAATCATC TTTCAATCTG GACATTCGAG TCATATTCAC GTAATATATT CACGCTGAGG
TTTCACAGAG ACCCAAAGTC TAAAAGCAAA TCAAGAGTAC TCGCATTCAA ATCCGACATT
CAAGTGTGCT GA
 
Protein sequence
MSEEDFKDEV AAEKNTYYFQ DNKYYQNALD VSYRLLPFKE SMSKKWVLSI LLSIILSLLS 
FCLFLFVRYR YSNDSVRNFG YFEVEWVSLP PIDGEPVHIF ESQYERIPWE AKTMNTTVFE
NQKLENGLSA TFTKFEDFEF DSVFLTLNIT NNEDDDSESV NVFEISIDGH PVWRSSTPLT
KTDTTTYSST SKEISKYISL FGKNSNQFKV QFLEGSHDKI SFSLSLTLAN CEKEKPAIDS
PITVSSLFNS TVPANEIIAL TKENGEVFDL SKNDKFLIEL PKFSGKTFAA DLELFVSAGK
SDDLFKSRNS PFRRLNIFIN EELIATIEPK PDLFHSNSII PSGYSGPIAP YASFTGLTYE
VNLAAYLPLL WDQKSTLEVQ LVSPVNDVFP LEVEKVFSSM DDYSVKEFTS VSSEKSISKG
DNQVNTEWYM SGNILLWENE DILTSQGEIL RSGTNETHKA ALQYRIYSDK VASSSQHVNS
YHQSILQFSL KNESLLTFTV NQTSSNGGEF REDRRNRNIS LDEYTLQRYS SQFISLDVFD
GSAIDSLFHV QRNLGWNTDK VVSISEENRC LIHASNGYHD NIFSSKATEI RNLHDHVFEM
INHLSIWTFE SYSRNIFTSR FHRDPKSKSK SRVLAFKSDI QVC