Gene PICST_30458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30458 
Symbol 
ID4838048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp114745 
End bp115983 
Gene Length1239 bp 
Protein Length412 aa 
Translation table12 
GC content44% 
IMG OID640389363 
Productpredicted protein 
Protein accessionXP_001383655 
Protein GI150864714 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.775066 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.183024 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTGTG AATCTAACGG CGATAGCAAC AGTAATGGCA GCAGGAAATG CTTTTTCAAC 
AGTGAATCCA AGACTGCTAA CAGTGAAGTA CATAATAAGG AGGAGTTTGA CAAGATGGTC
GAAAACATCA AGGACTTCAA ACAAAACGTT TCCAGTTTTG CCAAATCAGT CTTCCACTTG
ACAAACGAAT CTTTCAGCGA CTTCAACGAT TTTGCCAAAG ATTACTCGTT CAATTGGCTT
TTTGATGTTG ATGGCGATTC CTCCGGCCAC ATGAACCAAA TGCGCAAGAA CGTGAGAAAC
TTCTTCACAT CGAAGGATGT TGGCTTTGAC ACTGTTCTTT CCGAGCTTGA AGAGAGTTCC
ACCGGCATGG ATGTAGACAT CGATGCTGCC TTTCCCAAGC CGTACGAGAC ACGTGACGCT
ATCCACTCAT CTTGGTGTAA ACGTAATCAG GGTGGCTATG ACGGGATTTC AACGGGAATC
AGTGATTTCT ATGGCGGGCG GTTCCTGGAC TTAATTAGCG GCTCGTTCCG CAACGGAAGA
ACTCCTTTTG GCTACTATGC ATACAAGACA CCACTGACTA GAGCATACAA TAACTGCTTG
AACAAGGCTG GTGAATCTGT CTGGGACAGT AGAGGCTACT GGAGATGTCT ATTCCCCAAC
AGAGAAGTGC CTGTCGAATT GTTGAACTAC AAGCAAGAGA AGTTGAGAAA CACCATATTG
ACTAAAGAGG ACTTGGACAA TGCTATCCTA GAGAAGGGTG TAGATGAAAC GACTTCCAAG
GGGGTGATTG ACTTGGGAGA AAAGGGTGTT TTCTTCAGAA AGTTTGACGA TTATTTGAGC
TGGAAGAACA TCATGTATGC CAATCTTCGC CAACAGAGAG AAGAGGCTAG AAAGAAATAT
TTGGAAAGAG TCAAGACGGC TAAAGAAGTC CAGTCCCAGG CTCAACCTCC TGTGGAAACT
GACAGTAACG AGAGGAGAGT TGTTTCCAAC TCTTACCAAT CGTTCTACAC CCAGAATTCC
GAAACAAATG AGGTTGAATT GAAGGAAATT AAGACTGAAT CATTCAATGA CGGCACGTCT
TTAACAAAGA CAATTTTGAA AACCAGACCT CTTGATGCCA AAGACTGGGT AACTGTCAAA
GAAAGCGTTT CCGAAAATGG TAAAGATACG GTTTTCAACC CACAATTGGA TTCTGGCAGT
AAAGGCACCA ATGGGTGGTT CTGGAACTCA AAGAACTAA
 
Protein sequence
MTCESNGDSN SNGSRKCFFN SESKTANSEV HNKEEFDKMV ENIKDFKQNV SSFAKSVFHL 
TNESFSDFND FAKDYSFNWL FDVDGDSSGH MNQMRKNVRN FFTSKDVGFD TVLSELEESS
TGMDVDIDAA FPKPYETRDA IHSSWCKRNQ GGYDGISTGI SDFYGGRFSD LISGSFRNGR
TPFGYYAYKT PSTRAYNNCL NKAGESVWDS RGYWRCLFPN REVPVELLNY KQEKLRNTIL
TKEDLDNAIL EKGVDETTSK GVIDLGEKGV FFRKFDDYLS WKNIMYANLR QQREEARKKY
LERVKTAKEV QSQAQPPVET DSNERRVVSN SYQSFYTQNS ETNEVELKEI KTESFNDGTS
LTKTILKTRP LDAKDWVTVK ESVSENGKDT VFNPQLDSGS KGTNGWFWNS KN