Gene PICST_40400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_40400 
Symbol 
ID4837597 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2654561 
End bp2655835 
Gene Length1275 bp 
Protein Length371 aa 
Translation table12 
GC content48% 
IMG OID640388912 
Productpredicted protein 
Protein accessionXP_001383259 
Protein GI150864444 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGACA CTGAAGTGCT CGACTACGAA AGACAGGAAT CCGATGCCAC AGGTGACGGC 
CAGCGCGAGG GTGCGGCTGA TATGGAAGTG GAAGTAGAAG CTGAAACGGA GCCAGAAGCT
TCTAAACAGA CAACAGTTGA TCACCATGGG TTGCAAACTT CACAGACACT CGAGCCTCAG
GAAGAAGTCA AGTACGAAGT GGTGGATGAA CCCAAGTTCC AGGTCACGAG ATCGATTTTC
ATTGGCAATT TGCGTCGTCC GCTCAATGCG ATGCATTTCC AGAACTTCTT GAAAGAGCTA
GCCAAAGAGG CTGGAGACTA CATCGTGGAA AGAGCCTGGT TGAATAGAAC GAGAACCCAC
GGAATTGTTC TTGTAGACAA AGAAGAGGGA GCCAAGTTTC TCCGTGAGAA ACTTCTCGGT
ACTATCTACC CACTGGAAGA AGACGACTTC AAGTTGAAGG AAGAATATGA GATTAGAGAA
CAGGAACGGT ATGAACAGCA GAAGCTTCAA TATGAAGATG AAATGGAAAA GCTTGATACT
GAAGAAGCCA AAGCAGCATT GGAACCTCCA TTGGAGCCTA GAAAATATTC AGTAGAGAGA
CATCCTCTCT TTGTGGACTA TATTCCTGTC AAGGCTATCA ACCAATGGAT CTATGAAGAA
GATAGAGGAC CCAGAAATGG CAAGTGGAAG ATCGACTACG AGACGAAGGA TGATGAAGTA
GTTGCCAGCC ACAGTCTCTT ATCAGGTGAC TTTGTCCCAC GCTACCAAAG AGGTAGAGAC
CGCCGTGGCC GTGGCCGTGG AGAAGGTAGA TACAGAGGCT ACAGAGGAGG AGACAGATAC
GGTGGGGATC GTTATGAAAG AGACAGATAC GTAGCTGATG ATAGAGATAG AGAAAGAGAC
AACAGGGACA GAGACAGGTA CGGTGAAAGA GAAAGATACG GTGGTGACAG AGACAGGTAT
GGAGACAGAG ACAGGTACAC GGAGAGAGAC AGATACAGAG GTGGAAACGA TTACAATGGA
CACAATGACT ATCCTCCTCC AAGAAGAGGT TACAGGGGAG ACCGTTACGA TCGCGACGGT
CCCAGACCAT ATAACGCGGT GCCTCCACCT CGTCAAGACA GCTACTATCC CAGACGTGGC
CGGGACAGAG ATGCATATAT TCCTGGTGAC AGGGTGGTAG GCTCTCGTAC CGACACTTAT
GAGCCCAAAT ATCGTGACAG ATCTGATAGC AGACAGAGAA GAAACCGTTC AAGATCGAGA
TCCAGATCGC CATAA
 
Protein sequence
MSDTEVLDYE RQESDATEAE TEPEASKQTT VDHHGLQTSQ TLEPQEEVKY EVVDEPKFQV 
TRSIFIGNLR RPLNAMHFQN FLKELAKEAG DYIVERAWLN RTRTHGIVLV DKEEGAKFLR
EKLLGTIYPS EEDDFKLKEE YEIREQERYE QQKLQYEDEM EKLDTEEAKA ALEPPLEPRK
YSVERHPLFV DYIPVKAINQ WIYEEDRGPR NGKWKIDYET KDDEVVASHS LLSGDFVPRY
QRGRDRRGRG RGEGRYRGYR GGDRYGGDRY ERDRYRDRYR GGNDYNGHND YPPPRRGYRG
DRYDRDGPRP YNAVPPPRQD SYYPRRGRDR DAYIPGDRVV GSRTDTYEPK YRDRSDSRQR
RNRSRSRSRS P