Gene PICST_40135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_40135 
Symbol 
ID4851716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2638823 
End bp2640295 
Gene Length1473 bp 
Protein Length490 aa 
Translation table 
GC content42% 
IMG OID640393424 
Productpredicted protein 
Protein accessionXP_001387074 
Protein GI126275374 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.379103 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGAAG CTGCATATGA CAACGTCGAC TTCTCGGACG ATATTCGAGT CCAGGAGCTC 
CAATCCTGTA AAGCTATCTA CCCCAACTGC ACGATGAACT TCTCAAAATA TACTGGATCG
ATAGAGATCC CACTCAAAAA CGAAGACGGC ATAATACTTC GACTATTACC CGAGTCCCAT
CGAGATACTC CCTTATTAAC GCATAAGGTT TGCAATCTAC CTTCGCTTCT ATTCACTTTC
GAACTTCCAG AAAGATATCC GTACGAAGAA TCACTTAACT TCAACCTTAC CAGTTCAATT
TTGCACCAGA CTGTAGTAGA CTCTATGATA GTCCACTTGG AGCAAATCTG GGAAAGTTAC
CAAGACCAGG TACTTTTCAG CATGATAGAC TATTTGCACG ACCAAACTCA GAACGAATGG
GATCTGCTCA TTGGTCCCAA GTACGATGTT ACTAGTGGCC AAGAATTTCA GACCATAGTA
GACTATGACA ACGACATTAA GCAACAGGAG TACGAAACTA AGACATTTAC CTGTGAGGTG
TGCCAGGAAG ACTATAAAGG CGTTAATTGT CTGCGCTTCG ACTCATGTGG CCATACCTTT
TGTAATACCT GTTTATTTGC CTACTTCTCG TCTGTGATCC GAACCGGAGA GATAGACAAA
GTGCACTGCC CCAGTTATGA GTGCACCAAG AAGTTTGTCA AGACCAAAGA TGAATACTCC
AAGTTGGAGT CGTGGCTTAT GTCAGATACT AGAGTTGAGG AAATTGTCAG GACTTTGCTC
ACACCTGCTG TGCCGCTCAA TTTTCTCTCT AAGATATTGA CATCTGTCCA GAGTAATGAA
AGTGGTGAGA AGACAAGTGA AGACTTGGTC AATAGATATT ATACGCTCTT CAAGAAGTCG
CAGTACGAAT TCATTGGTAA ATTGCTACCT AACAGACTTG TAAAATGTCC TAGAATTGGC
TGCGACGAAG CCATATTTAG AGAAGATCTC ACAGAGCGGT TGGTAGTATG TCCCAGATGT
GCATATGCCT TTTGCAACGA CTGTCACAAC TCTTACCATG CCCGATTCAA AGTATGTAAA
AAGGTCACTT CCGAGAGTGG CGATTATCTA GGGGTGGAAG TAAAGGATAT TGAGGCATAT
ATGTCTTTAC CTAGAGACTC CTACGAGAGG AAGACTCTAA ATGCTCGTTA TGGCAGACAG
CGTATCATTC GAGCAGTAGA AGAGTACCAG ATGGACCTTC TTTTCAACAA GATGTTGAAA
GAAAGCAACG AAGTCAAGGA GTGCCCTGGC TGTGGAATCA TCATAGAGAA GTCTGATGGC
TGTAACAAAG TCAAATGTTC GCAATGTGGC ACCAATATGT GTTTCTTATG TGGAGAGATG
CTTGAGAATA ACTATGATCA TTTTGTTTCT GAAGACTCCT CTTGTTATAG GAAGTTATTT
TTTGGAATGC CAGGTGCAGA GGAAGAATCA TGA
 
Protein sequence
MTEAAYDNVD FSDDIRVQEL QSCKAIYPNC TMNFSKYTGS IEIPLKNEDG IILRLLPESH 
RDTPLLTHKV CNLPSLLFTF ELPERYPYEE SLNFNLTSSI LHQTVVDSMI VHLEQIWESY
QDQVLFSMID YLHDQTQNEW DLLIGPKYDV TSGQEFQTIV DYDNDIKQQE YETKTFTCEV
CQEDYKGVNC LRFDSCGHTF CNTCLFAYFS SVIRTGEIDK VHCPSYECTK KFVKTKDEYS
KLESWLMSDT RVEEIVRTLL TPAVPLNFLS KILTSVQSNE SGEKTSEDLV NRYYTLFKKS
QYEFIGKLLP NRLVKCPRIG CDEAIFREDL TERLVVCPRC AYAFCNDCHN SYHARFKVCK
KVTSESGDYL GVEVKDIEAY MSLPRDSYER KTLNARYGRQ RIIRAVEEYQ MDLLFNKMLK
ESNEVKECPG CGIIIEKSDG CNKVKCSQCG TNMCFLCGEM LENNYDHFVS EDSSCYRKLF
FGMPGAEEES