Gene PICST_40872 details

Gene Information

Locus tag: PICST_40872
Symbol:
ID: 4836774
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 284076
End bp: 285833
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 12
GC content: 42%
IMG OID: 640388089
Product: predicted protein
Protein accession: XP_001382289
Protein GI: 150863724
COG category:
COG ID:
TIGRFAM ID:
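
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are inclusive and that the gene length counts the stop codon; neither convention is stated in the record, although the numbers are consistent with both. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 284076
end_bp = 285833
gene_length_bp = 1758
protein_length_aa = 585

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS of N bp holds N / 3 codons.
# Assumption: the final codon is the stop codon and is not counted as a residue.
codons, remainder = divmod(gene_length_bp, 3)

print(span == gene_length_bp)           # True
print(remainder == 0)                   # True
print(codons - 1 == protein_length_aa)  # True
```

All three checks pass for this record: 285833 - 284076 + 1 = 1758 bp, and 1758 / 3 = 586 codons, which is 585 residues plus one stop codon.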


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.788268
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.345333
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGTCT TGGCTCCTTT AGTCAGTGCC AATCGTGCTC TATTTGATGA AGACACCAAC 
CTGCAAAGAT GTTCAGGCAT GTATGCCAAA CATGACTGGG GTGGTTCTTA CAAGCCGCAG
ATTTCATTGC TGTTACTGCA ATTTGACAAA TACAAATACG ACTCCAAGAA GGACAATGCC
GAAGAACACG ACGAAGACAT TTCTGTGAGC TTCATTATTT TCGAATACAA GGACCTCGGC
AATATCGGCG TGGAATTGAG TGATGGATCC AGCAAGTACA TCTGCGATGA CTATGCGATT
GATACCTTGG GAATCTGTGA AGCTAAGCAG AAGGGTAAAT TCTTGCTCAA TTCCAATAGC
ACCAACTCTA CCATTATGAC TTCCCAGTTG ATCCATTTGG GGCCTTCTGA TATACATTAC
TCCGTCAACA GGACTGGATA CTACTGTGTC TCGACGTATA ACTTCGATAA AAAGTACAGA
GGAGTCATCA ACTTCCAGAA CGCTTTTGGC CAATTGAGTG CCTCTGAAAT TCCCAAATTG
CCAGCTTACG GTATTCTCAC TTTGTGCTAT GCTATAGCTC TAGCTTTGTT CGGGTTCCAG
TTCTTCAAAA AGAGAAAGGA AAACCAGATT CTCCCATTGC AGAGATACTT GTTGGCCATG
TTGGGCTTTT TAGCTTTTGA CACTATGGTT GTGTGGTCCT ACTACGACTT GGTCAACCGA
ACCAAGAACC CTTCCAACGC CTTCGTAACT TTCTACATGT TTTTCTTATC GCTAATGAAT
GCTGCGAAAA TCACCTTCTC GTTCTTTTTG CTTTTGTGCA TTTCCTTGGG CTACGGTGTA
GTCTTGTTGA AGTTGGACAA GAAAACAATG CTTAAATGTA AAATCTTGGG AGTTGTCCAC
TTTGTGGCCT CTATAGTGTA TTTGGTAGCC ACTTACTATG GTGGATCCTC AAAGTCAACT
ACTTCCGGAG GCAACATTGG TGAAGGAAGC ATGGGTAGCT TTTTGGGATT GTTACCTTTG
ATCCCAGTCA CTATCACATT AACAATTTAC TACATTGCGA TCTTGGTGTC TATTAAAAAG
ACCACTGCGA ACTTACACAA GCAACGCCAA ATCATTAAAT TGCAGTTGTA CGAAAACTTG
TTCAGAATTA TTTTCTTTTC TGTCGTGTTA ACCTTCGGTG GGCTCATTTT GTCTTCCATA
GTCTATTTGA GCATGTCCAC TACTGACATG ATCGAAGAAC ACTGGAAGAG TGCATTCTTT
ATTTTTGAGT TCTGGCCCAG TGTGATTTTC TTCTTCGTTT TTATGGGTAT TGCCTGGTTG
TGGAGACCTA CTGAAACAAG TTACATGTTG GCTATTTCTC AGCAATTATC CACTGGCGAA
GGATTAGACG ACGAAGCAGA TGGACAGGGT TACCAACAAG GTGGGCACGA ATTCGAATTG
GACGACTTGT CTTTAATCAG TCATAGTGAC GATGAAGCAA GGGGCCCTGG TAACGCTGAA
CACGACAGTT TCGAATTGTC CAGAGAAGCT CAACCTTTCC CCAAGTCTAC CGATGGGCCT
CCAGGATACA GTGAAGTGAA TGGAAAGGAA AACCCATTCA ATGATCCAGA GAATCCCTTT
GAAGAAAATA GCTCAAGAAC CGAAGGTAAT ACATTATTTG AATTGGGAGA AGATGACGAG
GACGATTCAC GTTTAGTTGA AGACAATGAT AATGAGGTTG TAGATGACAG ATTAAAGGAT
GCTAGACACA AAGAGTAG
 
Protein sequence
MAVLAPLVSA NRALFDEDTN SQRCSGMYAK HDWGGSYKPQ ISLSLSQFDK YKYDSKKDNA 
EEHDEDISVS FIIFEYKDLG NIGVELSDGS SKYICDDYAI DTLGICEAKQ KGKFLLNSNS
TNSTIMTSQL IHLGPSDIHY SVNRTGYYCV STYNFDKKYR GVINFQNAFG QLSASEIPKL
PAYGILTLCY AIALALFGFQ FFKKRKENQI LPLQRYLLAM LGFLAFDTMV VWSYYDLVNR
TKNPSNAFVT FYMFFLSLMN AAKITFSFFL LLCISLGYGV VLLKLDKKTM LKCKILGVVH
FVASIVYLVA TYYGGSSKST TSGGNIGEGS MGSFLGLLPL IPVTITLTIY YIAILVSIKK
TTANLHKQRQ IIKLQLYENL FRIIFFSVVL TFGGLILSSI VYLSMSTTDM IEEHWKSAFF
IFEFWPSVIF FFVFMGIAWL WRPTETSYML AISQQLSTGE GLDDEADGQG YQQGGHEFEL
DDLSLISHSD DEARGPGNAE HDSFELSREA QPFPKSTDGP PGYSEVNGKE NPFNDPENPF
EENSSRTEGN TLFELGEDDE DDSRLVEDND NEVVDDRLKD ARHKE
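
The record lists translation table 12, which explains why the CTG codons in the gene sequence appear as S (serine) rather than L (leucine) in the protein sequence. The following is a minimal sketch of that translation, not the method the database used. `CODON_TABLE` and `translate` are hypothetical helper names, and the amino-acid string is the table 12 assignment in NCBI's TCAG codon order, which differs from the standard code only at CTG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order under translation table 12.
# Only CTG differs from the standard code: it encodes Ser (S), not Leu (L).
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-nt groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 60 nt of the gene sequence listed above.
first_60_nt = "ATGGCCGTCT TGGCTCCTTT AGTCAGTGCC AATCGTGCTC TATTTGATGA AGACACCAAC"
print(translate(first_60_nt))  # MAVLAPLVSANRALFDEDTN

# Nucleotides 61-69 begin with CTG, which table 12 reads as serine.
print(translate("CTGCAAAGA"))  # SQR
```

The output agrees with the start of the protein sequence above (MAVLAPLVSA NRALFDEDTN SQR...). Under the standard code the same nine nucleotides would translate to LQR.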