Gene PICST_33654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33654 
Symbol 
ID4840834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp121960 
End bp123399 
Gene Length1440 bp 
Protein Length479 aa 
Translation table12 
GC content40% 
IMG OID640392149 
Productpredicted protein 
Protein accessionXP_001386420 
Protein GI150866732 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCAG AGTCCAAAAT TGGATATCCC ACGCGAGAGT ATAGAATCCC GTATGACAAT 
TCTGCATCTC TGGACCTTCA TACTGAATGG TACTTGATTC TCACTTCCAA TAATTACCAT
TTCTATTTCA ACAGGCTATT GAAACAGTCA TACTGGCAAT TGGCAGACAT AGCTGCCGAA
TTCAAAGATG TCGACATCGA AGAATTTGTG CTGGCTATCA ACTTCGATGT TATTTCTTTG
ATGTTCGCCA GGAATGTGGG GCTTAAGGGT TTAGATGGCT ACTATTTTGA AAAGCAGGAT
GCTGATCAAG ACACGATAGA AGTAGAAGAG TTTGAAGAAG AGGAGGAAGA AATCAGGGAA
AGCGATACTG GAGATGCAGA AGAAGTAGAA ATCGACGTCG AAGCCAGAGA CGGCATGATC
AGAGAGTTCT TACTAGAAGA AGGGTACGAA GTAAAGGAAG AGAAAGCGGA AGATGAAGTC
AAAGAAGAAC CAAAAGCCCC AACGGGAATT TCTTTGGTTT CAGGATACTC TTCTAGTGAA
GAGGAAGACG ACGAATCTGG AGAAGAAAAA CCTGCTGAGA AAGGAAGTAA TTCTGTTGAA
AAAAACGATC ATCATGATAT ATCACAAGAT AAAGAAGAAC AAGTTGAAGA AGTTGACGAT
CTTCAATCTG ATGAGTCTGA TTCCGAAAAT AGTGGCCTAG ACCTCAATAT TTCGGAAGAT
GAAGATGGCT CAGAAAGATT GCAAACGTCA GCAGTTACAG AATTCATAGA GCTTTTGGAT
ATGTTTGCCG ATCGAATAGA CAAATACCAA CCTTGGGATC TAATTGAAGA AGAATTGCTT
CCTGACTTCG TCAAACATCC ACAGTACTAT GCCCTTGAAC ATGCTTCGCA AAGAGAGGAA
GCATTTGACG AATGGTTGAA GAGTAGATCC CAGAAAGAAG ATAATTCTTC AGAGAAAGAA
GTCACACAAG AACCTCCATT ATATCCTACA CCAACTTTGG ACTTTTACCA TTTCCTCCAA
GATCATAAAA AGGAACTAAA GTCGGCTACA TACCAGGAAT TTTACAATAA GAACCACGAG
CACATTAATG ATGTAGATCT CGTGTCTAAG GAGAAAGAAG CGCTTTTCCG AAAATTCAAG
ATCATGCTCC AGGACCAGAC GGAATTCGAG AAGAGCGCTA AAAAATCGAA AGCATTATCC
CCGGGAATCA ACCTAAAGAG GTATAAGTTA GACGAGTTTT TGTCTACCCA AGAATCCGTC
GAAATCCATC CCGGCCAATT ACAGGAAATT ACAAATAGCG GAAGTACAGA CTACGGAAAA
TGGCTCGCAT TAGCTAACAA ATTAAATCTC CGCAAGGAGT TAATTGAAAG CACTAAAAAC
TTCATAGTAG GTGACGAGAA GAGACTAGCA GCTTACCTAG ATAAGTTTTC AAGTAATTGA
 
Protein sequence
MASESKIGYP TREYRIPYDN SASSDLHTEW YLILTSNNYH FYFNRLLKQS YWQLADIAAE 
FKDVDIEEFV SAINFDVISL MFARNVGLKG LDGYYFEKQD ADQDTIEVEE FEEEEEEIRE
SDTGDAEEVE IDVEARDGMI REFLLEEGYE VKEEKAEDEV KEEPKAPTGI SLVSGYSSSE
EEDDESGEEK PAEKGSNSVE KNDHHDISQD KEEQVEEVDD LQSDESDSEN SGLDLNISED
EDGSERLQTS AVTEFIELLD MFADRIDKYQ PWDLIEEELL PDFVKHPQYY ALEHASQREE
AFDEWLKSRS QKEDNSSEKE VTQEPPLYPT PTLDFYHFLQ DHKKELKSAT YQEFYNKNHE
HINDVDLVSK EKEALFRKFK IMLQDQTEFE KSAKKSKALS PGINLKRYKL DEFLSTQESV
EIHPGQLQEI TNSGSTDYGK WLALANKLNL RKELIESTKN FIVGDEKRLA AYLDKFSSN