Gene PICST_33555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33555 
Symbol 
ID4840602 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp946233 
End bp948277 
Gene Length2045 bp 
Protein Length572 aa 
Translation table12 
GC content40% 
IMG OID640391917 
Productpredicted protein 
Protein accessionXP_001386370 
Protein GI150866694 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTCG ATGATAGAGT TTCTCGTAAG AAGTCCCGTG TCCGAGGATC GGGGTAAGTG 
TTTTGATTGG AAAATTGAGG AAAATCGAAA GAATAAGCTC ATATATCTCA TATACCTAGA
GTTTCTGGAA TGATTTCATA TGCGAGTTAA GACAACAAGA ATTATATCTT TATTGAAATC
AAATGGTCGA TTGGCTCCAT TTGAAATAGG ATCTACTTTA TAAGAGATCA TGAGTACGAA
AGCATACCTT TTCTTTATGT GCTATGCTAT CAACATAGTA TTTGATCAAC TTCTATTCCG
ATACTAATTA ATTGAAACAA CCAATGATAA TTACGATTTC CGAGATGATT TCAGGATCGA
AGTACTAACA CCTTTTAAGA AAACGAAATG TTCTAGTGGA TGATTTCTTC GTGCTAAACA
AACAATCCAC TTCAGACAAA CCAAGTAAGA AACGACGTCT CACGGACAGT CTTAAGTCTG
CATTGAACCA TTGGGATAAC ATCGAAAGTC TTCCCAGACT TCGAAATCGA AAAGTTTCGG
CAATTGGAAA AGCAGATTTG GACAATGCTA ATTCTAAGAC TACTACACCA GTACCTCAAG
ATAGGGGAGA GCAAGAAGAG TCTCAACCTC AGAACAAACT GGTTCATTCT GTGGCCCTTC
ACTCTGAGGA GGCTGGGGAG TTGTTAACTC CGTCTCATTT GATCAACCAG TCTATCTATG
ACGAAATTGA GTACACAGGA AATCATAATC AGGATGAAAG TCTCCAGGTA ACGAACAGCG
ATGTTTCGGT AAGAAAAAGT GCAACTGATA AAACAAACAG TGGACGTGGA AGACCAAAAA
AGAGAGCAAG AGTCAATGTA GGTCGTCCAC GAAAAGTGGA CAAGCCTAAG AACAATTTGA
ATAGCACTAC TGAGGCTGCT CAGAAAGATA CAATAGAGAA CAACGAGCAC GATTCTTCAG
CTGAAACGTC GTTTCGAAGA GAGAGTTTAA GAAAGACAAG GAGAATAAGC TACAAAGAAA
TGGTCAGCGA CAACGAGAAA GAAGAGTCCA GTGAAGACGA GAAGATAGAG TATGCATTCC
GATCGCTAGC TGCACGAACG TTGCGACAGA AGTCGAGACT CAAGCAGTTT CTTGGAGGTG
TAGATTCGCC AGAAGAAGCA GAAATTGTAG AAGAACCACA TACAAGGATC AGAAAAATTC
GAGATAATGT GAAGAAGCGA CAGAATGAGT TGAGAGAATC GCAAGAAAGG AATAAAGTAT
CGAATGGAAA GACATTAAGA AAATCAAAAT CAAGTGACAG GACAACAACA AGCGACAATA
CAAAATCAAA TGGAAATAAG TCAAGAAAAG CAAAATCCAA GTCTAGGGTC GAGAGCACTG
AAACTGGCCC TGAGAGTAGA ATTGAAAGAG TACCATCACG AGTGAGGGAA AGAGTAGCGC
CTATTCCGTT ATCGAGAAAG AGGCGGAACG AACGTCAGAA ACCAATGAAT ATTGATGTTG
AGAGATTGAG AGATGAAGAA AACAAAGACA AACGCGTCAA GATTCACACA ATCGATGTAC
TTAGACATTT AGTCAAAGAG TACGAACCGG AAGAAACTGC CTCAGAAGTA ATTAGAGAAC
AAGTTGTTCA GGAAGACTTC AAGGCACATC TTGTCCATCA GCTAGACTAT CTTATGGATG
TTCATTCCGC TATAAACGAT ATCACTACAA GAATCAACGA GGTTCAGAAG TTGAAAAACG
AATACCGACA AAGGATATAC ACGTTGAAAC AGAATCATGT TGATGTGGGT ACCAAATTAA
ACACATTAAG AAGTCAATAT AATCGAGACA AGGATAGACA TGCCGAAGTT CAAATGGTCG
AAACCGAAAT GAAGTCGTTG CAACAGATAG GCAATACTAC AGAGGATGCA AAGCTGTCGT
TGAGCCAACA GGTTACAGTT GCGTTGAGCC GTGCATCGTC GATTGTGAAT CCTTCTGCTG
GAGTCCTACG TAAACTTCAG ATTGTAAACC AGAAGCTTGT TGACCTCGAC AAGGAACTAT
TATAG
 
Protein sequence
MAVDDRVSRK KSRVRGSGKR NVLVDDFFVL NKQSTSDKPS KKRRLTDSLK SALNHWDNIE 
SLPRLRNRKV SAIGKADLDN ANSKTTTPVP QDRGEQEESQ PQNKSVHSVA LHSEEAGELL
TPSHLINQSI YDEIEYTGNH NQDESLQVTN SDVSVRKSAT DKTNSGRGRP KKRARVNVGR
PRKVDKPKNN LNSTTEAAQK DTIENNEHDS SAETSFRRES LRKTRRISYK EMVSDNEKEE
SSEDEKIEYA FRSLAARTLR QKSRLKQFLG GVDSPEEAEI VEEPHTRIRK IRDNVKKRQN
ELRESQERNK VSNGKTLRKS KSSDRTTTSD NTKSNGNKSR KAKSKSRVES TETGPESRIE
RVPSRVRERV APIPLSRKRR NERQKPMNID VERLRDEENK DKRVKIHTID VLRHLVKEYE
PEETASEVIR EQVVQEDFKA HLVHQLDYLM DVHSAINDIT TRINEVQKLK NEYRQRIYTL
KQNHVDVGTK LNTLRSQYNR DKDRHAEVQM VETEMKSLQQ IGNTTEDAKS SLSQQVTVAL
SRASSIVNPS AGVLRKLQIV NQKLVDLDKE LL