Gene PICST_28220 details


Gene Information

Locus tag: PICST_28220
Symbol: SIS2
ID: 4850997
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 664382
End bp: 666736
Gene Length: 2355 bp
Protein Length: 784 aa
Translation table:
GC content: 46%
IMG OID: 640392705
Product: Halotolerance protein HAL3 (contains flavoprotein domain)
Protein accession: XP_001387767
Protein GI: 126273958
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0452] Phosphopantothenoylcysteine synthetase/decarboxylase
TIGRFAM ID:
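
The start, end, gene length and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 664382
end_bp = 666736

# Assumption: start and end are inclusive coordinates.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.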


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCCTC CCCAATCCCT GAAATCGCAC GGACAGGCTA GCACTGAATC TCCAGATTCT 
CAACAATTGC AAAACAACAA AGCCACCGAG CTCAACCATA CTAAGCATAT AATCCTGGGC
GAACCAGGCG CATCACCTAC ACGAACTTCA AATGCGCCCG GGGCGCCAGC ACTAAGAATT
GATACTGTTC AATCGGAAGC GCCTCATTCT TCCAATCCTT CTGAATCGGC TGAACCCATT
CCAGAAATCA CCACTGAGTC CCTGAGCCTG AATTTGTCTC TGTCCAAAAG CCACCCCAAG
TATACGTCGT CAGCTCTGAC TCTGTTTGGC TCTAGCATAC ATTCTCCAAT CCCACACAAG
ATATTCAGAC CTGCTGTAGC AGCAACTCCT GAATCCAGTA ACGATGCGCT TGTTCGCGAA
GTGAAGAAAG ACGCAAATGC AAACGACACA AATGGAAAAG ACGCAAACGT GAAAAATGCT
GCTAATTCAA GCTCTGGCGC TACTGGAATT TCCATAGGCT CGGCCAACTC TGCTGTTTCT
TTGAATTCCA CGCCTGCCCC TAGACGAAAA TCTACCATCA CAGCCTTGAA GGTGAAAGCA
GACCCAACGT TGAAGTCGTG CTTATCCCCT ACTTCCGACG CTACCAGCAC TCTCAATCTG
ACTCTATCGA TGCTTCCCCA TAACGAACTG ATTTCGCTCT TGGACAATAC CCAACATCCA
TCGTTTTTGA TCAAGTCCGA CAATATCAGC CCTAATAAGC ACGGGTTACT AGGAATTAAT
CAAAAGGGTA CCGCTACACC CTCCTACGAT GACGAACATG CTCGTTCTCA CGTATCATAT
ACTCCCTTAG TGACTAGTAA TTCAGCCTCT GTAGTCTCGT CTGTGCCCGT ATCTCCTCCT
AGGAACACAT CTAAGTCGCC TCCGTCACGA ATGAACTCAT TGCGGTACTC CAACCAAGAC
GAGAAGCGCG TGATAGAAAG CATGCGTCTT TCAAATAACA GCTCTTTAGC TTCTGAAGAA
AGGCAGAAAA GCTTTGGCTC TATCTCTGGT TCCGGAATTG GGTCTGTCTC AGGTTCATTT
TCTGATCCTG GAGCTGGTCC AGGAGCTGCA ATCACAGAAG AAAATGGTCC TGCTATAACC
TCAGATGATA ACTCGCGAAA TAATAAGTTG CATTTGCTTA TAGGTATCAC CGGTTGTATT
TCCATCCACA AGAACATTTT TCTTATGATT GACAAGTTCT TCGAGCTCTA CACTCACGAT
AAGTTGGAGA TCCAGGTGAT ACTCACAAAA TCGGCAGAAT GGTTTCTCAC TGACAAGTTG
CACAAGTTCG AACTGTTGGG TGTCAAGGTG TGGTTCAGTG GAGACGACGT TAAGTACTTC
CTTTCCAGTC CATTTCAGAA GCACGTCGCT CTTTTCCAAA ACCCCAATTC TGCCTACAGA
CCCAAGCTCA CATCTAATCT CTTGAGCCAA TACAGTTTGG CCCACGATTT ACAGAAATGG
ACCGATGTTT TCTTGTTGGC CCCATTATCT GCGAATACCA TGGCCAAGTT GATTGGTGGG
TTGTCAGACG ATTTTTTGAC CAATTTGTTG CATGTTTGGC CCATTCCACA GTCGAACCAA
ACTCTCGCTG TTCCCGAAGG CAATTCGACG ATCTTGTCTA ACTCCTTGGT TTGTCCTAAA
CCTATTGTTG CGGCGTTAGC TCTTACCAAC TCCATGTATT CACACCCAAT CACCAAGAAA
CAGTTGGTGC TATTGCAAGA AAGCTACCCC AACATGAGTA TCTTGAAACC GGTGGAAAAA
TGTGTTGATA TAGACGGTAA TATTTCCATG GGAGGAATGC GCGCTTGGAG AGAAGTGGTG
GACTTTGTTT GCAAGAAATT GGGACAACCT CCACAAGATG AAGACGACGA AGATGACGAA
GATGACAACG ATGGTGACGA AGAGGAACTA GACCAGGACG AACAGGCTGA TCAAGATGAA
GAAGATGAAG ATCAAAGCAG CTACAGTCAG ACGGAAGCTG ATGGTATCAA AACGGCATCC
TCAGAGAAAT CCAGATCAGA AGTGAAGACG GCGCCTGTTA TTGATTTGAA GTCTGAGCAA
AAGCCCAAAC TCAAAGAAGA AAAGAAGGAG CCATCTCTTG AGATCCACGA GGATTTGACG
TTGCAGCCAA TCATCAGCAA CAGAGACGAA AAGAGATCTT TGTCCAATGG TGGCCCAGAG
TCCTCAACTC GGCATAGAAG AAATACAATA ACTAGAAAGG AACTCCAAGA ACATGAGAAG
TTGGCTCTAC AGAATGCCAG ACTCAATGCC GGCTTGGGCG TTTCTCCTAT ACCTGTGAAA
CCGCAACAAA ACTAG
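
The gene sequence is printed in blocks of 10 bases separated by spaces and line breaks, so it has to be joined before any length or GC calculation. The sketch below shows one way this could be done; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first 30 bases of the sequence, so its result is not the gene-wide GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three 10-base blocks of the gene sequence, in the page's layout.
example = "ATGGATCCTC CCCAATCCCT\nGAAATCGCAC"
seq = clean_sequence(example)
print(len(seq), round(gc_percent(seq), 1))
```

Applying the same two functions to the full sequence text would give a value that can be compared with the GC content field.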
 
Protein sequence
MDPPQSLKSH GQASTESPDS QQLQNNKATE LNHTKHIILG EPGASPTRTS NAPGAPALRI 
DTVQSEAPHS SNPSESAEPI PEITTESLSL NLSLSKSHPK YTSSALTLFG SSIHSPIPHK
IFRPAVAATP ESSNDALVRE VKKDANANDT NGKDANVKNA ANSSSGATGI SIGSANSAVS
LNSTPAPRRK STITALKVKA DPTLKSCLSP TSDATSTLNL TLSMLPHNEL ISLLDNTQHP
SFLIKSDNIS PNKHGLLGIN QKGTATPSYD DEHARSHVSY TPLVTSNSAS VVSSVPVSPP
RNTSKSPPSR MNSLRYSNQD EKRVIESMRL SNNSSLASEE RQKSFGSISG SGIGSVSGSF
SDPGAGPGAA ITEENGPAIT SDDNSRNNKL HLLIGITGCI SIHKNIFLMI DKFFELYTHD
KLEIQVILTK SAEWFLTDKL HKFELLGVKV WFSGDDVKYF LSSPFQKHVA LFQNPNSAYR
PKLTSNLLSQ YSLAHDLQKW TDVFLLAPLS ANTMAKLIGG LSDDFLTNLL HVWPIPQSNQ
TLAVPEGNST ILSNSLVCPK PIVAALALTN SMYSHPITKK QLVLLQESYP NMSILKPVEK
CVDIDGNISM GGMRAWREVV DFVCKKLGQP PQDEDDEDDE DDNDGDEEEL DQDEQADQDE
EDEDQSSYSQ TEADGIKTAS SEKSRSEVKT APVIDLKSEQ KPKLKEEKKE PSLEIHEDLT
LQPIISNRDE KRSLSNGGPE SSTRHRRNTI TRKELQEHEK LALQNARLNA GLGVSPIPVK
PQQN