Gene PICST_41273 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_41273 
SymbolSFH1 
ID4837138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1706644 
End bp1707723 
Gene Length1080 bp 
Protein Length359 aa 
Translation table12 
GC content36% 
IMG OID640388453 
ProductChromatin structure remodeling complex protein SFH1 (SNF5 homolog 1) 
Protein accessionXP_001383091 
Protein GI150864323 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCGT TGTCGTTTGA TACATCGTCA TTCTTTGTAC AAGGTTTGGC GTCAAGCTTG 
TCCAAAAGAC TATTGGATGA ACCTGATAAT TCACTACTAT TAAACGTTGC ACCAAGTGGA
CGGCAAGCAA AGAGAAATGC TCAACAGATA AACTACTCAG AAGAGTATAA TGATGATTTT
GATTTCGAGG ACAGCCCATC ATCTAATCAG ACGAAGACCA CAATTACGAC TAACCAGCAA
CAGTCCAATG TTCAGAAATC GGCTCCAGCG AGGAATACTC CATATTTCAG AGTTCTAGAT
GATGAACAAA GGTTGAATGA ATTAGCATAC AAGAATGACG TTTTAATTCC AATTAAATTG
TCTATTGAAA ATGCAAATTC AACTCATAAA TTAGTTGACT TTTTTATGTG GAATTTGACT
GAATCACTAA TCACTCCATA CCAGTTTGCA GATATATTAT GCAATGATCT AGAATTACCA
AATTCTATGA ATCTGCAGAT TGCCGAATCA ATTGTGCAAC AAATTGATGA CTACAATTAT
GCATCAAACT TACAACTACC AAGTAATGTT CCTTGTGTTG TGATAATTGA CTTGTCGGTT
AGTTTAAATA AGCATTTGTT CCAAGACAAA TTTGAATGGG ACTTAAACGG CAATGGGGTT
ACTCCAGAAG ACTTTGCTAG GATTGTGGTG GCTGACATGG GTTTCTCTCT CGAGTTCTAT
CCAGCCATTT CACATGCACT TCATGAAATA ATCATCAGGG TGAAAAAAGA AATAGTAGAT
GGAACGTATA ATAATGAAAT TCACAATTTC CACCAGGTCA GGGGGCTTAT TTTCGAGAGA
GGCATCAGGA TTTTCACAGA AAGTAGCATT CAAAATGGGA ATGATCACTG GGAACCTATT
GTTGAAATTC TTACTCCTTC AGAGATCGAA AGAAGAGAAA ACGAAAGGAT TCGAAATTTA
AGAAGATTGA AGAGAGAAAA TATGAGAAGA GATTATGATG ACTTTGGACC AAACAAAAGA
AGAAATGTTA CTGTTAGAAG GAAATTCGAT GAATTGGATG GTACCTGGAA AAATATGTGA
 
Protein sequence
MSSLSFDTSS FFVQGLASSL SKRLLDEPDN SLLLNVAPSG RQAKRNAQQI NYSEEYNDDF 
DFEDSPSSNQ TKTTITTNQQ QSNVQKSAPA RNTPYFRVLD DEQRLNELAY KNDVLIPIKL
SIENANSTHK LVDFFMWNLT ESLITPYQFA DILCNDLELP NSMNSQIAES IVQQIDDYNY
ASNLQLPSNV PCVVIIDLSV SLNKHLFQDK FEWDLNGNGV TPEDFARIVV ADMGFSLEFY
PAISHALHEI IIRVKKEIVD GTYNNEIHNF HQVRGLIFER GIRIFTESSI QNGNDHWEPI
VEILTPSEIE RRENERIRNL RRLKRENMRR DYDDFGPNKR RNVTVRRKFD ELDGTWKNM