Gene PICST_88330 details


Gene Information

Locus tag: PICST_88330
Symbol: YHB1
ID: 4838239
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 1654530
End bp: 1655864
Gene Length: 1335 bp
Protein Length: 401 aa
Translation table: 12
GC content: 42%
IMG OID: 640389554
Product: flavohemoglobin
Protein accession: XP_001383940
Protein GI: 150864925
COG category: [C] Energy production and conversion
COG ID: [COG1017] Hemoglobin-like flavoprotein
        [COG1018] Flavodoxin reductases (ferredoxin-NADPH reductases) family 1
TIGRFAM ID:
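
The length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive (the reported gene length is consistent with that reading) and that the coding region ends in a single stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1654530
end_bp = 1655864
protein_length_aa = 401

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Coding bases implied by the protein: 3 per residue plus one stop codon.
coding_bp = 3 * protein_length_aa + 3

# Bases in the reported span that the coding region does not account for.
flanking_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, flanking_bp)  # 1335 1206 129
```

Under these assumptions the 1335 bp span is longer than the 1206 bp needed to encode 401 residues and a stop codon, which suggests that the listed gene sequence includes some sequence on either side of the reading frame.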


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.408575
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TACAAAACAA CTAAGTCTCT TAACCACACC GTTGCTACTC TGCCTCTATG ATGTCTACTG 
CTCCACAAAT ATACACCATT CAAGAATTGA CTGACTCTCA AAAGAAAATC GTCCTTGATA
CGGTGCCCAC CTTGGAGCTG GCTGGTGAAA CATTGACTGC CCAGTTCTAT CAGAACATGT
TTGTCGATTT CCCAGAAGTC AGGCCTTTCT TCAATCAGAC GGACCAGAAG TTCTTGAGAC
AGCCTCGTAT TTTGGCATTT GCCTTGTTGA ACTACGCCAA GAACATCGAG AACTTGGAAC
CCTTGACTGC GTTTGTGAAG CAGATTGTTA GCAAGCATGT AGGTTTACAG GTAAAGGCTG
AACACTACCC TTGTGTTGGT AACTCATTGA TTAAGACCAT GAAGGAGTTG CTTGGACCAG
AAGTTGCCAA CGAAGCCTTT ATTGATGCAT GGGCAACTGC GTACGGAAAC TTGGCTCAAT
TGCTCATTGA CATGGAAGAC GCCGAATATC AAAAAGCCCC TTGGAGAGGT TTCAGAGAAT
TCACTGTAAC TAAAATCCAA GACGAATGCA CCGACGTTAA GTCGATATAT TTCAAGCCTA
CAAATGAAGG CGACGAGATT TCCTTGCCAA AGAGAGGTCA ATATCTTTGC TTCAGGTGGA
GCTTGCCAGG AGAAGAGCAA GAAATAAGTA GAGAATATTC TATCTCTGAG TACCCCTCTG
AAAAAGAGTA CCGTATCTCT GTTAGAAAGT TGGAAGGTGG TAAGATCTCG GGTTACATCC
ACAACACTTT AAAGGTTGGA GACTCTCTCA AAGTAGCTCC TCCTTGTGGA AAATTTGTTT
ATGTACCCTC TGAAAAGGAT ATAGTTTTGC TTGTAGGAGG TATTGGAATC ACCCCCATTG
TATCTATCTT GGAAAAAGCT TTACAACTGG GAAGAAACGT TACCATGCTC TACTCTAACA
AAACTGTCGA ATCTAGACCT TTTGGCAACT GGTTAAAGGA ATTGAAGGAG AAGTATGGAG
AAAAGTTTAA GCTTACCGAA TTTTTCTCTA ATGAGAAGAA TGTTACTGCC AAAGATGTCA
TCGATGCAGT TGAGACTCGT ACGTTGGACA GCAGGGACTT GGACCAGATT TCCAAGGACA
GCGACGTCTA TTTATTGGGA CCTCGTGAAT ACATGAAGTA CGTTAAGGGG TATTTGGGGG
CTAAGGGTGT CGAAGACATC AAGTTAGAGT ACTTTGGCCC ACTCGAGGTT TAGAGATACA
TTTTATACGT TGATCAATAT GGATCTTGAA TATACGATAT GGACTATGAA TGAGGCACAA
ATAAAGAAAC ATAGT
 
Protein sequence
MMSTAPQIYT IQELTDSQKK IVLDTVPTLE SAGETLTAQF YQNMFVDFPE VRPFFNQTDQ 
KFLRQPRILA FALLNYAKNI ENLEPLTAFV KQIVSKHVGL QVKAEHYPCV GNSLIKTMKE
LLGPEVANEA FIDAWATAYG NLAQLLIDME DAEYQKAPWR GFREFTVTKI QDECTDVKSI
YFKPTNEGDE ISLPKRGQYL CFRWSLPGEE QEISREYSIS EYPSEKEYRI SVRKLEGGKI
SGYIHNTLKV GDSLKVAPPC GKFVYVPSEK DIVLLVGGIG ITPIVSILEK ALQSGRNVTM
LYSNKTVESR PFGNWLKELK EKYGEKFKLT EFFSNEKNVT AKDVIDAVET RTLDSRDLDQ
ISKDSDVYLL GPREYMKYVK GYLGAKGVED IKLEYFGPLE V
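
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The following is a minimal sketch of how fragments of the gene sequence map onto the protein sequence under that table. The helper name `translate_table12` is hypothetical, and the amino-acid string is the table 12 assignment in TCAG codon order as published by NCBI. It is an illustration, not the pipeline used to produce this record.

```python
from itertools import product

BASES = "TCAG"
# Table 12 amino acids in TCAG codon order; it differs from the
# standard code only at CTG, which is Ser (S) here instead of Leu (L).
TABLE_12_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate_table12(dna: str) -> str:
    """Translate an in-frame DNA string with table 12 (hypothetical helper).

    Whitespace is ignored, translation stops at the first stop codon,
    and any trailing partial codon is dropped.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE_12[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Fragments copied from the gene sequence above, already in frame.
n_terminus = translate_table12("ATGATGTCTA CTGCTCCACA AATATACACC")
ctg_region = translate_table12("TTGGAGCTGGCTGGTGAA")
c_terminus = translate_table12("GGCCCACTCGAGGTTTAGAGATACA")
print(n_terminus, ctg_region, c_terminus)  # MMSTAPQIYT LESAGE GPLEV
```

The middle fragment contains a CTG codon and yields `LESAGE`, matching the `LESAGE` stretch in the protein sequence. Under the standard code the same fragment would give `LELAGE`. The listed gene sequence carries bases on either side of the reading frame, so the full sequence would need to be trimmed to the first ATG before being passed to a function like this.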