Gene PICST_39554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_39554 
SymbolIFH2 
ID4851793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2843959 
End bp2845119 
Gene Length1161 bp 
Protein Length386 aa 
Translation table 
GC content46% 
IMG OID640393501 
Productalpha-ketoglutarate catabolism dioxygenase 
Protein accessionXP_001387107 
Protein GI126275617 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2175] Probable taurine catabolism dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.470371 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAGCC AAGGTACTCG CTTCGGTAAT TACGATATCC ATTTCTTTGA GGGACAAGAC 
GAGGTAGACT CCGACGGAGT CTTGGTCATC AACAAGAGCA ACAGAGACAG ATCTAGTCAC
CCAGACTTCT TGCCTACCTG GGACCCTAAG CAAAAGTACC CACCTTTAAA GTTCTTCAAG
CACGAAGACC CAGGTAGAAG AGCTGATACC TCTTTCCCAA ACTTGTTCCC CAAGGAAGGA
GACTACTCTA TCAAGAAGGT TACTCCTAAG TTCGGTTCCC TGGTCACTGG TGTTCAATTG
TCTCAGTTAG ACTCTGCTGG TAAGGATGAA TTGGCCCTCT TGGTCGCTCA AAGAGGTGTT
GTCATCTTCA GAGAACAAGA CTTTGCCGAC AAGGGTCCAG CTTTCGCAGT TGAATACGGT
AAACACTTCG GAAGATTGCA CATCCACCCA ACATCTGGTG CTCCAAGAAA CCACCCAGAG
TTGCACATCA CCTACAGAAG ACCAGACAAG GGTGAGTTTG AAAGAGTTTT CTCCAACAGA
ACCAACAACG TTGGATGGCA CTCGGACGTT TCGTACGAAT TGCAACCACC AGGAACCACT
TTCTTCTCAG TAATTGAAGG TCCAGAATCT GGTGGTGACA CCATTTTTGC TGACACCGTC
GAAGCCTACA ACAGATTGTC GCCAGAGTTC CAAAAGAGAT TGGCCGGCTT ACATGTGTTG
CACACTTCTA AGGATCAAGC CTCTAACTCC AGAGGTCAAG GTGGAATTGA AAGAAGAAAG
CCAGTTTCAA ACATCCATCC ATTGATCAGA ACCCACCCAG TCACCGGTGA AAAGGCTATC
TTCTTGAACA AGCCCTTCGC CAGAAAGATT GTTGAATTGA AGGAAGAAGA ATCCGAGTAC
TTGCTTAAGT TCTTGTTTGA CCACATTGAA TCTTCCCACG ATTTACAATT AAGAGCCAAC
TGGGAACCAA ACTCGGTTGT TTTGTGGGAT AACAGAAGAA CTGTTCATTC AGCCATCATT
GATTGGGACA CCCCTGTTCA CAGACATGCC TTCAGAATCA CTCCACAAGC TGAAAGACCA
GTCGAAGACT TGAACGACTT GAACAAGGAA GAGTATGATG TTGGGGACTT AGAAGAAGCA
TTGAAGTCCG TTACTGCTTG A
 
Protein sequence
MASQGTRFGN YDIHFFEGQD EVDSDGVLVI NKSNRDRSSH PDFLPTWDPK QKYPPLKFFK 
HEDPGRRADT SFPNLFPKEG DYSIKKVTPK FGSLVTGVQL SQLDSAGKDE LALLVAQRGV
VIFREQDFAD KGPAFAVEYG KHFGRLHIHP TSGAPRNHPE LHITYRRPDK GEFERVFSNR
TNNVGWHSDV SYELQPPGTT FFSVIEGPES GGDTIFADTV EAYNRLSPEF QKRLAGLHVL
HTSKDQASNS RGQGGIERRK PVSNIHPLIR THPVTGEKAI FLNKPFARKI VELKEEESEY
LLKFLFDHIE SSHDLQLRAN WEPNSVVLWD NRRTVHSAII DWDTPVHRHA FRITPQAERP
VEDLNDLNKE EYDVGDLEEA LKSVTA