Gene Information |
Locus tag | PICST_39554 |
Symbol | IFH2 |
ID | 4851793 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2843959 |
End bp | 2845119 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | |
GC content | 46% |
IMG OID | 640393501 |
Product | alpha-ketoglutarate catabolism dioxygenase |
Protein accession | XP_001387107 |
Protein GI | 126275617 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
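The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 2843959
END_BP = 2845119
GENE_LENGTH_BP = 1161
PROTEIN_LENGTH_AA = 386


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a closed interval [start_bp, end_bp].

    Assumption: both coordinates are 1-based and inclusive.
    """
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS.

    Assumption: the final codon is a stop codon, which is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length:", span_length(START_BP, END_BP))
    print("expected protein length:", expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions the computed span agrees with the listed Gene Length, and the expected residue count agrees with the listed Protein Length.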
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.470371 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAGCC AAGGTACTCG CTTCGGTAAT TACGATATCC ATTTCTTTGA GGGACAAGAC GAGGTAGACT CCGACGGAGT CTTGGTCATC AACAAGAGCA ACAGAGACAG ATCTAGTCAC CCAGACTTCT TGCCTACCTG GGACCCTAAG CAAAAGTACC CACCTTTAAA GTTCTTCAAG CACGAAGACC CAGGTAGAAG AGCTGATACC TCTTTCCCAA ACTTGTTCCC CAAGGAAGGA GACTACTCTA TCAAGAAGGT TACTCCTAAG TTCGGTTCCC TGGTCACTGG TGTTCAATTG TCTCAGTTAG ACTCTGCTGG TAAGGATGAA TTGGCCCTCT TGGTCGCTCA AAGAGGTGTT GTCATCTTCA GAGAACAAGA CTTTGCCGAC AAGGGTCCAG CTTTCGCAGT TGAATACGGT AAACACTTCG GAAGATTGCA CATCCACCCA ACATCTGGTG CTCCAAGAAA CCACCCAGAG TTGCACATCA CCTACAGAAG ACCAGACAAG GGTGAGTTTG AAAGAGTTTT CTCCAACAGA ACCAACAACG TTGGATGGCA CTCGGACGTT TCGTACGAAT TGCAACCACC AGGAACCACT TTCTTCTCAG TAATTGAAGG TCCAGAATCT GGTGGTGACA CCATTTTTGC TGACACCGTC GAAGCCTACA ACAGATTGTC GCCAGAGTTC CAAAAGAGAT TGGCCGGCTT ACATGTGTTG CACACTTCTA AGGATCAAGC CTCTAACTCC AGAGGTCAAG GTGGAATTGA AAGAAGAAAG CCAGTTTCAA ACATCCATCC ATTGATCAGA ACCCACCCAG TCACCGGTGA AAAGGCTATC TTCTTGAACA AGCCCTTCGC CAGAAAGATT GTTGAATTGA AGGAAGAAGA ATCCGAGTAC TTGCTTAAGT TCTTGTTTGA CCACATTGAA TCTTCCCACG ATTTACAATT AAGAGCCAAC TGGGAACCAA ACTCGGTTGT TTTGTGGGAT AACAGAAGAA CTGTTCATTC AGCCATCATT GATTGGGACA CCCCTGTTCA CAGACATGCC TTCAGAATCA CTCCACAAGC TGAAAGACCA GTCGAAGACT TGAACGACTT GAACAAGGAA GAGTATGATG TTGGGGACTT AGAAGAAGCA TTGAAGTCCG TTACTGCTTG A
|
Protein sequence | MASQGTRFGN YDIHFFEGQD EVDSDGVLVI NKSNRDRSSH PDFLPTWDPK QKYPPLKFFK HEDPGRRADT SFPNLFPKEG DYSIKKVTPK FGSLVTGVQL SQLDSAGKDE LALLVAQRGV VIFREQDFAD KGPAFAVEYG KHFGRLHIHP TSGAPRNHPE LHITYRRPDK GEFERVFSNR TNNVGWHSDV SYELQPPGTT FFSVIEGPES GGDTIFADTV EAYNRLSPEF QKRLAGLHVL HTSKDQASNS RGQGGIERRK PVSNIHPLIR THPVTGEKAI FLNKPFARKI VELKEEESEY LLKFLFDHIE SSHDLQLRAN WEPNSVVLWD NRRTVHSAII DWDTPVHRHA FRITPQAERP VEDLNDLNKE EYDVGDLEEA LKSVTA
|
| |
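The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any computation. The following is a minimal sketch of how that might be done, along with a GC percentage and a basic open-reading-frame check. The helper names are hypothetical. The stop codons are assumed to be TAA, TAG and TGA, because the Translation table field in this record is empty.

```python
# Assumed stop codons; the record does not state a translation table.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if seq starts with ATG, ends with an assumed stop codon,
    and has a length that is a multiple of 3."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    # First two blocks of the gene sequence above, used only as a short demo.
    fragment = clean_sequence("ATGGCTAGCC AAGGTACTCG")
    print(len(fragment), gc_percent(fragment))
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_percent` would give a value to compare against the GC content field. The result may differ slightly from the listed figure depending on how that figure was rounded.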