Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_36516 |
Symbol | |
ID | 4839895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 86124 |
End bp | 87161 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640391210 |
Product | predicted protein |
Protein accession | XP_001385707 |
Protein GI | 150866198 |
COG category | [R] General function prediction only |
COG ID | [COG1355] Predicted dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGTATA TCCGCCCTGC CACACATGCC GGCTCGTGGT ACTCAAACAA TCCTACCAAG TTGGGGTTGC AGTTAGAAGC CTACTTTCAC AAGGCTGAAT CACATAGCGG AGAAGACTCC AGACACATAA TACCTGGTGC ACGAATCTTA ATAGGTCCCC ATGCTGGCTT TGCCTATTCT GGTGAACGTT TGGCTGAAAC TTTTACTGTA TGGGACACTT CTAAAGTAAA GAGAATCTTC ATGTTGGGAC CTTCTCATCA TGTTTATTTC AAGAATTCGG TGATGGTGTC GCAGTTTGAA TGGTACGAAA CTCCGTTCGG TAATATTCCC GTAGACACCG AAACGATCGA GAAGTTGCTC CACACCAAGC CGCAGTCACA TGGCCACTCT CTTACACATG CAAAAGATTC TGTGTTCAAG TACATGAGTG AAGAGATGGA TGAAGACGAA CATTCGTTTG AAATGCACGC GCCTTTTATC TACCAAAAGA CCCACGATTT GCCCCAGGGC ATTCCCAAGA TCATTCCCAT ACTTATCAGT GGAATGGATG AGAAGTTGAA CGATGAGGTG GTGTCGGCTT TGTTGCCCTA TCTCGAAAAT GAAGAGAACC ACTTCATCAT CAGTCTGGAC TTCTGCCACT GGGGCTCTCG TTTCGGATAC ACCAAATATG TTCCTCAGAA GGTCGACTCC CTTCAGCTCC TCACCGAAAA CTTATCGAGC TTGGGCCATT CATTGAGAAC CAAACCCAAC GAATTACCCA TATATAAGTC AATAGAGGTG TTGGATAAAG CTGCGATGGA AATTGCTTCA CTGGGAAGCT ATTCTGACTG GAAAACCTAC ATTTCTCAAA CAGGAAACAC TATCTGTGGC CAGAAGCCCA TCGCAGTGGT GTTGAAGTTG ATTCAAAAGT ATAGATTGGC TGCCGGTGAT ACAGATAAGG CAGCCATCTT TAAGTGGATA GGCTATTCTC AGAGTAACCA AGCACGTAGG GCTTCGGATT CGAGTGTCTC ATATGCTTCT GGTTATGTTA CGATTTGA
|
Protein sequence | MSYIRPATHA GSWYSNNPTK LGLQLEAYFH KAESHSGEDS RHIIPGARIL IGPHAGFAYS GERLAETFTV WDTSKVKRIF MLGPSHHVYF KNSVMVSQFE WYETPFGNIP VDTETIEKLL HTKPQSHGHS LTHAKDSVFK YMSEEMDEDE HSFEMHAPFI YQKTHDLPQG IPKIIPILIS GMDEKLNDEV VSALLPYLEN EENHFIISSD FCHWGSRFGY TKYVPQKVDS LQLLTENLSS LGHSLRTKPN ELPIYKSIEV LDKAAMEIAS SGSYSDWKTY ISQTGNTICG QKPIAVVLKL IQKYRLAAGD TDKAAIFKWI GYSQSNQARR ASDSSVSYAS GYVTI
|
| |