Gene PICST_36516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_36516 
Symbol 
ID4839895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp86124 
End bp87161 
Gene Length1038 bp 
Protein Length345 aa 
Translation table12 
GC content45% 
IMG OID640391210 
Productpredicted protein 
Protein accessionXP_001385707 
Protein GI150866198 
COG category[R] General function prediction only 
COG ID[COG1355] Predicted dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTATA TCCGCCCTGC CACACATGCC GGCTCGTGGT ACTCAAACAA TCCTACCAAG 
TTGGGGTTGC AGTTAGAAGC CTACTTTCAC AAGGCTGAAT CACATAGCGG AGAAGACTCC
AGACACATAA TACCTGGTGC ACGAATCTTA ATAGGTCCCC ATGCTGGCTT TGCCTATTCT
GGTGAACGTT TGGCTGAAAC TTTTACTGTA TGGGACACTT CTAAAGTAAA GAGAATCTTC
ATGTTGGGAC CTTCTCATCA TGTTTATTTC AAGAATTCGG TGATGGTGTC GCAGTTTGAA
TGGTACGAAA CTCCGTTCGG TAATATTCCC GTAGACACCG AAACGATCGA GAAGTTGCTC
CACACCAAGC CGCAGTCACA TGGCCACTCT CTTACACATG CAAAAGATTC TGTGTTCAAG
TACATGAGTG AAGAGATGGA TGAAGACGAA CATTCGTTTG AAATGCACGC GCCTTTTATC
TACCAAAAGA CCCACGATTT GCCCCAGGGC ATTCCCAAGA TCATTCCCAT ACTTATCAGT
GGAATGGATG AGAAGTTGAA CGATGAGGTG GTGTCGGCTT TGTTGCCCTA TCTCGAAAAT
GAAGAGAACC ACTTCATCAT CAGTCTGGAC TTCTGCCACT GGGGCTCTCG TTTCGGATAC
ACCAAATATG TTCCTCAGAA GGTCGACTCC CTTCAGCTCC TCACCGAAAA CTTATCGAGC
TTGGGCCATT CATTGAGAAC CAAACCCAAC GAATTACCCA TATATAAGTC AATAGAGGTG
TTGGATAAAG CTGCGATGGA AATTGCTTCA CTGGGAAGCT ATTCTGACTG GAAAACCTAC
ATTTCTCAAA CAGGAAACAC TATCTGTGGC CAGAAGCCCA TCGCAGTGGT GTTGAAGTTG
ATTCAAAAGT ATAGATTGGC TGCCGGTGAT ACAGATAAGG CAGCCATCTT TAAGTGGATA
GGCTATTCTC AGAGTAACCA AGCACGTAGG GCTTCGGATT CGAGTGTCTC ATATGCTTCT
GGTTATGTTA CGATTTGA
 
Protein sequence
MSYIRPATHA GSWYSNNPTK LGLQLEAYFH KAESHSGEDS RHIIPGARIL IGPHAGFAYS 
GERLAETFTV WDTSKVKRIF MLGPSHHVYF KNSVMVSQFE WYETPFGNIP VDTETIEKLL
HTKPQSHGHS LTHAKDSVFK YMSEEMDEDE HSFEMHAPFI YQKTHDLPQG IPKIIPILIS
GMDEKLNDEV VSALLPYLEN EENHFIISSD FCHWGSRFGY TKYVPQKVDS LQLLTENLSS
LGHSLRTKPN ELPIYKSIEV LDKAAMEIAS SGSYSDWKTY ISQTGNTICG QKPIAVVLKL
IQKYRLAAGD TDKAAIFKWI GYSQSNQARR ASDSSVSYAS GYVTI