Gene PICST_31468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31468 
Symbol 
ID4839050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp884156 
End bp885181 
Gene Length1026 bp 
Protein Length341 aa 
Translation table12 
GC content43% 
IMG OID640390365 
Productpredicted protein 
Protein accessionXP_001384472 
Protein GI126135896 
COG category[R] General function prediction only 
COG ID[COG3491] Isopenicillin N synthase and related dioxygenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.561665 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAAA TTGATGGAAA CCCTTTGAAG ATTGTAGATA TCACCTCAGT TGATAAATCA 
ACTGCTGAAG AATTGTATGA CGCCGCTACG TCCCAGGGGT TCTTATTTGT TGAAGGTCAC
GAGTTCAGCC AGGAAGAAAT AGATACTGTA TTCCAACTTT CTAAGGAATT CTTTGAGTTA
CCCCATAGTT ATAAGTCAAA GTATCCAATA GGCTCTATGA ATCATGGATA CGCAGACTTT
GGAGGCGAGA ACTTGGATCC TAAGGGCCAG AAGAAGGGAG ATCCCAAGGA AGCTCTCAAT
ATCTGTTTGT TAAACTTCTT GACAGGTTTA TCATCCCAAG AGATTCCGGA CTGGTTTACG
GAGGATCCCA AGAGGTTGGC GATTATCACC ACAACTGTAA AGAAGTTCTA TGCCTTGTCG
ATGAAGATCT TGAAGTTGTT GGCTATTGGA TTGAAAATAG AGGATTCCAA CGAAATCAAA
GGTGAAGACT GGTTTTCTTC AAGATATGAA GCCACTAAAG TCTCAGGTTC TACTTTCAGG
TTTTTGCATT ACCCAGGCCA AAAGAGTTTG AACCCAGAAG CTGTGATCAG AGCTGGTGCC
CATACCGATT ATGGATCTGT GACATTATTA TTCCAGCAGG AGAATCAGGA AGGACTAGAG
ATCTACTCAC CGGTATCAAA GCAATGGGTT GCGGTTCCTT TTGTAGCTGC TAATACAGAA
AAGTTTCCAG GAATGGGCCC TCCTATTGTA GTTAATATTG GAGATTTATT AAGTTACTGG
ACAGCTGGTT TGTTGAAGTC AACTATTCAC AGAGTCAAGT TTCCGGCCAA AGTTCAAGCC
ACCGGCCAGG ATAGATACTC GATTGTATTC TTTAGTCATC CTAACGATGA GGCGTTGTTA
GAGGCTGTAC CTAGTGAGGT GGTGAGAAGT ATCAAGGGAA GAGGAGCCAA TAAGGATACT
GTTGCCATCA CAGCTAAAGA GCATTTGGAC AGTAGGCTTG CAGCAACATA CGGCTGGAAG
AAGTAG
 
Protein sequence
MAEIDGNPLK IVDITSVDKS TAEELYDAAT SQGFLFVEGH EFSQEEIDTV FQLSKEFFEL 
PHSYKSKYPI GSMNHGYADF GGENLDPKGQ KKGDPKEALN ICLLNFLTGL SSQEIPDWFT
EDPKRLAIIT TTVKKFYALS MKILKLLAIG LKIEDSNEIK GEDWFSSRYE ATKVSGSTFR
FLHYPGQKSL NPEAVIRAGA HTDYGSVTLL FQQENQEGLE IYSPVSKQWV AVPFVAANTE
KFPGMGPPIV VNIGDLLSYW TAGLLKSTIH RVKFPAKVQA TGQDRYSIVF FSHPNDEALL
EAVPSEVVRS IKGRGANKDT VAITAKEHLD SRLAATYGWK K