Gene PICST_36557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_36557 
Symbol 
ID4840122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp279889 
End bp281094 
Gene Length1206 bp 
Protein Length401 aa 
Translation table12 
GC content43% 
IMG OID640391437 
Productpredicted protein 
Protein accessionXP_001385393 
Protein GI150865963 
COG category[R] General function prediction only 
COG ID[COG3491] Isopenicillin N synthase and related dioxygenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.325381 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGCAC TATCTGCTGA GGAATTAGAT AGGAAGTATA ATGTGCGTCC GTTTGTTGAC 
CCTGAGCCAA CTAAAATCGA TGTGAATCCT TTAGTTTTAA CAGCAATAGA CCTTTCCTTA
TTCAAAGAAG GGGATGAACA TCTAAACGAA AGAAAGAGTT TAGCAAAGGT GCTAGAATCA
TCTGTAACCA CATATGGTTT TTTCAATTTG GTCAATTTTG GTATACCTAA AGAACGAATT
GAACATATAA GAGCTATTAG TCAGAGCTTG CTTACAATTC CATACGAAGA AAAGTTGAAG
TATTTGGCGA GTGCCGCTAC TAAAGAAGAA GAGAAGCCTA AAAGCATAGG TGCCGAGCGT
GGCCAGGGAT TTAAGCCAAA AGGCTACTGG TCCATCAAGA ACGGAATTCG AGACTCGATT
GATTTCTATA ATGTCAGAGA TACTTATCAT GATTCGTTCT TAGAGACTCC GGAAGCACAC
CCTGAGTTGT TGCAAGTCCA TCTCAAAGAA GTGGCAGACT ATTATAACCA TTTACATAGA
GTTGTATTGC CCAAACTTTT GCGATTGTTT GACTTAATTT TCAAGATTCC CGAAGGTACC
TTGTTGAAAC GGTATTTCCA CAAAAGTGGA ACAAACGAAG ATACGTCAGG CAGCCATGGT
CGTCTTATGT TGTACCGGCC ATATGAGAAT CAACAAGAGT TTGAGCAGAC AGACAAGATG
TTCTTGCGTG GGCACTCAGA TATTAGCGCG CTTACCTTTA TAACTTCCCA GCCAATATTG
GCCTTGCAGA TTATGGATGT CTACACTGGA GCGTGGAGAT ATGTTGCTCA TCGCGACGAC
TCTTTGATTG TCAATATTGG GGATGCGCTT GAGTTCATCA GCGGTGGTCA TTTCAAGGCT
TGTCTCCATA GAGTGGTCGA GCCTCCTGCG GATCAGAGAG GGTTTAATCG GCTTGTGGTT
ATTTACTTTT GCAACCCAAG TGACAACTCC GAGATGGATC CCGAGCTCTT GGACTCTCCT
GCATTACGCA GATTGGGGTA CACCAGGGAG GATAAGTTGA AGCAATGGGA AAAGATCCAA
TTCCATGACT GGAACACTAC GAAAGGCGAG CTCCTTGGGA GAACCGCAGC TGGTGAGAGA
AATCTACTTC AGTATCACGG AAGGTACATT GAGAGGTGGC ACCGATTTTC GGAATTGGCA
AATTAG
 
Protein sequence
MSALSAEELD RKYNVRPFVD PEPTKIDVNP LVLTAIDLSL FKEGDEHLNE RKSLAKVLES 
SVTTYGFFNL VNFGIPKERI EHIRAISQSL LTIPYEEKLK YLASAATKEE EKPKSIGAER
GQGFKPKGYW SIKNGIRDSI DFYNVRDTYH DSFLETPEAH PELLQVHLKE VADYYNHLHR
VVLPKLLRLF DLIFKIPEGT LLKRYFHKSG TNEDTSGSHG RLMLYRPYEN QQEFEQTDKM
FLRGHSDISA LTFITSQPIL ALQIMDVYTG AWRYVAHRDD SLIVNIGDAL EFISGGHFKA
CLHRVVEPPA DQRGFNRLVV IYFCNPSDNS EMDPELLDSP ALRRLGYTRE DKLKQWEKIQ
FHDWNTTKGE LLGRTAAGER NLLQYHGRYI ERWHRFSELA N