Gene PICST_37559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_37559 
Symbol 
ID4851544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2093775 
End bp2094812 
Gene Length1038 bp 
Protein Length345 aa 
Translation table 
GC content44% 
IMG OID640393252 
Productpredicted protein 
Protein accessionXP_001388030 
Protein GI126274804 
COG category[R] General function prediction only 
COG ID[COG3491] Isopenicillin N synthase and related dioxygenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.960018 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.136249 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAATG CTGCTGTCAG AACTCCGGTA GTAGTTTCCT TAAAGGAATT AGTCCAGGGA 
ATTGACCATG CTACTTTGGC AGAAGCTTTT GGACCCCAGT CGTTAGGAAT CATCGTTATC
AAAGATTTAC CCCAGAAGTT TCATGATCTC AGATTGAAGG TGTTGAAGTC GATTTCGATA
TTGGCCAACT TGGGGCCTGA CGTGTTAAGT AATCTAGAAT CAGAGGAGGC AATGTGGTTA
ACAGGCTGGT CTTGTGGTAA AGAAATATTG GCTAACTCAG GAAAACCAGA CTTTAACAAA
GGTTCCTACT ATGTGAACTG TGCCTTCCAC AAGAATCCTG AGTGGGAAGG ACCGACGGAA
AAATTGACCA AAGAGTTCAT CAACCACAGG GCATACACCA CAGCCAATAT GTGGCCTTCT
GCAGATCACA AAGGTCTTGA AAATTTTCAA GAAGATGCTA AGGAGCTTAT TAGCTTAATC
ATAGATGTAG CCCAATCTGT AGCTGCTAAT TGTGACAAAT TCATTACAGA GAGCAAAATC
TCTCCCAACT ACGAACAAAA CTACTTAGAA CGAATTGTGA AAAACTCGAC TTGTACGAAG
GCAAGGTTAC TCCATTATTT TCCACTGAAG TCGTCGTCGG AATCGGGCAA AGATGATGAC
TGGTGCGGTG AGCATTTGGA CCACTCTTGT CTCACAGGAT TGACATCTGC TTTGTTCATC
GACGAATCTA AGGGTCTAAC CGCTGCTCTT GATAAATCCC CAGACCCTGA ACTGGGTTTG
TACATTCGTG ACAGACAGAA TGAAGTGGTT AAAGTGAACA TTCCTCCCGA ATGTCTTGCT
TTCCAGACTG GATCTACTCT CCAGGAAGTT TCTCGAGGAA AATTCCTGGC AGTACCCCAC
TATGTCAAAG GAACTTCGAT TCCAAATATC GCTAGAAACA CTTTGGCTGT GTTCTGCCAG
CCAGACTTGG ACGAAATGGT TAATGATTCT GAGAACTTTG CCCAGTATGC CGATAGAATT
CTCAAGGCCA ACCACTAA
 
Protein sequence
MTNAAVRTPV VVSLKELVQG IDHATLAEAF GPQSLGIIVI KDLPQKFHDL RLKVLKSISI 
LANLGPDVLS NLESEEAMWL TGWSCGKEIL ANSGKPDFNK GSYYVNCAFH KNPEWEGPTE
KLTKEFINHR AYTTANMWPS ADHKGLENFQ EDAKELISLI IDVAQSVAAN CDKFITESKI
SPNYEQNYLE RIVKNSTCTK ARLLHYFPLK SSSESGKDDD WCGEHLDHSC LTGLTSALFI
DESKGLTAAL DKSPDPELGL YIRDRQNEVV KVNIPPECLA FQTGSTLQEV SRGKFLAVPH
YVKGTSIPNI ARNTLAVFCQ PDLDEMVNDS ENFAQYADRI LKANH