Gene PICST_34797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_34797 
Symbol 
ID4837262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp199313 
End bp200581 
Gene Length1269 bp 
Protein Length422 aa 
Translation table12 
GC content39% 
IMG OID640388577 
Productpredicted protein 
Protein accessionXP_001382803 
Protein GI126132556 
COG category[R] General function prediction only 
COG ID[COG3491] Isopenicillin N synthase and related dioxygenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.242371 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTACAA CTGAAAATTT GAATATTGAC GAAATTGACA AGAAGTACAA TTTGAGACCA 
TTTGTAGAGG CTACTCCGTC AGATACTGCT GCAGAAGTTA TCCAATTGAA CTCACTAGAT
TTGTCTCTTT TCCAGGAAGG ACCAGATTTC TTGGATCAGA GAAAAAAACT TGCTACCCAG
CTTGAAGAGT CTCTCTCAAC TGTAGGATTT TTCGCTTTGG TTAACCATGG AATCAGCCAA
GACACGTTTG ACCAATTGAG GTCTGTTGCT CAATCCACGT TTGAGTTGCC GGATCAGGAA
AAGAAGAAGT ACTTGTCTGG AGCATTGACT TCTGATACAG AAGACAGAAG TGTTTCATTA
GGTGCGGAAA GAGGTGCCGG ATTCAAACCA AAGGGATATT GGTCTATGAA GAACGGAGTC
AAAGATAGTA TTGAATTGTA CAATTTCAGG GACTTGCAAC AAAGGGAAGT TTATGATTCC
TCCAAGCCCT ACCCAGAGAT AGTGAAAGCA CATCTTCCAA ATGTTGTGAG CTATTTTAGA
TTCATACATG GCAATATCTT GAAGAAGTTG ACTATTTTAT GTGATATTAT ATTAGAGCTT
CCAGAAGGTT ACTTGTGGGA GAACTACTTC AAGGTTGTGG ATGGTGATTC CTATAATTCA
GGAAGTGGAT TCGGAAGATT CATGATCTAC CATGCTTTGA ATCCTGAAGA TGAAGCAAAA
GTTGATAACA ATTGGCTCCG TGGACATTCT GATGGCACGG CGTTCACATT TATTACATCC
CAGCCTATCT TGTCATTACA GATAAGAGAC TATTATACTG GTGATTGGAA GTATGTTGGC
CATACACCTA ACGGACTTAT TGTTAATATA GGCGATGCAT TGGAATTTAT AACTGGTGCA
TACTTCAAGT CTTCTATACA TCGAGTCGTA ACCCCACCTG ATGATCAGAA AAATTTTAAA
AGATTGGTAA TCATTTACTT CTGTGATCCC AAGCTTCCTT CTATTCTCGA TCCCGAGCCA
TTGAATTCTC CAAAATTGAA AAGATTGGGA TACAGAAAAC ACGATGAATG GGAAAGGATT
ACATTCCAGC AATGGGACGA GGAAAAAGGT AGATTATTTG GAAGGAGTGA CGTAAACGAT
GCCAAAAGTG ACGAACCAAA CTTGGTGCTA CTCTACGGAA GACTACATGA AAGGTGGCAT
CAAGCAGAAC ACAATTTCTC TCTTGAAGAA GCTAGGAAGA AGTATAAGGT AATTGAAAAC
AAAAGTTAA
 
Protein sequence
MATTENLNID EIDKKYNLRP FVEATPSDTA AEVIQLNSLD LSLFQEGPDF LDQRKKLATQ 
LEESLSTVGF FALVNHGISQ DTFDQLRSVA QSTFELPDQE KKKYLSGALT SDTEDRSVSL
GAERGAGFKP KGYWSMKNGV KDSIELYNFR DLQQREVYDS SKPYPEIVKA HLPNVVSYFR
FIHGNILKKL TILCDIILEL PEGYLWENYF KVVDGDSYNS GSGFGRFMIY HALNPEDEAK
VDNNWLRGHS DGTAFTFITS QPILSLQIRD YYTGDWKYVG HTPNGLIVNI GDALEFITGA
YFKSSIHRVV TPPDDQKNFK RLVIIYFCDP KLPSILDPEP LNSPKLKRLG YRKHDEWERI
TFQQWDEEKG RLFGRSDVND AKSDEPNLVL LYGRLHERWH QAEHNFSLEE ARKKYKVIEN
KS