Gene PICST_33443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33443 
Symbol 
ID4840610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp658513 
End bp659826 
Gene Length1314 bp 
Protein Length437 aa 
Translation table12 
GC content39% 
IMG OID640391925 
Productpredicted protein 
Protein accessionXP_001386320 
Protein GI150866653 
COG category[R] General function prediction only 
COG ID[COG5273] Uncharacterized protein containing DHHC-type Zn finger 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.137532 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0676408 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGGCC ATCAGAGTCC GATATGGCTG CGCGTGGTCA AAATCGGAGT TCCTATAGGC 
GTAATATTTG GTCTAGCATA CATTGACTTT GCTTCTTTTT ACTCTTTGGG ATACCAGGAA
ATCTACAAAC ACCATTCTAA GGGTATAGCC GTTGCATTAT GGGTACTTGG AGGATTTTCA
CAGTTTGTAA TGTTTTTGTA TTGGATTCTC ATGTTTGTCA TTGGTCCAGG AAAGTCTCCT
GTTTTCCAGC CACTCAACTT GTACGGAACA GATGATCCTA ATTTGATACC ATGTCCAGAT
ATATTTCCCT GCGATGAGTA TGGATACCCA GAATATAATT CCAATGCCAG ATCGATAGTG
ACTGCTCGAA CGTTTTATCT GAAAGATATG GGATATATGG TATTGAAGTT TGACCACTTT
TGTCTCTGGA TAGGCACTGT CGTAGGTGAA ACTAACTACT TGTTCTTCAT GAAGTATTGC
CAGTGGTTTC TAACGTATTT CGTTGTAATA TTAATTTTTT TAATCAGGTA TACACCCCTG
AACATTGGTC GTGGAGGTGA GATCAATCAC AACTTCATCC CGCTATACGT GATGTGTGGA
ATGTGGATCT TGATGATTGG AGCTCTCTTT GGAACTCATT TCTACTACAT TGTGATTAAC
AAAGTGACCC TAGACCAGGT CTCCGAGAAC CAAAAAAGAT CATTTGAAAG ATGGCAGAAT
AGCCACAAGG ACGATAAGAA GAAAAGACCC AAACCTAGAG AGGAGACTGG ATGGAGATAT
ATTAATGTCA AGAAAGACAA CTTGAGGTTA GTTGTACAGT ATGGAATTAA TGATAGAGTG
TATGACATGG GTGCAAAAAA GAACTTCATC AATTTGGTAT ACAATGGAAA TCGTAACCAC
GGTCTAGAGG AGCTGTTTTA TACCACTCAA AAGTATTTTG TTGCTCTTGC AATCTTATTC
ATTCCGTTTG TTGATATCTA CAATGGTTTC AAATATGCAA AAAGACCTCC TATAGACCCA
GAAATCGGTT CACTTCAAGA ACGAAAACGC ATGGAGTTTG AAAGCTATTC TTGTAAACTT
AATGACGATT TTCTCCAGCA CGTATATGGA AAAATCGAAA GAAAGGAATG TTACATAGCC
CAGTATATAA GACTTCCACA GGAACATCAG AAACCACCGG AAGACGAAAG AGAGCCACAA
AAGGAAGCAT CAACGGAGAT TAAATCAGAG ACCAAATCTG AAACAGGCTC CGAAATGGTG
TCTCCTTTGA AGAATAGTAC AGAGTCAGGT TCTCCTAGAT CTTCACAGCA ATAG
 
Protein sequence
MTGHQSPIWS RVVKIGVPIG VIFGLAYIDF ASFYSLGYQE IYKHHSKGIA VALWVLGGFS 
QFVMFLYWIL MFVIGPGKSP VFQPLNLYGT DDPNLIPCPD IFPCDEYGYP EYNSNARSIV
TARTFYSKDM GYMVLKFDHF CLWIGTVVGE TNYLFFMKYC QWFLTYFVVI LIFLIRYTPS
NIGRGGEINH NFIPLYVMCG MWILMIGALF GTHFYYIVIN KVTLDQVSEN QKRSFERWQN
SHKDDKKKRP KPREETGWRY INVKKDNLRL VVQYGINDRV YDMGAKKNFI NLVYNGNRNH
GLEESFYTTQ KYFVALAILF IPFVDIYNGF KYAKRPPIDP EIGSLQERKR MEFESYSCKL
NDDFLQHVYG KIERKECYIA QYIRLPQEHQ KPPEDEREPQ KEASTEIKSE TKSETGSEMV
SPLKNSTESG SPRSSQQ