Gene PICST_33568 details


Gene Information

Locus tag: PICST_33568
Symbol:
ID: 4840608
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 980471
End bp: 981727
Gene Length: 1257 bp
Protein Length: 387 aa
Translation table: 12
GC content: 43%
IMG OID: 640391923
Product: predicted protein
Protein accession: XP_001386201
Protein GI: 150866559
COG category: [R] General function prediction only
COG ID: [COG0724] RNA-binding proteins (RRM domain)
TIGRFAM ID:
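
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes (the record does not state this) that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon; the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of an inclusive, 1-based coordinate span."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


# Values copied from the record above.
start_bp, end_bp, protein_aa = 980471, 981727, 387

total = gene_length(start_bp, end_bp)      # span of the gene on the replicon
coding = coding_length(protein_aa)         # bases accounted for by the protein
non_coding = total - coding                # remainder within the gene span

print(total, coding, non_coding)  # prints "1257 1164 93"
```

Under those assumptions the span matches the listed Gene Length of 1257 bp, and the 93 bp not accounted for by the 387 aa protein would be consistent with the record marking the gene as spliced.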


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0922684
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTAATA CTGACAAGAG CGCAAAGAGT GAGTCCAAGA AGGAGAAGCG TGAGCGTAAG 
CTCGAGAAAA AAACCAAAAA AAGATCTATT GAGGAAGAAG TTGACAAGGT AGCTGCAGAA
GAGAAGAATG AAGAGTCTCA AGAGCCATCT ACCGAAGCTG AAAACACACA AGCCTCTGCT
CCAAAGCATG CCGATTTCGA AGAGTTAGAA ATCGACTTAA GCGCAGGAGT TCCTCTTTCG
AAGAAGCAGC TGCGTTTGTT AAAGAAAGGG AAGTTGGACT TGGAAAGACT TGCTAAGAAG
CATCCAGTTC CCAAACCAGA GCTTACTGAA GAGGAAAAGC TTGCCCAGGA GGAAGAAGAT
AAGAAGAAGT CCAAGAAGTC AGAGTTTGCA GTATGGATCG GAAACTTATC GTTCGATACT
ACTAAAGAGG ACTTGGTTCG TTTTATTGTC GGAAAGACAG CTCACAATGG AGAAGACGAC
TCTCAGTTAA TCAAGATCGA AGAAGCAGAT ATCACCAGAG TGAACTTACC CAAGAAGGAA
AACAAAATCA AGGGATTTGC ATATATTGAT TTGCCCAGTG CCGTACATGT GACCAGTGTA
GTTGCTTTGA GCGAATCTCC TTTGAACGGA AGAAAGTTGT TGATCAAGAA TGCCAACTCG
TTTGAAGGTA GACCAGCTGC TGCTGTTGCT CCCTTGTCGA AAAATCCACC TTCTCGTATT
CTTTTCGTAG GAAATTTGTC GTTTGACACT AGTGAAGATA ACTTGGAAGA ACACTTCCGT
CACTGTGGTG AAATTGTTCG TATAAGAATG GCTACATTTG AAGATACCGG TAAGTGTAAG
GGCTTCGCAT TCATTGACTT TAAAGACGAA ACCGGTCCTA CCGCTGCGTT GAAGTCGAAG
TTGGCCAAGA AGTTGATCAA CAGACCGCTC AGATTAGAAT ATGGTGAAGA CAGATCTAAA
AGAAACCCTA ATCATATCAG AAAGGCAGAA GTTCAAGAAG GAGAAGTTGA CGATTTTGCT
CCTGTCAGAG AAAGACCTGC TGCAAGAGAA AGACCTGCTA GAGAAGCTCC AAATTTCGAG
AATAGTAACT ACGAGAAACC ACAAAGAGCA TCATCAACTC CTAAGAAGAG AGTATTCAGA
GACGATAATC ATAACCACAG CAATAAGAGA GTCAAGTCGT CGGTAGCTTT GGCCACAGCA
CAGAGAGCCA GTGCCGCCAT TGTTCCATCT TCAGGTAAGA AGATCACATT TGACTAG
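
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The sketch below strips the whitespace and computes a GC percentage; it assumes that a GC content figure is the share of G and C bases over the whole sequence supplied, which the record does not spell out. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as a small example.
first_line = "ATGGGTAATA CTGACAAGAG CGCAAAGAGT GAGTCCAAGA AGGAGAAGCG TGAGCGTAAG"

print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence to `gc_percent` gives a figure that can be compared against the GC content listed in the record.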
 
Protein sequence
MGNTDKSAKS ESKKEKRERK LEKKTKKRSI EEEVDKPSTE AENTQASAPK HADFEELEID 
LSAGVPLSKK QSRLLKKGKL DLERLAKKHP VPKPELTEEE KLAQEEEDKK KSKKSEFAVW
IGNLSFDTTK EDLVRFIVGK TAHNGEDDSQ LIKIEEADIT RVNLPKKENK IKGFAYIDLP
SAVHVTSVVA LSESPLNGRK LLIKNANSFE GRPAAAVAPL SKNPPSRILF VGNLSFDTSE
DNLEEHFRHC GEIVRIRMAT FEDTGKCKGF AFIDFKDETG PTAALKSKLA KKLINRPLRL
EYGEDRSKRN PNHIRKAEVQ EGEVDDFAPN SNYEKPQRAS STPKKRVFRD DNHNHSNKRV
KSSVALATAQ RASAAIVPSS GKKITFD
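
The record lists translation table 12. In NCBI numbering this is the Alternative Yeast Nuclear Code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the difference on a 30-base in-frame fragment taken from the gene sequence above. It uses a hand-built codon table rather than any library, models only this single reassignment, and ignores intron removal; the function and variable names are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code, with codons enumerated in TCAG order.
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(ctg_as_serine: bool) -> dict:
    """Codon table; optionally reassign CTG to Ser as in table 12."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AA))
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(dna: str, table: dict) -> str:
    """Translate whole codons from the start of the sequence."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(table[seq[i:i + 3]] for i in range(0, usable, 3))


# In-frame fragment copied from the gene sequence above.
fragment = "AAGAAGCAGC TGCGTTTGTT AAAGAAAGGG"

print(translate(fragment, build_table(ctg_as_serine=True)))   # prints "KKQSRLLKKG"
print(translate(fragment, build_table(ctg_as_serine=False)))  # prints "KKQLRLLKKG"
```

The first result matches the KKQSRLLKKG stretch of the protein sequence above, while the standard code would place a leucine at the CTG position.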