Gene PICST_33021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33021 
Symbol 
ID4839795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1243227 
End bp1244255 
Gene Length1029 bp 
Protein Length342 aa 
Translation table12 
GC content48% 
IMG OID640391110 
Productpredicted protein 
Protein accessionXP_001385933 
Protein GI150866361 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2152] Predicted glycosylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCTCC CAGTAGTCTC TAGAGCCTCA ACTCTCGGCG ACAGGAACTA CAAGGCCCCC 
ACTAAGTTCC CCATCGGGCC GTTTCAGAAG TACGAGAACA ACCCCATTCT CACGCCAAAC
CCTGACAACG AGTTTGAAAG CGCATATATA TATAATGCAA CTGCCATCGT TGTAGACGAC
AAAGTGTACT TGCTTTATAG GGCCCAGAAC GCAGCCAAGT TGTCACTGGT GGGTCTAGCC
TGGTCAACAG ACGGGGTCAA CTTTGTCAGA TACCATAAAC CTATCATCAC AGCCACAGAG
CCCTGGGAAC AGGGTGGAGG AGTTGAAGAC CCAAGAATCG TTAGAGACCC CGTGTCCAAG
CTCTTTATTG TCACGTATAC CGCCTACGAT AAACATTTTG CTCGTCTCTG TGTAGCTACC
TCGGAAGACT TGTTCAACTG GAACAAACTT CCCTCGTTCA TTCCACCAAC TTGGCATGAT
GTCTCATACG ACGGAAATGG AAACCCAAGT ATTCGTCGTC AATGGCTGAA GTCGGGTGCC
ATCTTCACCG AACGGGCTCC AGATGGTAAG TACTACATGA TCTGGGGGGA CAGCGCCTTG
TATTTGGCTG AGTCTGATGA TTTGGTTCAT TGGAAACTAC CTACTCAAGA CTTCAGACAA
GATACCTTTG CTGGAGTCCA GTACGATTTC GAAAGCAAAT TGATTGAGCT GGGTCCCGCA
CCGGTCAAGA TGGGAAATGG TACAAATCAG TGGATCTTCG TCTACAATGC TGATACGACA
GGAACAGACG ACTTGCCTGC TAATACTTAT ACCATCAGTC AGATGCTTGT CGACTACGAC
AACATTAAGG CTGGACCTGT AAAAAGGTTG TCTGAGCCCA TCCTCAAGCC TGAAAAAGAT
AACGAAAAGA ATGGCCAGGT TAACAAGGTT GTATTCTGCG AAGGCATGGT CCAGTTCAAG
GGCAAGTGGT TCTTATACTT TGGCCAGGCA GATTCCGAAT TGGGAGTGGC TATTGCTCCG
GTAGACTAA
 
Protein sequence
MMLPVVSRAS TLGDRNYKAP TKFPIGPFQK YENNPILTPN PDNEFESAYI YNATAIVVDD 
KVYLLYRAQN AAKLSSVGLA WSTDGVNFVR YHKPIITATE PWEQGGGVED PRIVRDPVSK
LFIVTYTAYD KHFARLCVAT SEDLFNWNKL PSFIPPTWHD VSYDGNGNPS IRRQWSKSGA
IFTERAPDGK YYMIWGDSAL YLAESDDLVH WKLPTQDFRQ DTFAGVQYDF ESKLIESGPA
PVKMGNGTNQ WIFVYNADTT GTDDLPANTY TISQMLVDYD NIKAGPVKRL SEPILKPEKD
NEKNGQVNKV VFCEGMVQFK GKWFLYFGQA DSELGVAIAP VD