Gene PICST_35359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_35359 
Symbol 
ID4837848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp27509 
End bp28783 
Gene Length1275 bp 
Protein Length424 aa 
Translation table12 
GC content42% 
IMG OID640389163 
Productpredicted protein 
Protein accessionXP_001383278 
Protein GI150864455 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3616] Predicted amino acid aldolase or racemase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTACC CTGCTCAATT CATAGCAAAA CCAAGCAAGG AAGCTTTACT CGATGCTTTC 
AAGGGGAAGC TGATATCCAG CTTGCCTACC CCTTCTTTTC TTATCAATGA AGATATATTC
ACCAAAAATT GCAATAGAAT GCTTCAGAAT ACTTCGCACT TGAGTGCCGA TTTCCGAGCT
CACGTCAAAA CCCACAAGAC AGTAGAAGGC ACTCGTTTGC AATTGGGAGA AAAGTCACCG
ATCAAAACCG ACAAAATTGT GGTCTCCACG TTAGTGGAAG CTTGGAGTCT AATGCCACTT
GTGGAAGAGG GCTTGATTAG CGATATTTTG TTTAGTTTGC CTGTTGTGAA GTCCAGGCTC
CCTGAATTGG CCGAATTGGC AAACAAAGTT CCCCACTTAC GGTTGATGCT CGATGGATCA
GATCAGTTGG AATTGTTGGC AGACTTCTCC AGAGAATTTT CCATAAAAGC AAAATGGTCT
ATCTTCGTTA AGATCAATAT GGGAACAAAC AGAGCTGGCT TAGTCAACGA ATCCACCTCT
TTAGAGAATA CTTTACAAAA ACTCTTAAAG GATGATAAAA TTAGCGAGTT TGTGGACTTA
TATGGGTTCT ACTGTCATGC TGGTCATTCG TATAGTGCGG ATTCTCCTCT GTCAGCTAAA
GATTTCTTAA TTCAAGAGAT TATCCACGCT AATCAGGCTG CAAAAGGAGC ACTCCAAATA
CAGCCAGGCT TGAAACTCCA AATTTCTGTC GGTGCAACGC CTACAGCTCA TTCTTCGGAA
CACTTGAATA CAGATGAATT GATAGCAGCT ATCGGAGATG AACTTTCAGG AAAATTGGAA
TTACATGCTG GTTGCTATCC ATGTTGTGAC TTGCAACAAG TTTCTACTGG TTGTGTTACG
CTTGAAGAGG TGTCCATTTC TTTATTGGCC GAGGTTATCT CAATTTACCC GAACAGAGGT
TCCAAGGCTC CAGGGGAACA ACTTGTCAAT GCTGGAGTTT TGGCCTTATG TCGAGAATTT
GGACCTTTAC CAGGCCATGG TAGAGTGGTT GATCCTCCAG GACTTGAAAA TTGGATTGTT
GGTAGATTGA GTCAAGAACA TGGGATCTTA GTTCCACTTG ATGAAAACCA AGTTAATGAC
TTTATTCCTT TGGGAACCAA AGTAAGAATT GTCCCACAAC ATTCTTGCAT CACAGCAGCA
GCTCATCCTT GGTACTATAT AGTAGACTCC AGTAATAGTG TAGTTGACAT TTGGATACCA
GCTAGAGGAT GGTAG
 
Protein sequence
MSYPAQFIAK PSKEALLDAF KGKSISSLPT PSFLINEDIF TKNCNRMLQN TSHLSADFRA 
HVKTHKTVEG TRLQLGEKSP IKTDKIVVST LVEAWSLMPL VEEGLISDIL FSLPVVKSRL
PELAELANKV PHLRLMLDGS DQLELLADFS REFSIKAKWS IFVKINMGTN RAGLVNESTS
LENTLQKLLK DDKISEFVDL YGFYCHAGHS YSADSPSSAK DFLIQEIIHA NQAAKGALQI
QPGLKLQISV GATPTAHSSE HLNTDELIAA IGDELSGKLE LHAGCYPCCD LQQVSTGCVT
LEEVSISLLA EVISIYPNRG SKAPGEQLVN AGVLALCREF GPLPGHGRVV DPPGLENWIV
GRLSQEHGIL VPLDENQVND FIPLGTKVRI VPQHSCITAA AHPWYYIVDS SNSVVDIWIP
ARGW