Gene PICST_63574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_63574 
Symbol 
ID4840774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp239286 
End bp240560 
Gene Length1275 bp 
Protein Length424 aa 
Translation table12 
GC content42% 
IMG OID640392089 
Productpredicted protein 
Protein accessionXP_001386064 
Protein GI126139083 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3616] Predicted amino acid aldolase or racemase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCCCT TTCAATTTGT TGCCCTTCCA GACAAGGAAG CTCTACTAAG GGCATACAAA 
AATAAAAAGC TTAATGAATT GCCCACTCCA TCTGTTGTTA TCGACCGAGC TGTCTTCCAA
GAGAATTGCG AAAAGATGCT TGCCAACGCC GAAAGATTAA AAGTAGATTT TAGACCGCAT
ATTAAGACCC ACAAAACACT TGAAGGAGCG AGACTACAAT TGGGTTCAGG TAGAAGGAAG
TCAGATAAGA TTATTGTTTC AACTATGATG GAGGCTTGGA ACTTGCTTCC CCTTGTCAAT
GAAGGCTTAT CCGTAACAGA TTTTCTCTAT AGCCTACCAG TAGTAAAACC CAGGGTGGCC
GAATTGGCCG AATTTGCTAC TAAGATACCA CACTTGAGGT TATTAATTGA TCATAGGGAA
CAATTAGATA TCTTATCTGA GTGGAGTGAA GCTCATCCTC ATTCTAAAAG ATGGTCGGTA
TTCATCAAGA TTGACATGGG TACCCATAGA GCAGGATTAA CGAATGAAAG CCATAATCTT
GGTGAAACAC TTCAGCATAT CCTTACGGAT GCCACATCAA GGAAAAATAT TGAGTTGTAT
GGATTCTACT GCCATGCTGG TCATTCTTAT TCTTCAACAA CAGAGGATTC AGCAAAGGAA
CTATTGCTTG AAGAGATTGT CCAAGCAAAC CATGCTGCAA TCGCTGCCAA AAGTATTGAC
CCAAGTTTAC ATTTGAGGCT CTCAGTTGGT GCTACACCAA CTTCTCATGC TTCGGAAATA
CTTACAATCG AAGAATTGGA ATCAGCGTTG GGTCCCAATA GTTTGCAGGG TACATTAGAA
TTACATGCGG GAAACTATTG TTGCTGCGAT TTACAACAGC TTGCAACAGG CTGCATAAGA
GAAGAAAACA TTTCACTTTC GGTTATTGCC CATGTTATAT CTACGTATCC AAAGAGAGGT
GAGAAGACTC CGGGTGAACA GTTAATAAAT GCTGGAGTGG TAGCTTTATC TCGTGAGTCG
GGGCCAATCA TTGGATATGG AAAGGTGATT GAACCTGCCG AGTACAACAA TTGGATAGTC
GGAAGATTAA GCCAAGAGCA CGGTATCCTC GTACCCTTTG ATGAACATCA TGCTACAAAG
TTCATTCCAA TTGGAACTCA AATCAGGATT GTTCCACAGC ACTCTTGTAT TACAGCAGCT
TCTAATCCTT GGTTCTTCAT AGTCGATTCG GGCGACGTAG TTGTGGATGT TTGGGTTCCA
TTTAGAGGAT GGTAA
 
Protein sequence
MYPFQFVALP DKEALLRAYK NKKLNELPTP SVVIDRAVFQ ENCEKMLANA ERLKVDFRPH 
IKTHKTLEGA RLQLGSGRRK SDKIIVSTMM EAWNLLPLVN EGLSVTDFLY SLPVVKPRVA
ELAEFATKIP HLRLLIDHRE QLDILSEWSE AHPHSKRWSV FIKIDMGTHR AGLTNESHNL
GETLQHILTD ATSRKNIELY GFYCHAGHSY SSTTEDSAKE LLLEEIVQAN HAAIAAKSID
PSLHLRLSVG ATPTSHASEI LTIEELESAL GPNSLQGTLE LHAGNYCCCD LQQLATGCIR
EENISLSVIA HVISTYPKRG EKTPGEQLIN AGVVALSRES GPIIGYGKVI EPAEYNNWIV
GRLSQEHGIL VPFDEHHATK FIPIGTQIRI VPQHSCITAA SNPWFFIVDS GDVVVDVWVP
FRGW