Gene PICST_33243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33243 
Symbol 
ID4840569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp105337 
End bp106446 
Gene Length1110 bp 
Protein Length369 aa 
Translation table12 
GC content46% 
IMG OID640391884 
Productpredicted protein 
Protein accessionXP_001386035 
Protein GI150866434 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0434678 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.79491 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAGACA GCCAAAGTTT AACCGTCGGA CTAGCAGTGG GCATCCCCTC GTTCGTTATC 
ATCTCCGTGG TGCTCCTCTT GTGGTTACGT AATCAGCGCA AACAAAAGAA GGAAGACTCT
GTCAACGACG ACATCGACAT GGACCTCAGA GACGACCAGT CCTTCAACCA GTTCCAAGAA
GAACTACATC GGCCTTACGC AAAAGGGAAA GCCGAACTAA ACATAAACAC ATACAGCGAC
CCCAATAAGT CGTCATCAAC AACGCAGGGT GAATCCTCTA CAGAAAAACC CTACATCTCC
TCGCATGGAT CTACATCATC CAGCAACATA ATTGACCAGC CCCACGTCTA TGCGACTCCA
CCACGTCCTA AATACAGTAA TGCTAACAAC GGCACTAATA ACGTAAACAA TAACAACATC
AACAACTTAA ACAACAACTT AAACAACAAC ATCAATAATA CTAATAATAT TCATCTGAAA
TCTCCTTCGG CGTACGATTT CTACGAGACC TTTATTCCTA TATTACCAGG AGGAGGTAGT
CCCTCCACTA ACAACGGCAA CACAGCGCAC CATTCCAATG CGAACAGCGT CCACGAGGCT
CACAGTAACG GTAATTTGCA AAGCCAACTC CAGCAGCCTC CTCATATTCA CGATGTTGCG
TCTACAAACA ACAGCTCTAA CGATAGTTTG AACGGCAACG ATAGATCGTC TCTAGATAAT
TTGGCTAAAC AGTTAACAAG CCCCGTCTTC TTTGAAAAGT TGCCCTCGCG TGCCACCACG
GTAGCGTTGA AACCTCGTTT CCCCAACATG CAATCGAACA ACTCTTCTAG TGAAGGCTTG
AACAATAGAT TGATCGGCGA CACTACTGCT CTCAATGACA ACTTCATCTA CGAAGCACCT
ACAGTGGATG TGAAAAAGAC AGAGTTGCAG GCCAAAGATT TCCAGCGCAG TCATCTTTCA
CGTGAACAAA CAGTGAAGTC TCATGAAGAC GATGTTTCGT CTATACTGGC CAGTCACGCT
GAAGACGTCG ATTCTTTGGT GGAGCCTGAT GTGTCGAGGT CTTCGAATGA CGACGAATTC
GTTTCGGATA TCGACACTAC CGCCAGTTGA
 
Protein sequence
MVDSQSLTVG LAVGIPSFVI ISVVLLLWLR NQRKQKKEDS VNDDIDMDLR DDQSFNQFQE 
ELHRPYAKGK AELNINTYSD PNKSSSTTQG ESSTEKPYIS SHGSTSSSNI IDQPHVYATP
PRPKYSNANN GTNNVNNNNI NNLNNNLNNN INNTNNIHSK SPSAYDFYET FIPILPGGGS
PSTNNGNTAH HSNANSVHEA HSNGNLQSQL QQPPHIHDVA STNNSSNDSL NGNDRSSLDN
LAKQLTSPVF FEKLPSRATT VALKPRFPNM QSNNSSSEGL NNRLIGDTTA LNDNFIYEAP
TVDVKKTELQ AKDFQRSHLS REQTVKSHED DVSSISASHA EDVDSLVEPD VSRSSNDDEF
VSDIDTTAS