Gene PICST_50672 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_50672 
Symbol 
ID4841050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp185792 
End bp187081 
Gene Length1290 bp 
Protein Length429 aa 
Translation table12 
GC content43% 
IMG OID640392365 
Producthypothetical protein unknown function 
Protein accessionXP_001386433 
Protein GI150866739 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.482129 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGTAA AAGACTTCCC ACAGATCAAA AGTATCAAAA CTTTCATTTT CAACCAACCC 
GGCGTCGGCG GAGACTACCA TAATGTTGAA AGGGGCCATT GGTTGATCGA CCATCCCATT
GCCAACCCCA TGTCCAAGTT TGAAGAGTAC CGTGCCTCTC GTGTCAGTTG GGGAATCAAT
GTTTTGGGTT CTTTCTGTGT TGAAATCGAA GCAACGAATG GCGTTAAGGG ATTTGCTACT
GGTTTTGGAG GCCCACCTGC TTGCTGGTTA GTAGCCAATC ATTTCAGACG TTTCTTGATT
GGAGCTGATC CAAGAGATAC TACTTTGTTA TGGGATAAGA TGTTTAGAGC TTCCATGTTT
TACGGTAGAA AAGGTTTGAC TGTGGCTGTC ATCAGTGTCA TAGATTTGGC TATCTGGGAC
TTGTTGGGAA AGTTGAGAAA CGAGCCCGTC TACAAGATGA TTGGAGGTGC TACTAGAGAA
AGATTGGACT TTTACTGTAC TGGCTGTAGA CCAGACATAG CTAAGGAAGT TGGTTTCTGG
GGAGGTAAAG TTGCTTTACC TTATGGTCCA GCAGAGGGTC ACGATGGTCT TAGAAGAAAT
GTCGAGTTTT TGAGAAAGCA TCGTAAGTCC GTAGGACCAG ACTTCCCCAT TATGGTAGAT
TGCTACATGT CTCTTAATGT ATCGTATGTT ATCGATTTGG TAAATGCTTG CAAAGACTTG
AACATCAACT GGTTTGAAGA GGTCTTGCAT CCAGATGACT TTGACGGTTT CCAGAAGTTG
AAGAGTGCCT GCCCGTGGAT GAAATTCACA ACTGGTGAAC ATGAGTACTC CAAGTATGGA
TTCAGAAAGT TGATCGAAGG TAGGAATGTA GACATCTTGC AACCTGATAT CATGTGGGTC
GGTGGTCTTA CTGAAATCCT CAAGATCTCT CATCAAGCTG CTGCCTACGA TATTCCAGTA
GTTCCACATG CTTCTGGTCC ATATTCGTAC CATTTTGTAA TCTCTCAAGA AAATACTCCA
TTCCACGAAT ACTTGTCGAA CTCTCCGGAC TCGATGTCTG TGTTGCCAGT ATTTGGGGAA
CTTTTCACCG ATGAACCAGT TCCTACAGAA GGTTATTTGC TGATTACGGA ATTTGACAAA
CCTGGGTTTG GCTTGACTTT GAACCCAAAG ATCGAGTTGA TCAATGGCGA CTGCTTATTA
TCGCCTAATC CAGAAAGACC ATTAAGTATT CAAAATGGAA ATGGACATGC AAAGACCAAT
GGCAATGGAA CCATCAAGAA TGGCCATTAG
 
Protein sequence
MSVKDFPQIK SIKTFIFNQP GVGGDYHNVE RGHWLIDHPI ANPMSKFEEY RASRVSWGIN 
VLGSFCVEIE ATNGVKGFAT GFGGPPACWL VANHFRRFLI GADPRDTTLL WDKMFRASMF
YGRKGLTVAV ISVIDLAIWD LLGKLRNEPV YKMIGGATRE RLDFYCTGCR PDIAKEVGFW
GGKVALPYGP AEGHDGLRRN VEFLRKHRKS VGPDFPIMVD CYMSLNVSYV IDLVNACKDL
NINWFEEVLH PDDFDGFQKL KSACPWMKFT TGEHEYSKYG FRKLIEGRNV DILQPDIMWV
GGLTEILKIS HQAAAYDIPV VPHASGPYSY HFVISQENTP FHEYLSNSPD SMSVLPVFGE
LFTDEPVPTE GYLSITEFDK PGFGLTLNPK IELINGDCLL SPNPERPLSI QNGNGHAKTN
GNGTIKNGH