Gene PICST_58854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_58854 
Symbol 
ID4838381 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp425674 
End bp427218 
Gene Length1545 bp 
Protein Length502 aa 
Translation table12 
GC content43% 
IMG OID640389696 
Productpredicted protein 
Protein accessionXP_001384378 
Protein GI150865242 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3579] Aminopeptidase C 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTCCA ACACATCGAA GGAAGTCAAG ACTACTACCG AGGAAACCTT TAACGAAAAG 
GTTTCGTACT CGTCTTCTCT GTCGCAGTAC GATTCGTATT CACACTTATT GAGTAGAATG
GATGTCTCAA GAGAAGACGT CGAAGCCAAA GAAGATGAAG ACAATGGTGA TGGCAACGGT
ATTTGGCCAG GTATTTCCGT CGGATTGCTT GATGGTTGGA AAGGTGACGT CTTGAGAGAC
GATAAGAACA AATTGGTTCA GAACTCTCTT GCTATCAATC CTATCCAGCT GATTATCGCC
AAGTCAGACG TCGAAACTGT CTTAAAGGAC CAGTACTTCT TCAACGTCAC AGTCAAGACA
ATTGGATCAC CTTCCTATTT CAACAACCAA AAACTGTCCG GTAGATGTTG GATCTTTGCT
GCTTCCAATG TGTTCAGAAC TCTGGTTATC AAGAATTATA ACTTGAAGGA TGATCTGTTC
CAGTTGTCGC AAGCTTATCT TTTCTTCTAT GACAAATTGG AAAAATCTCA TTTCTTCTTG
GATAACATCG CCGACACTGC TGATCACGAC TTGGACTCAA GATTAGTTCA GTATCTTCTT
TCCAGTCCTG TTGGTGACGG TGGTCAATGG GATATGATTG TCAACTTGGT AGAGAAATAC
GGTCTTGTTC CACACCAAGT GTTCCCAGAT AATGCCCAAG CCTCAAACTC TTCTCCTTTG
AACTATTTGG TCACCGAGAA GTTGAGGGAA GCTGCTTTGA TCATCAGAAG ATTGTACCAA
GAAAAGGCAC CTCAGCCTGT CATTGAAATT CTTAAGGGTG CAACCGTCTA TACTGTGTTC
AAGATTCTTT CTTTGGCTTT GGGTTCTCCA CCAAATGCTG ATGAACCTTT CACTTGGGAA
TACATCGACA AGGATGGCAA GTACAAGTCC TATCAAACGA ATCCTAGAGA CTTCTACAGG
GACCATGTCA GACTTGATGC CGCTAAACAC TTCTCGTTGA TCCACGACCC TAGAAACGAC
TATGATAAGT TGTATACTGT GGACAGATTA AACAACATTT TAGGCGGTAA GAAGATCGAA
TACGTCAATA CTGAAATTGA CGAGATCAAG CTGGTTGCTA TCAAGATGTT GAAGGACGAT
GAGCCCATCT TCTTTGGTTC TGATGTAGGC AAGTTTGGTG ACAGGTCTTC TGGTGTTTTG
GACGTTACAG CATACGACTA CAAGTTGGCT TTCAATATCT CCTTGGGTTT GGACAAGGCT
GAAAGATTGA GAACCGGCTC ATCTCAAATG ACCCATGCTA TGGTGATTAC TGGTGTTCAC
CTTGATCCTG TAACTCAGCT TCCTGTCAGA TGGAAGATCG AGAATTCGTG GGGTGATGCC
GTCGGCGACA AGGGTTACTT TGTTATGTCG GACGAATGGT TCAGTGAATA CGTGTTCCAG
ATTGTCACCA ACAAGAAGTA TGCCTCTAAG AAGACATATG ATACTTGGAA GGGTAAAGAC
TTCACTGTCT TGCCTTATTA TGATCCTATG GGCTCATTAG CTTAA
 
Protein sequence
MGSNTSKEVK TTTEETFNEK VSYSSSLMDV SREDVEAKED EDNGDGNGIW PGISVGLLDG 
WKGDVLRDDK NKLVQNSLAI NPIQSIIAKS DVETVLKDQY FFNVTVKTIG SPSYFNNQKS
SGRCWIFAAS NVFRTSVIKN YNLKDDSFQL SQAYLFFYDK LEKSHFFLDN IADTADHDLD
SRLVQYLLSS PVGDGGQWDM IVNLVEKYGL VPHQVFPDNA QASNSSPLNY LVTEKLREAA
LIIRRLYQEK APQPVIEILK GATVYTVFKI LSLALGSPPN ADEPFTWEYI DKDGKYKSYQ
TNPRDFYRDH VRLDAAKHFS LIHDPRNDYD KLYTVDRLNN ILGGKKIEYV NTEIDEIKSV
AIKMLKDDEP IFFGSDVGKF GDRSSGVLDV TAYDYKLAFN ISLGLDKAER LRTGSSQMTH
AMVITGVHLD PVTQLPVRWK IENSWGDAVG DKGYFVMSDE WFSEYVFQIV TNKKYASKKT
YDTWKGKDFT VLPYYDPMGS LA