Gene PICST_39213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_39213 
SymbolSAP7 
ID4851218 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1224409 
End bp1226169 
Gene Length1761 bp 
Protein Length335 aa 
Translation table 
GC content40% 
IMG OID640392926 
Productsecreted aspartyl proteinase 
Protein accessionXP_001387880 
Protein GI126274204 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.464374 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00607516 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TATGCCGTCA ACATTACTGT TGGTACACCA CCAGTACAGC TTCAAGTTGA ATTGGATACT 
GGCTCTTCTG ATCTTTGGGT TATTAGCACA AAAGCGCCTG TCTGTAAAAG ATACAATTGC
AACATTTACG GAGTCTTCGA CAAAGATAAG TCATCGAGTT GGAAAAGTAA TGGCACAGAG
TTTGGAATTA GGTATGTTGA TGGCAGTGGT AACGATGGAG GAATTTATGG ACAAGATACT
ATTAATTTCG GCAACGGGTT GGTGTTGAAC AACGCAACTT TTGCTGTGGC CAATGAAACG
GTTGATCTTG TTGGTATAAT GGGGATTTCC TTTGAATTCA GAGAAGCTGC CACACAGAAA
TATCCAAATA TACCTTCTCT AATGAAACTA CAGGGATACA CCAAGAAAAC AGCTTATTCT
TTATATGTCA CTGATGAAGT CAAACAAGCT GGTTCTATTT TGTTTGGCGG AATTGACCAC
GCAAAATACG AGGGTGAATT GGTTTCCTTG GATATGGTTG CTCTGAATGG AACTTATCGA
GCTTTACAAA CTGAATTAAC TTCCATCAAA ATCTGTATCA ATGGTTCTTG TGCTGATGCC
TCCGAATCAG GTTCAAGTTC TTCAGTGAAT TCATCCGGTA TGCCTTCGTC CATACAATCA
AGCACAAAAA AATCTACGAA AGGTTCGTCC ATATCCAACC CTACGACTAG CCATTCTTTG
ACTAGTCTTG GAGTCTCTAG TTCAACTCCT TTCACTACTT CAATTACTCC CGTATCGTCT
GTTTCAACAG GTTCGGTAGG TTCAACTGCC TCTATAAGCT TAAGCAAGTC TCCTCCCATG
TTGTCTGAAT CTAGTATGTC TATCCGATCT TCATCTTCAT CTTCTTCAAT TGAAGCAGCA
TCCTCTTCAA AGCATACGGT AGATCTGGGA ACTTCTTTCA GTAATGAGAC ACCTTCGACA
TCTAGTTTCT CAAGTAGTCT TGCGCAGCAA ACTGCCAACC CAAGTGCTAC TTATGTCCTT
TCTACTCTTT CGACTGCTTA TGTTTCCTAC TTTACTGAAA TTACCGAAAC TTCTTATTAT
ACCACCACAT CAAGCAGAAT CACTGAATAC TATACTCAGA CTATAAAGAA TTCGATCAGT
GCAAGTAACA GTACCAGTGC TCCAACATGG ATTGAGTACT ACAATCAAAG TTTACAAATG
TTCGAATCTC TTAAAGCGTT GTTGCAAAAA ATGACTGATT TGTACAATCA GCAGAGTCAA
GCTCAAAGTA GCTCCAGTAA CAATAAGAAA AGGCAAGTAA TTGAGCAAAT CCAGAAGTCC
AAGAGAGATG AAAATGATGT TGCTATTGGT ATTCCATTTA CATTCGACTC TGGAGCAGAT
ATAACTATTA TCAGAGAAGA TTTCCTTGAG CAGATATATA AGGTGTTGGC TCCTGGAGAA
GATTTTCAAG TCTATGGTAC ATTAGGAACA TACAAGGTAC CATGTTATTT GAATGACAGT
GGCAATTTCT TGAGTTTCAG CTTCAGCAAT AAAAAGGAAA TCCAAGTGCC AATGAGTGAG
TTTCTTTTGT CGCAAGAAAC CGCTAACGGA ATTCAGTGTG GGTTGAAGAT ACTGACTCCT
TCTCCAGGAT TGGAAGATCA TGGAATCTTT GGAGTTAACT TCTTGAGATC GGTCTACACA
GTTTTCAACT TGGATGATAA GACTATCTCT CTCGCCAAGG TAAAGTATTC TGAAGATGAA
CGGATTACTG CCATAGAGTA A
 
Protein sequence
YAVNITVGTP PVQLQVELDT GSSDLWVIST KAPVCKRYNC NIYGVFDKDK SSSWKSNGTE 
FGIRYVDGSG NDGGIYGQDT INFGNGLVLN NATFAVANET VDLVGIMGIS FEFREAATQK
YPNIPSLMKL QGYTKKTAYS LYVTDEVKQA GSILFGGIDH AKYEGELVSL DMVALNGTYR
ALQTELTSIK IYENDVAIGI PFTFDSGADI TIIREDFLEQ IYKVLAPGED FQVYGTLGTY
KVPCYLNDSG NFLSFSFSNK KEIQVPMSEF LLSQETANGI QCGLKILTPS PGLEDHGIFG
VNFLRSVYTV FNLDDKTISL AKVKYSEDER ITAIE