Gene PICST_73446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_73446 
SymbolAPR1 
ID4840317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1630516 
End bp1632226 
Gene Length1711 bp 
Protein Length417 aa 
Translation table12 
GC content44% 
IMG OID640391632 
Productaspartic proteinase precursor 
Protein accessionXP_001385673 
Protein GI150866171 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AAGACTTCTG AATTGTGTTG AAGCATCCTT TGTTATCATT GTTGGTTTAC ACAGATTAGA 
GCAATATCAA TAGTGCCATA GCCTCAGTCT CGTTTTTGCC CCCACACTAT TATCCACTAG
ACTTAATCTG CTAATCTAGA AGAGTATTCT GCCATTGCAT CCAGCCATTG CAGTCATTCC
GGACACTCTT ATAGTGCTTA TAATCATCTA ATTCTGCCAT ACTCCTGGTT TATCATATTG
TTAGTCTTTT AGATTCTTCT CAAACTCCAC TACTTTCAAA AGTCTCATTT CTAAAATGCA
CTTCTCGTTA TCGTTACTCA CCACCATCGC TACTGCTTTG CTTGCTCTTC CTGTTGATGC
CGGAAAGCAC TCTGCCAAGT TGAAGAAGGT TCCTACTGAA GAAACCCTTG ACGCCAACAC
CTTCAGAGAA TACACCGACG CCCTCTCCAA CAAGTACATG AACATGTTCA ACGCTGCTGC
TGGTAACCCG GTCGTTCCTA ACGTTATGGG TATGGCCAAC CAGGCTCAAG TACCATTTGT
TAACCCAGAA GGTAAAAAGG GTGCCCATGA AGCTCCATTG ACAAACTACT TGAACGCTCA
GTACTTTACT GAAATCCTGT TGGGTACTCC AGCTCAGCAG TTCAAAGTCA TTTTGGATAC
TGGGTCTTCC AACTTGTGGG TTCCATCACA GGAATGCTCG TCTTTGGCAT GTTTCTTGCA
TACCAAATAC GACCACGACT CGTCGTCCAC TTACAAGGCT AATGGCTCCG AATTCTCTAT
CCAATACGGT AGTGGAGCTA TGGAAGGTTA CGTCTCCCAA GATACTTTGG CTATTGGTGA
CTTAGTGATT CCAAAGCAAG ACTTTGCTGA AGCCACTTCT GAACCAGGTT TGGCTTTTGC
TTTCGGTAAG TTCGATGGTA TCTTAGGTTT GGCTTATAAT ACTATCTCCG TCAACAAGAT
TGTGCCTCCA GTCTACAACG CTCTTGCACA GGGTTTGTTG GATGAGCCAC AATTCGCCTT
CTACTTGGGT GACACCAAAA AGGACGAAAA TGACGGTGGT TTGGCCACCT TTGGAGGTTA
CGACGAATCC GCTTTCACTG GTAAGATCAC ATGGTTACCT GTCAGAAGAA AGGCTTACTG
GGAAGTTTCC TTTGAAGGTA TCGGCTTAGG TGACGAATAT GCCGAGTTGG ACAACACCGG
TGCTGCTATC GATACCGGTA CTTCGTTGAT CACCTTGCCA TCTTCTTTGG CCGAAATCAT
TAACGCTAAG ATCGGCGCTA CCAAGTCTTG GTCCGGACAG TACCAGATTG ATTGTGAGAA
GCAGGACACT TTGCCTGACT TGACATTGAA CTTTGCTGGG TACAACTTCA CCTTGACCGC
CCACGACTAC ATCTTGGAAG TTGGTGGTTC ATGTATCTCT GTATTCACTC CAATGGACTT
CCCTAAGCCA ATTGGTGACT TGGCCATCAT TGGTGATGCT TTCTTGAGAA GATACTATTC
CATCTACGAC TTGAAAAAGG ACGCTGTTGG ATTGGCTACC TCGAAGTAAG TTTTTAACTA
CGTAACGGCC ATTGGGCTAC ATAATTCTTA AAATTGCTTC TTTCTGATCT ATTGTTTGTT
TTGCTTATCT TGTTGTTTTG TTCTGGTTTG CGATTCCACT CCAGTATTCT CTTATGTATG
CTGAAACAAA ATAATTTACA ATTATATGCA T
 
Protein sequence
MHFSLSLLTT IATALLALPV DAGKHSAKLK KVPTEETLDA NTFREYTDAL SNKYMNMFNA 
AAGNPVVPNV MGMANQAQVP FVNPEGKKGA HEAPLTNYLN AQYFTEISLG TPAQQFKVIL
DTGSSNLWVP SQECSSLACF LHTKYDHDSS STYKANGSEF SIQYGSGAME GYVSQDTLAI
GDLVIPKQDF AEATSEPGLA FAFGKFDGIL GLAYNTISVN KIVPPVYNAL AQGLLDEPQF
AFYLGDTKKD ENDGGLATFG GYDESAFTGK ITWLPVRRKA YWEVSFEGIG LGDEYAELDN
TGAAIDTGTS LITLPSSLAE IINAKIGATK SWSGQYQIDC EKQDTLPDLT LNFAGYNFTL
TAHDYILEVG GSCISVFTPM DFPKPIGDLA IIGDAFLRRY YSIYDLKKDA VGLATSK