Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_73446 |
Symbol | APR1 |
ID | 4840317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 1630516 |
End bp | 1632226 |
Gene Length | 1711 bp |
Protein Length | 417 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640391632 |
Product | aspartic proteinase precursor |
Protein accession | XP_001385673 |
Protein GI | 150866171 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AAGACTTCTG AATTGTGTTG AAGCATCCTT TGTTATCATT GTTGGTTTAC ACAGATTAGA GCAATATCAA TAGTGCCATA GCCTCAGTCT CGTTTTTGCC CCCACACTAT TATCCACTAG ACTTAATCTG CTAATCTAGA AGAGTATTCT GCCATTGCAT CCAGCCATTG CAGTCATTCC GGACACTCTT ATAGTGCTTA TAATCATCTA ATTCTGCCAT ACTCCTGGTT TATCATATTG TTAGTCTTTT AGATTCTTCT CAAACTCCAC TACTTTCAAA AGTCTCATTT CTAAAATGCA CTTCTCGTTA TCGTTACTCA CCACCATCGC TACTGCTTTG CTTGCTCTTC CTGTTGATGC CGGAAAGCAC TCTGCCAAGT TGAAGAAGGT TCCTACTGAA GAAACCCTTG ACGCCAACAC CTTCAGAGAA TACACCGACG CCCTCTCCAA CAAGTACATG AACATGTTCA ACGCTGCTGC TGGTAACCCG GTCGTTCCTA ACGTTATGGG TATGGCCAAC CAGGCTCAAG TACCATTTGT TAACCCAGAA GGTAAAAAGG GTGCCCATGA AGCTCCATTG ACAAACTACT TGAACGCTCA GTACTTTACT GAAATCCTGT TGGGTACTCC AGCTCAGCAG TTCAAAGTCA TTTTGGATAC TGGGTCTTCC AACTTGTGGG TTCCATCACA GGAATGCTCG TCTTTGGCAT GTTTCTTGCA TACCAAATAC GACCACGACT CGTCGTCCAC TTACAAGGCT AATGGCTCCG AATTCTCTAT CCAATACGGT AGTGGAGCTA TGGAAGGTTA CGTCTCCCAA GATACTTTGG CTATTGGTGA CTTAGTGATT CCAAAGCAAG ACTTTGCTGA AGCCACTTCT GAACCAGGTT TGGCTTTTGC TTTCGGTAAG TTCGATGGTA TCTTAGGTTT GGCTTATAAT ACTATCTCCG TCAACAAGAT TGTGCCTCCA GTCTACAACG CTCTTGCACA GGGTTTGTTG GATGAGCCAC AATTCGCCTT CTACTTGGGT GACACCAAAA AGGACGAAAA TGACGGTGGT TTGGCCACCT TTGGAGGTTA CGACGAATCC GCTTTCACTG GTAAGATCAC ATGGTTACCT GTCAGAAGAA AGGCTTACTG GGAAGTTTCC TTTGAAGGTA TCGGCTTAGG TGACGAATAT GCCGAGTTGG ACAACACCGG TGCTGCTATC GATACCGGTA CTTCGTTGAT CACCTTGCCA TCTTCTTTGG CCGAAATCAT TAACGCTAAG ATCGGCGCTA CCAAGTCTTG GTCCGGACAG TACCAGATTG ATTGTGAGAA GCAGGACACT TTGCCTGACT TGACATTGAA CTTTGCTGGG TACAACTTCA CCTTGACCGC CCACGACTAC ATCTTGGAAG TTGGTGGTTC ATGTATCTCT GTATTCACTC CAATGGACTT CCCTAAGCCA ATTGGTGACT TGGCCATCAT TGGTGATGCT TTCTTGAGAA GATACTATTC CATCTACGAC TTGAAAAAGG ACGCTGTTGG ATTGGCTACC TCGAAGTAAG TTTTTAACTA CGTAACGGCC ATTGGGCTAC ATAATTCTTA AAATTGCTTC TTTCTGATCT ATTGTTTGTT TTGCTTATCT TGTTGTTTTG TTCTGGTTTG CGATTCCACT CCAGTATTCT CTTATGTATG CTGAAACAAA ATAATTTACA ATTATATGCA T
|
Protein sequence | MHFSLSLLTT IATALLALPV DAGKHSAKLK KVPTEETLDA NTFREYTDAL SNKYMNMFNA AAGNPVVPNV MGMANQAQVP FVNPEGKKGA HEAPLTNYLN AQYFTEISLG TPAQQFKVIL DTGSSNLWVP SQECSSLACF LHTKYDHDSS STYKANGSEF SIQYGSGAME GYVSQDTLAI GDLVIPKQDF AEATSEPGLA FAFGKFDGIL GLAYNTISVN KIVPPVYNAL AQGLLDEPQF AFYLGDTKKD ENDGGLATFG GYDESAFTGK ITWLPVRRKA YWEVSFEGIG LGDEYAELDN TGAAIDTGTS LITLPSSLAE IINAKIGATK SWSGQYQIDC EKQDTLPDLT LNFAGYNFTL TAHDYILEVG GSCISVFTPM DFPKPIGDLA IIGDAFLRRY YSIYDLKKDA VGLATSK
|
| |