Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_77752 |
Symbol | AAP1 |
ID | 4838617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 1193752 |
End bp | 1196735 |
Gene Length | 2984 bp |
Protein Length | 890 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640389932 |
Product | arginine/alanine aminopeptidase |
Protein accession | XP_001384182 |
Protein GI | 150865102 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.204376 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AGCAACGCAT AGTTCTGTTT CTTTGTTTTG AGTCTCATAT CTGTGAAGCC GCTAGCAAAA GCTGAACGTA AACTTGTTGC AATTGAATTT GTCAAATCTT CAATTCATCT ATAGCAAGTC TATTACTAAA AGTAACTGAC CAACAGAATT TTGAAGCCCT ACTGTCAATT GATCTAATTG CCACAGTATT AAAGTCTCGC TGTTCTGATA TTGTATAATC AAAGAGGTAT TTCTTGTATT TTCGTACTTG CTTTGCTTCT TCTCATCTAA TGTGCGCAAC TACGAAGCCA TACTATGAGG CTCTTCCTGC CAGCCTCAAG CCAGTCCATT ACGACTTGTC TATTTCTGCA ATCGACGTAG CTGCTGAAAC CTTCAAAGGC AAAGTATCCA TCAACTTGGA CATTGTAGAA GAAACTGATG AGTTACACTT GAACTACAGA GACTTGACTG TGACTAAAGA AGACATCGAA GTCACCTTGA TCACCAGCGA TGACAAATCT TCCTCTGTCA ACATCGTCTC ATTGACTGAA TTTAAGGAAA AGGAATTTTT CATCATCAAG TTTGCTGAAA AGGTACAACC AGCTGCTGGT GCAAAACTCT TAGTTACTCT TCACTACAAT GCTATCATCC AGACCAACAT GGCAGGTTTC TATAAATCAG GTTATACCGA AGATGGCGTC GAAAAGTTCA TGTTGTCCAC ACAATTCGAA GCTACTGATG CCAGAAGAGC TTTCCCATGT TTGGATGAAC CCTCATTGAA GGCCACCTTC ATTGTGGATG TCACTGTTCC TGGTCAATGG ACTGCTTTGG GTAATACTCC TGTAGCTGAA TCCGAAGACA TTGTAGATAA AAACCTCAAG AAAGTCACAT TTGAAAAGAC ACCCATCATG TCCACATATT TGTTGGCTTG GGCTACGGGT GAATTCGAGT ATATCGAGTC TTTCACGGAA GAAAACTACG TGGACAACAA GCCTTTGCCT GTTCGTATTT ATACGACAAA GGGATATCTT GAAGATGCGA AGTTAGCTTC TGAAATTGCT CCAAAGATCG TAGACTACTT TTCCAAGATT TTCGAAATCA AGTACCCCTT GCCCAAATTG GACTTGATAG CGGTGCATTC CTTCTCGCAC AATGCCATGG AAAACTGGGG GCTTATCACC TATAGATCAA CGGCCTTGTT GTACTCAGAG GAGAAGTCAG ATCCTTCTTA CAAACAGAAG GTAGTCTATG TTGTAGCCCA CGAATTGGCC CATCAGTGGT TTGGTAATTT GGTTACCATG AAGTGGTGGG ATGAACTTTG GCTTAACGAA GGGTTTGCTA CCTGGGTTGG ATTTGCAGCT GTAGAGTACT TGTACCCAGA ATGGAATATT TTCAGCGGGT TTGTCTCTGA GTCGTTGCAG CAGGCTCTTA ACTTGGATGG ATTGCGAAAC TCGCATCCTA TCGAAGTACC TGTTATCGAC GCATTGGACA TCGACCAGTT GTTTGATGTT ATCTCTTACT TGAAGGGTGC CTCTACTATT CTAATGATTT CTAATTACTT GGGAAAGGAA GAATTCCTCA AGGGTGTGGC TCTCTACTTG AACCGCAACA AGTTCGGCAA CGCCAGCTCT CACGACTTGT GGAGTGCCGT AGGCGAAGTG AGTGGTAAAC CCATCGACAG CTTGATGGAA TCCTGGATCA AAAAGGTTGG GTTCCCAGTT GTTAGCGTAG ACGAAGACAA AAACAACTTA GTGCTTAACC AATCGCGTTT CTTGAACAGT GGAGACATAA CCGATGCTGA GAATGACACC AAGTGGTGGA TTCCACTTAA CATCACGACG GATTCCACTT CTGTAAGAGA CATTTCTGTT GACTCTTTTG ACTCGGAAAA GTTGATCATT GAGAATTTTG CCTTGAAAAA TGACTTTTTC AAGTTGAACA AGGACACGAG CGGAGTCTAC AGAGTTAACT ACTCCAGCTC TATCTTGGAG AAGAACATTC TTCCACACTT CAACAGAATG TCCCCAAGAG ACAGAGTTGG ACTCATAGCT GACACTGCAT CCATTGCTGT TTCTGGTAAC AATCTGACAG AGACCTTCTT GAAGTTGGTC AAGTCGATTG TACACCAGCT CGGAGACGAC TACGTAGTGT GGTTGGAATT GGGCAAACGA TTGGATGATT TGTTTACCGC ATTCGGTGGA GTTGATGAAG AGTTGACTAA TAACTTGAAT AAGTTCTTGA GATTCGTCTA CCAAGACAAG GCTCTTGCCT TCATTGACGA GTTGAGAAAC TCTTCCTCTA TTGACAATTC CGACTTTTTG AAAGTTAAGC TTAGATCGGA AGTGTTGACC CATGCTGGTT TGTTGTCAAT TCCAGAAGTG ACTCAATATG CCTCTGAATT GTTCAAGAAA TGGTTGGAAG GTACCCCAAT TCACCCATCG CTCAGATCGT TTGTCTTTGG CACCGTAGCT GCTTCTCCAG ACTTATCTAA TACGCAGTTT GATTCCATTT TGAAGGAAGT GACTCACCCA AGCTCGTTGG ACTCCAGAGA AGTAGCCTTA CGTTCGTTGG GTAACGTCAA CAATGACGAG CTTTCCGCTA GGTTGCTCAA CTACTTGGTG GACCCAGAAG TTATTCCCAC GATGGACTCG CACTTTTTGG GTGTACCTTT ATCGTCCAAC CTTCACACCA AGGAAAAGTT TCTTCAGTTT TTCTTTGAGC ACTATGCTGA TTTCTACAAG TTGATGTCAA CCAATATGGT TGTTTTGGAT AGGTTCATTA AGTTCACATT TGTCAACTAC CAGTCGTTGG ACACTCTTGA GAAGATGGAA ACCTTCTTCA AGGGCAAGGA TATCCATGGG TTCGAAAGAG CCCTCAAGCA AGCATTGGAT AATGTAAGAA TCAATGCCAA CTGGTTCAAC AGAGACCACC AGACGGTCAA GGACTTCTTG GCTGGTTTGT AGGCATATAT ACTCTAATAC ATAATACAAT GAAAAGAATA CGAT
|
Protein sequence | MCATTKPYYE ALPASLKPVH YDLSISAIDV AAETFKGKVS INLDIVEETD ELHLNYRDLT VTKEDIEVTL ITSDDKSSSV NIVSLTEFKE KEFFIIKFAE KVQPAAGAKL LVTLHYNAII QTNMAGFYKS GYTEDGVEKF MLSTQFEATD ARRAFPCLDE PSLKATFIVD VTVPGQWTAL GNTPVAESED IVDKNLKKVT FEKTPIMSTY LLAWATGEFE YIESFTEENY VDNKPLPVRI YTTKGYLEDA KLASEIAPKI VDYFSKIFEI KYPLPKLDLI AVHSFSHNAM ENWGLITYRS TALLYSEEKS DPSYKQKVVY VVAHELAHQW FGNLVTMKWW DELWLNEGFA TWVGFAAVEY LYPEWNIFSG FVSESLQQAL NLDGLRNSHP IEVPVIDALD IDQLFDVISY LKGASTILMI SNYLGKEEFL KGVALYLNRN KFGNASSHDL WSAVGEVSGK PIDSLMESWI KKVGFPVVSV DEDKNNLVLN QSRFLNSGDI TDAENDTKWW IPLNITTDST SVRDISVDSF DSEKLIIENF ALKNDFFKLN KDTSGVYRVN YSSSILEKNI LPHFNRMSPR DRVGLIADTA SIAVSGNNST ETFLKLVKSI VHQLGDDYVV WLELGKRLDD LFTAFGGVDE ELTNNLNKFL RFVYQDKALA FIDELRNSSS IDNSDFLKVK LRSEVLTHAG LLSIPEVTQY ASELFKKWLE GTPIHPSLRS FVFGTVAASP DLSNTQFDSI LKEVTHPSSL DSREVALRSL GNVNNDELSA RLLNYLVDPE VIPTMDSHFL GVPLSSNLHT KEKFLQFFFE HYADFYKLMS TNMVVLDRFI KFTFVNYQSL DTLEKMETFF KGKDIHGFER ALKQALDNVR INANWFNRDH QTVKDFLAGL
|
| |