Gene PICST_77752 details


Gene Information

Locus tag: PICST_77752
Symbol: AAP1
ID: 4838617
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 1193752
End bp: 1196735
Gene Length: 2984 bp
Protein Length: 890 aa
Translation table: 12
GC content: 43%
IMG OID: 640389932
Product: arginine/alanine aminopeptidase
Protein accession: XP_001384182
Protein GI: 150865102
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0308] Aminopeptidase N
TIGRFAM ID:
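
The length fields above can be cross-checked against the coordinates. The sketch below assumes that the start and end positions are 1-based and inclusive (an assumption, although it reproduces the reported 2984 bp) and that the protein length excludes the stop codon. The function names are illustrative and do not belong to any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein of the given length.

    Assumes an unspliced coding region; the stop codon adds 3 nt.
    """
    return protein_aa * 3 + (3 if include_stop else 0)


gene_len = span_length(1193752, 1196735)  # coordinates from the record
cds_len = coding_length(890)              # protein length from the record

print(gene_len)            # 2984
print(cds_len)             # 2673
print(gene_len - cds_len)  # 311
```

Under these assumptions, the 311 nt difference would correspond to non-coding sequence flanking the coding region in the listed gene sequence, which fits a record that reports the gene as unspliced.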


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.204376
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
AGCAACGCAT AGTTCTGTTT CTTTGTTTTG AGTCTCATAT CTGTGAAGCC GCTAGCAAAA 
GCTGAACGTA AACTTGTTGC AATTGAATTT GTCAAATCTT CAATTCATCT ATAGCAAGTC
TATTACTAAA AGTAACTGAC CAACAGAATT TTGAAGCCCT ACTGTCAATT GATCTAATTG
CCACAGTATT AAAGTCTCGC TGTTCTGATA TTGTATAATC AAAGAGGTAT TTCTTGTATT
TTCGTACTTG CTTTGCTTCT TCTCATCTAA TGTGCGCAAC TACGAAGCCA TACTATGAGG
CTCTTCCTGC CAGCCTCAAG CCAGTCCATT ACGACTTGTC TATTTCTGCA ATCGACGTAG
CTGCTGAAAC CTTCAAAGGC AAAGTATCCA TCAACTTGGA CATTGTAGAA GAAACTGATG
AGTTACACTT GAACTACAGA GACTTGACTG TGACTAAAGA AGACATCGAA GTCACCTTGA
TCACCAGCGA TGACAAATCT TCCTCTGTCA ACATCGTCTC ATTGACTGAA TTTAAGGAAA
AGGAATTTTT CATCATCAAG TTTGCTGAAA AGGTACAACC AGCTGCTGGT GCAAAACTCT
TAGTTACTCT TCACTACAAT GCTATCATCC AGACCAACAT GGCAGGTTTC TATAAATCAG
GTTATACCGA AGATGGCGTC GAAAAGTTCA TGTTGTCCAC ACAATTCGAA GCTACTGATG
CCAGAAGAGC TTTCCCATGT TTGGATGAAC CCTCATTGAA GGCCACCTTC ATTGTGGATG
TCACTGTTCC TGGTCAATGG ACTGCTTTGG GTAATACTCC TGTAGCTGAA TCCGAAGACA
TTGTAGATAA AAACCTCAAG AAAGTCACAT TTGAAAAGAC ACCCATCATG TCCACATATT
TGTTGGCTTG GGCTACGGGT GAATTCGAGT ATATCGAGTC TTTCACGGAA GAAAACTACG
TGGACAACAA GCCTTTGCCT GTTCGTATTT ATACGACAAA GGGATATCTT GAAGATGCGA
AGTTAGCTTC TGAAATTGCT CCAAAGATCG TAGACTACTT TTCCAAGATT TTCGAAATCA
AGTACCCCTT GCCCAAATTG GACTTGATAG CGGTGCATTC CTTCTCGCAC AATGCCATGG
AAAACTGGGG GCTTATCACC TATAGATCAA CGGCCTTGTT GTACTCAGAG GAGAAGTCAG
ATCCTTCTTA CAAACAGAAG GTAGTCTATG TTGTAGCCCA CGAATTGGCC CATCAGTGGT
TTGGTAATTT GGTTACCATG AAGTGGTGGG ATGAACTTTG GCTTAACGAA GGGTTTGCTA
CCTGGGTTGG ATTTGCAGCT GTAGAGTACT TGTACCCAGA ATGGAATATT TTCAGCGGGT
TTGTCTCTGA GTCGTTGCAG CAGGCTCTTA ACTTGGATGG ATTGCGAAAC TCGCATCCTA
TCGAAGTACC TGTTATCGAC GCATTGGACA TCGACCAGTT GTTTGATGTT ATCTCTTACT
TGAAGGGTGC CTCTACTATT CTAATGATTT CTAATTACTT GGGAAAGGAA GAATTCCTCA
AGGGTGTGGC TCTCTACTTG AACCGCAACA AGTTCGGCAA CGCCAGCTCT CACGACTTGT
GGAGTGCCGT AGGCGAAGTG AGTGGTAAAC CCATCGACAG CTTGATGGAA TCCTGGATCA
AAAAGGTTGG GTTCCCAGTT GTTAGCGTAG ACGAAGACAA AAACAACTTA GTGCTTAACC
AATCGCGTTT CTTGAACAGT GGAGACATAA CCGATGCTGA GAATGACACC AAGTGGTGGA
TTCCACTTAA CATCACGACG GATTCCACTT CTGTAAGAGA CATTTCTGTT GACTCTTTTG
ACTCGGAAAA GTTGATCATT GAGAATTTTG CCTTGAAAAA TGACTTTTTC AAGTTGAACA
AGGACACGAG CGGAGTCTAC AGAGTTAACT ACTCCAGCTC TATCTTGGAG AAGAACATTC
TTCCACACTT CAACAGAATG TCCCCAAGAG ACAGAGTTGG ACTCATAGCT GACACTGCAT
CCATTGCTGT TTCTGGTAAC AATCTGACAG AGACCTTCTT GAAGTTGGTC AAGTCGATTG
TACACCAGCT CGGAGACGAC TACGTAGTGT GGTTGGAATT GGGCAAACGA TTGGATGATT
TGTTTACCGC ATTCGGTGGA GTTGATGAAG AGTTGACTAA TAACTTGAAT AAGTTCTTGA
GATTCGTCTA CCAAGACAAG GCTCTTGCCT TCATTGACGA GTTGAGAAAC TCTTCCTCTA
TTGACAATTC CGACTTTTTG AAAGTTAAGC TTAGATCGGA AGTGTTGACC CATGCTGGTT
TGTTGTCAAT TCCAGAAGTG ACTCAATATG CCTCTGAATT GTTCAAGAAA TGGTTGGAAG
GTACCCCAAT TCACCCATCG CTCAGATCGT TTGTCTTTGG CACCGTAGCT GCTTCTCCAG
ACTTATCTAA TACGCAGTTT GATTCCATTT TGAAGGAAGT GACTCACCCA AGCTCGTTGG
ACTCCAGAGA AGTAGCCTTA CGTTCGTTGG GTAACGTCAA CAATGACGAG CTTTCCGCTA
GGTTGCTCAA CTACTTGGTG GACCCAGAAG TTATTCCCAC GATGGACTCG CACTTTTTGG
GTGTACCTTT ATCGTCCAAC CTTCACACCA AGGAAAAGTT TCTTCAGTTT TTCTTTGAGC
ACTATGCTGA TTTCTACAAG TTGATGTCAA CCAATATGGT TGTTTTGGAT AGGTTCATTA
AGTTCACATT TGTCAACTAC CAGTCGTTGG ACACTCTTGA GAAGATGGAA ACCTTCTTCA
AGGGCAAGGA TATCCATGGG TTCGAAAGAG CCCTCAAGCA AGCATTGGAT AATGTAAGAA
TCAATGCCAA CTGGTTCAAC AGAGACCACC AGACGGTCAA GGACTTCTTG GCTGGTTTGT
AGGCATATAT ACTCTAATAC ATAATACAAT GAAAAGAATA CGAT
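
The GC content reported for this gene can be recomputed from the sequence listing. The following is a minimal sketch, assuming the sequence is pasted as shown here (blocks of 10 bases separated by whitespace); `gc_content` is a hypothetical helper name, and only the first two blocks of the listing are used as sample input.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (the block and line separators of the listing) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence, used only as a small sample.
sample = "AGCAACGCAT AGTTCTGTTT"
print(f"{gc_content(sample):.0%}")
```

Passing the complete gene sequence instead of the sample would give a value to compare against the GC content field in the record.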
 
Protein sequence
MCATTKPYYE ALPASLKPVH YDLSISAIDV AAETFKGKVS INLDIVEETD ELHLNYRDLT 
VTKEDIEVTL ITSDDKSSSV NIVSLTEFKE KEFFIIKFAE KVQPAAGAKL LVTLHYNAII
QTNMAGFYKS GYTEDGVEKF MLSTQFEATD ARRAFPCLDE PSLKATFIVD VTVPGQWTAL
GNTPVAESED IVDKNLKKVT FEKTPIMSTY LLAWATGEFE YIESFTEENY VDNKPLPVRI
YTTKGYLEDA KLASEIAPKI VDYFSKIFEI KYPLPKLDLI AVHSFSHNAM ENWGLITYRS
TALLYSEEKS DPSYKQKVVY VVAHELAHQW FGNLVTMKWW DELWLNEGFA TWVGFAAVEY
LYPEWNIFSG FVSESLQQAL NLDGLRNSHP IEVPVIDALD IDQLFDVISY LKGASTILMI
SNYLGKEEFL KGVALYLNRN KFGNASSHDL WSAVGEVSGK PIDSLMESWI KKVGFPVVSV
DEDKNNLVLN QSRFLNSGDI TDAENDTKWW IPLNITTDST SVRDISVDSF DSEKLIIENF
ALKNDFFKLN KDTSGVYRVN YSSSILEKNI LPHFNRMSPR DRVGLIADTA SIAVSGNNST
ETFLKLVKSI VHQLGDDYVV WLELGKRLDD LFTAFGGVDE ELTNNLNKFL RFVYQDKALA
FIDELRNSSS IDNSDFLKVK LRSEVLTHAG LLSIPEVTQY ASELFKKWLE GTPIHPSLRS
FVFGTVAASP DLSNTQFDSI LKEVTHPSSL DSREVALRSL GNVNNDELSA RLLNYLVDPE
VIPTMDSHFL GVPLSSNLHT KEKFLQFFFE HYADFYKLMS TNMVVLDRFI KFTFVNYQSL
DTLEKMETFF KGKDIHGFER ALKQALDNVR INANWFNRDH QTVKDFLAGL