Gene PICST_31787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31787 
SymbolAPE2.1 
ID4838808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1692628 
End bp1695240 
Gene Length2613 bp 
Protein Length870 aa 
Translation table12 
GC content38% 
IMG OID640390123 
Productalanine/arginine aminopeptidase 
Protein accessionXP_001384279 
Protein GI150865173 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTAAAT CAGGTACGAT AAACGATTTG CAACTACCAG AACATGTGCG TCCCAGCTCT 
TATACGCTTC AGCTCAAGGT AGATGTCGAG AAGCAAATAT ACGACGGCTC TGTACTCATT
AAAATCTTCA TTTATGAGGA TTGCGATTTC ATTGTTCTCA ATAGCAGCAA CCTAGAAGTC
CAGGGGGCCA GACTTGGCAA TAAACCAATT AGTTGGTCTG TTGATAGGGA GTTTTTAAGG
TTCGATTCGA AGTTCACAAA GAATGAATTG GTTGAGTTAT CTATTGAGTT TGCTGGTAAG
TTTAACGATC ACATAGCTGG CTTATATCAG TCTTCCTACA CAATTGAGGA GGAAAACGAA
GAGAAAACAC GATATGTGGC AGCTACACAT TTTGAACCAA TCGATTGCCG AACAGTCTTT
CCGTGCTTTG ATCAACCTGA CATGCGAGCC GAGTTCGAGA TCATTTTGAT AGTTAAAAGT
GAATTGACGG CTTTGTCAAA TATGGAAGTA GAGAAGGAAA TCGCACTTGA AAATGGCTTT
AAACAAGTTG TCTTTAAACG CTCACCACCA ATGCCAACGT ATTTGGTGGG CTTGCTAATC
GGACAATTTG ATTATGTAGA ATCCAAACTT CTGAGAATTC CAATAAGAGT ATGGTCGGAT
CCCGGAAAAA TTAACAAGGC ATTATATGCA CTTGAACTTG CTGAAGCTGC CCTAGAATTT
TATGAAAAGC AGTTCAAAAT CAACTATCCA CTACCAAAAT TGGACTTCGT AGCCATTCCA
GATTTTCCCA AACTAGGCAT GGAAAACTTT GGCTTGATTT TCTTTAAGGA AGAAACCATT
TTGGTAGACA GGGATACTAC CTCTACCAAC AATAAGTACG AGGTTGCTGC AACTATTTTC
CATGAAGTTT CTCATCAATG GTTTGGAAAT CTTGTGACAT TGAAATTCTG GGATAGCCTT
TGGCTCAAGG AAGGATTCGC CGATTGGATG TCTTGGTATG CTATTGATAG CCTTTATCCC
GAATGGAAGC CGTTTCAAAA TTATCTCGTC TATGACTTGC AAAATTCATT AACAGCAGAT
GCTCTCAGTA CGACACATTC AGTAGAGATG CCAATTGCGT CATTGGAGGA TATCAAACAA
GGGTACGATA GTATTTCATA CGCCAAAGGC TGTTCATTAA TAGTGATGGT TGCAAAATGG
CTTGGAGTGG ATATTTTTAT GGAAGGCGTT GTAAAATATT TGTCTACATT CAGTTGGAAA
GCTACCACCG CACTGGATTT ATGGTCATGT TTATACGATG TCAGCGGAAT AGATGTTGGA
TCTGCAATGG AGGTTTGGAT AAAACAAGCT GGTTTTCCGA AGGTTACTGT TGAGGAATTG
GATGATAACA AGATAAAGAT TTCCCAAAGG AGGTTCATAT CTAATCCGAC TGTAACTGAA
TATGATGACT ATTTATTTCC AATCTTTGTA AATATCAGGA CTACAAAAGA ACCAAGTTAT
CAAATTTTAC TCAAATCAAA GGAAGAGATA TTTGAATTAG AGCTTGAGGA TGACTTTTTC
AAAGTTAATA GTGACCAAGT TGGGTTTTAT CGAACTGCGT ACAGTGAAGA GAGGTGGACC
AAACTAGGTT TAGCTGGAGT TCAAGGAAAG CTATCCGTCG AAGACAGAAT TGGTTTACTT
GCCGATTGTG CAAAATTGGC GGAATCCGGT TATATACTGA CTATTTGCTT TTTTGAGCTA
CTTAAGCAAT GGAGTTGCGA AAAAGACGCA TACGTTTGGG ATGAAGCAAT TAATGACTTA
TTTGATATTC AGGATGTATT CATATTTGAC AATGGGAATA TATTTGAAGG CATCAATCAA
TTTATGAGAA ATCTAATTGG GAATCATATT CAGGCTAGTC TTGAAATCAG CCAAAAAGAT
TCTTATGAAT TGAATAATTT CAAGAAAACA ATCTTTTCTG CTGGCTATAG TAATGGCGAT
CCACTTGTTA TCAAATACTG TAGAGAAAAA TTTGCTTCAT TTACACACGA CGGAAACACA
GTTAATGCTG ACCAAAGAGG TGTCATTTTC AAGTGTGTTG CTAAATACGG AGGGCAAGAA
GAGTATGAAA GATTGATGGA GATCTATGAG GAAGCAGATG ATGACGACAT TGCAGATGAT
GCGTTGGTTT CATTAGGTCG ATTTCCAAAG CCTGACATAT TGCAGAACTT CTTGCAATTT
GTTCTAGATC TGCTAAATGA GCAGGATGTA TTGTTTGCAT TATCTGCTTT GAAAAATCAT
TCTACAGGTA TCACAACTCT TTGGAGATTT TTGCGGAATG ATTGGGAATC GATTGCTGAA
AAGCTCGGAT ATGGTTCAGA AACACATATC AAAGTCGTCA ACGCATGCTT GTTAAGATTA
GCTACAAGGG ATCAAAAAGA CGAAATTGAG AGTTTCTTTG CTGATAAGGG CGAGGTTTTT
GAGAAGACCG TTCGTCTGGC GTTAGAACGA ATTGACTTGA AAGTGCAGTG GGTCGAAAGA
GATGGTTCTA AGTTAAGGGA ATGGTTCAAA ACCAACTTGC ATTATTCCGA TGGTATTGTG
TCCAAAGTGG AAGGATTGTC AATAGACAAG TAG
 
Protein sequence
MTKSGTINDL QLPEHVRPSS YTLQLKVDVE KQIYDGSVLI KIFIYEDCDF IVLNSSNLEV 
QGARLGNKPI SWSVDREFLR FDSKFTKNEL VELSIEFAGK FNDHIAGLYQ SSYTIEEENE
EKTRYVAATH FEPIDCRTVF PCFDQPDMRA EFEIILIVKS ELTALSNMEV EKEIALENGF
KQVVFKRSPP MPTYLVGLLI GQFDYVESKL SRIPIRVWSD PGKINKALYA LELAEAALEF
YEKQFKINYP LPKLDFVAIP DFPKLGMENF GLIFFKEETI LVDRDTTSTN NKYEVAATIF
HEVSHQWFGN LVTLKFWDSL WLKEGFADWM SWYAIDSLYP EWKPFQNYLV YDLQNSLTAD
ALSTTHSVEM PIASLEDIKQ GYDSISYAKG CSLIVMVAKW LGVDIFMEGV VKYLSTFSWK
ATTASDLWSC LYDVSGIDVG SAMEVWIKQA GFPKVTVEEL DDNKIKISQR RFISNPTVTE
YDDYLFPIFV NIRTTKEPSY QILLKSKEEI FELELEDDFF KVNSDQVGFY RTAYSEERWT
KLGLAGVQGK LSVEDRIGLL ADCAKLAESG YISTICFFEL LKQWSCEKDA YVWDEAINDL
FDIQDVFIFD NGNIFEGINQ FMRNLIGNHI QASLEISQKD SYELNNFKKT IFSAGYSNGD
PLVIKYCREK FASFTHDGNT VNADQRGVIF KCVAKYGGQE EYERLMEIYE EADDDDIADD
ALVSLGRFPK PDILQNFLQF VLDSLNEQDV LFALSALKNH STGITTLWRF LRNDWESIAE
KLGYGSETHI KVVNACLLRL ATRDQKDEIE SFFADKGEVF EKTVRSALER IDLKVQWVER
DGSKLREWFK TNLHYSDGIV SKVEGLSIDK