Gene PICST_74268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_74268 
SymbolAPL2 
ID4840988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp219710 
End bp221977 
Gene Length2268 bp 
Protein Length736 aa 
Translation table12 
GC content43% 
IMG OID640392303 
Productbeta-adaptin, large subunit of the clathrin-associated protein (AP-1) complex 
Protein accessionXP_001386441 
Protein GI150866746 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5096] Vesicle coat complex, various subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGATGTCGCT TGAAAGGAAG ATTCGCAGCT TCTTGACGGG CCCCAGAAAG GGCGAGACGT 
TTGAATTGAA GAGCGGTTTG GTTTCGCAGT ACAAGCACGA AAGAAAGGAT GCGATCCAGC
GAGTGATCCA GGCCATGACT GTGGGTAAGG ATGTGTCTTC GCTCTTTCCC GATGTCTTGA
AGAACATCGC CACCTATGAT TTAGAACAGA AGAAGTTGGT CTATTTATAC TTGATGAACT
ACGCCAAAAC ACATCCCGAG CTTTGTATTT TGGCTGTCAA CACTTTTGTG CAAGATACTG
AAGATCCCAA TCCCTTGGTG AGAGCCCTAG CCATCCGTAC TATGGGCTGT ATACGTGTCA
ACAAGATGGT GGACTATATG GAGATTCCAT TGCAGAGAAC GCTTCAAGAC GAGAATCCCT
ATGTGAGAAA GACCGCTGCT CTCTGTGTTG CCAAATTGTT TGACTTGAAT CCCGAAATGT
GTGTAGAGTT TGGCTTCTTG GACCAGTTGA AGGGTCTCAT CAAGGACTCC AACCCTATGG
TGGTGGCCAA TTCGTTGAAC GCTTTATACG AAATCAGAGA CATGAACTCC GATGCCAACT
TAGAGATTTT CACGGCTGAT ACCGAAACTG TCAAGAACTT ACTTATGTGC TTGAACGAAT
GCACTGAATG GGGAAGAATC ACCATATTGA CCACTTTGAA TGAATATCAT ACTGATGATG
CTGAAGAAGC CAACCACATA ATAGAGCGTG TGACTCCACA ATTGCAGCAT GTAAATCCGT
CTGTGGTGTT GAGTTCCATC AGAGCCATAA TCCACCATAT AGATGCTATA CCCGTCACAG
CGCAGAGAGC TGCTATCTTA AAGAAACTTT CTGCTCCATT GGTTTCGTTG GTTAGTTCTT
CGATTCCTGA AGCTCAATAT GTAGGCTTGA AGAATATCCG CATCATCTTG GAAAAGTATC
CTCAGATCTT GTCCAAAGAG TTGAGAGTGT TTTTCATCAA GTACTCCGAT CCCTTGTACT
TAAAGTTGGA GAAGTTGGAA ATCATGGTCC GTTTGGCTAA CGATTCTAAC AGCGCCTTGT
TGTTAGGTGA GTTAAAAGAG TATGCCATGG AATTCGAGCC TTCGTTGGTG GCTAAGGCTA
TCAAATCCAT TGGCTCTGTT GCCATCAAAT TGTCTGGCTC TACTGTCAAA GCAATCAATC
TTTTGAATAG CTTGATAGAC CATAGAGGGG GTGATTTAGT CATCAACGAG TCCATCGTCG
TTTTGACAAA TATATTGAGA CGTTACCCTG GTAAAAACGA TCTTATCACT TTAATTATCC
CAGTTATATC AAACCATATT TCCGAATTGG AGAGACTGGA TGCCATGTCC GGTTACATCT
GGCTCTTGGG AGAGTATCCC AAGTATTTCT CCAACTTGCA TGACAAATTA CAAGTCTTGA
TCGACGATTT CTTGTCGTTT GAGTCTGTAT TGCAGTTGAA TATCTTGACT GCTATTGTCA
AGATTAATCT CTCTGCTTCA GGCTCTAAGT ACTCCAGTTT GTTACAGAAG GTGTTGGAGT
CGTCCACCAA AGATTGTGAA AATGCTGATG TCAGGGACAA GGCATATATC TACTGGCGTT
TATTGTCGTC TTCATCTACC GAATCACAGA AGGAAATTAT CTTGACCAAG TTGCCTCCTA
TCACCACAAC CATTGCTTCT TTCAACCCTG TAGTTTTGGA GTCGTTGGTG GAAGAGTTGT
CGACATTGTC GTCTGTCTAC CATAAGCCTG CATTCACCTT CATTGATCCA AACGCTGCCC
ACAGTCATGT AGCTCAAGGT AACAAGTCCA GATCGTCTTC TAAGAAGGAC AACATTGAAG
ACTTGACCAA CTTGGCCAAA CAGGAAATTA TCAATAACGC TAAGAACGAA AACTTGCTTG
ACTTTGACGA TGACGATGAC GCACTTACAG GCGACAATGG TGCCGCTGAA GGTTCTGGCA
GTTTATTGGA TGAATTAAAC GACTTGTTCA GTGCACCTGT TCCCGTATCT CAGGGACAGC
AGACGCAGCC TTCTTCTAAT AACGATATAT TGAGTTTGTT TGGTGCAATC CCACAAAATG
CCCCAACTCC TGTGAGCAAT GTGACCCAGG GATTAAATAA CTTCAACATC GGATCCAATG
CTGCTCCAAG CAATACCAGC AAGTTGAATA ACGATCTCTT GGATCTTATG TAACGTTTAT
TTTGTATATG TTATAGATGT TATACAAAAT ACGGAATGTA ATGTACTC
 
Protein sequence
MSLERKIRSF LTGPRKGETF ELKSGLVSQY KHERKDAIQR VIQAMTVGKD VSSLFPDVLK 
NIATYDLEQK KLVYLYLMNY AKTHPELCIL AVNTFVQDTE DPNPLVRALA IRTMGCIRVN
KMVDYMEIPL QRTLQDENPY VRKTAALCVA KLFDLNPEMC VEFGFLDQLK GLIKDSNPMV
VANSLNALYE IRDMNSDANL EIFTADTETV KNLLMCLNEC TEWGRITILT TLNEYHTDDA
EEANHIIERV TPQLQHVNPS VVLSSIRAII HHIDAIPVTA QRAAILKKLS APLVSLVSSS
IPEAQYVGLK NIRIILEKYP QILSKELRVF FIKYSDPLYL KLEKLEIMVR LANDSNSALL
LGELKEYAME FEPSLVAKAI KSIGSVAIKL SGSTVKAINL LNSLIDHRGG DLVINESIVV
LTNILRRYPG KNDLITLIIP VISNHISELE RSDAMSGYIW LLGEYPKYFS NLHDKLQVLI
DDFLSFESVL QLNILTAIVK INLSASGSKY SSLLQKVLES STKDCENADV RDKAYIYWRL
LSSSSTESQK EIILTKLPPI TTTIASFNPV VLESLVEELS TLSSVYHKPA FTFIDPNAAH
SHVAQGNKSR SSSKKDNIED LTNLAKQEII NNAKNENLLD FDDDDDALTG DNGAAEGSGS
LLDELNDLFS APVPVSQGQQ TQPSSNNDIL SLFGAIPQNA PTPVSNVTQG LNNFNIGSNA
APSNTSKLNN DLLDLM