Gene PICST_36810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_36810 
Symbol 
ID4840258 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1402330 
End bp1403703 
Gene Length1374 bp 
Protein Length457 aa 
Translation table12 
GC content48% 
IMG OID640391573 
Productpredicted protein 
Protein accessionXP_001385968 
Protein GI126138890 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.820131 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCTCG AGTTCCCTGA GACCGTTTCC AAGTTGAACT TCAAGGCTCC AGCCAAAAAG 
TCCTCTAGCA AGTCGCAGTT CGACTACCAT GTATCTGACG CCAAGCTTCC TAACCACAAG
CTTAGAGTCA AAAACACTCC TAAGGACTTG GGCATTGACT CCGTCAAGCA GTACAGTGGT
TACTTGGATG TTGAGGACGA AGACAAGCAC TTCTTCTACT GGTTCTTCGA ATCGAGAAAC
GACCCCAAGA ACGACCCCGT TATCTTGTGG TTGAACGGTG GTCCAGGATG TTCGTCTTTG
ACCGGTTTGT TCTTTGAATT GGGCCCAGCA TCCATCGGCG CCGACTTGAA GCCTGTTCAC
AACCCATACT CATGGAACAG TAATGCCTCG GTAATCTTCT TGGACCAGCC AGTAAATGTT
GGATACTCCT ACTCTTCTCA GTCTGTTTCC AACACCATTG CTGCTGGCCA GGACGTGTAT
GCCTTCTTGG AATTATTCTT CAAGCAGTTC CCAGAATACA ACACTCTTCC TTTCCACATT
GCTGGTGAAT CCTACGCCGG CCATTACATC CCAGTGTTCG CCAGTGAGAT CTTGAGCCAT
GAGGACCGTT CTTTCAACTT GACCTCGGTG TTGATCGGAA ACGGTTTGAC CGACCCTTTG
ACCCAATACG AATACTACGA GCCTATGGCC TGTGGTGAAG GAGGAGAACC TTCCGTCTTG
GAACCAGAAG AATGCCAAGC CATGTCCAAC GCCATTCCTA GATGTTTGTC TTTAATCAAG
TCCTGTTATG AGTCCGGCTC TTTGTGGTCG TGTGTTCCTG CCACGATCTA CTGTAACAAC
GGTCAGATGG GTCCTTACCA AAAGACTGGT AGAAATGTCT ACGACATCAG AACCATGTGT
GAAGGCTCCA ACTTGTGCTA CAAAGATTTG GAATACATCG ACCAATACTT GAACCAGCCG
GAAGTCAAGG CTAAGCTTGG TGCCGAGGTG GACGAGTATG AATCCTGTAA CTTCGACATT
AACAGAAACT TCTTGTTGGC CGGTGACTGG ATGAAGCCTT ACTACAAGAA TGTCATTGAA
TTATTGGAAG CTAAGCTCCC AGTGTTGATT TATGCCGGTG ACAAGGATTT CATCTGTAAC
TGGTTGGGAA ACCAAGCCTG GACCAACAGT TTGCCATGGT CTGGAGCTGC CAAGTTTGCC
ACAGAAAAAA TCAGAACCTG GACAGTAGGA AAGAAGGCTG CCGGTGAAGT CAAGAACTTT
GCCAACTTCA CCTTCTTGAG AGTGTTTGGT GGTGGTCACA TGGTGCCATA CGACCAACCA
GAGAATGCTT TGGACATGGT CAACAGATGG GTTTCTGGCG ACCGCAAGTT CTGA
 
Protein sequence
MMLEFPETVS KLNFKAPAKK SSSKSQFDYH VSDAKLPNHK LRVKNTPKDL GIDSVKQYSG 
YLDVEDEDKH FFYWFFESRN DPKNDPVILW LNGGPGCSSL TGLFFELGPA SIGADLKPVH
NPYSWNSNAS VIFLDQPVNV GYSYSSQSVS NTIAAGQDVY AFLELFFKQF PEYNTLPFHI
AGESYAGHYI PVFASEILSH EDRSFNLTSV LIGNGLTDPL TQYEYYEPMA CGEGGEPSVL
EPEECQAMSN AIPRCLSLIK SCYESGSLWS CVPATIYCNN GQMGPYQKTG RNVYDIRTMC
EGSNLCYKDL EYIDQYLNQP EVKAKLGAEV DEYESCNFDI NRNFLLAGDW MKPYYKNVIE
LLEAKLPVLI YAGDKDFICN WLGNQAWTNS LPWSGAAKFA TEKIRTWTVG KKAAGEVKNF
ANFTFLRVFG GGHMVPYDQP ENALDMVNRW VSGDRKF