Gene PICST_36090 details

Gene Information

Locus tag: PICST_36090
Symbol:
ID: 4838788
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 1646366
End bp: 1647892
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 12
GC content: 47%
IMG OID: 640390103
Product: predicted protein
Protein accession: XP_001384620
Protein GI: 150865414
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1362] Aspartyl aminopeptidase
TIGRFAM ID:
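
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The variable and function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1646366
end_bp = 1647892
gene_length_bp = 1527
protein_length_aa = 508

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS holds one codon per residue plus one stop codon.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1


def record_is_consistent() -> bool:
    """Return True if coordinates, gene length and protein length agree."""
    return (
        span_bp == gene_length_bp
        and gene_length_bp % 3 == 0
        and expected_protein_aa == protein_length_aa
    )


print(span_bp, codons, expected_protein_aa, record_is_consistent())
# prints: 1527 509 508 True
```

Under these assumptions the three fields agree: 1527 bp is 509 codons, which is 508 residues plus the stop codon.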


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGAAGA AGTTGTCCAC GGCAATTGAA GCTACTTCCG CTGGTTCCAG CTCCAGTTCT 
ACTGCTTCTA CTGCTAGTGC TCCAGCTCCT ACTCCTGTTT CTACCAGTAC TTCCAAGTAC
TCTGACGATT ACTACGAAGC CCATGCTGAC GAATACATTA ATTTCACCTA TAAGTCGCCA
ACCATCTACC ATGTGGTGGA CTACTTTGGA AGCAAGCTTG AACATGCTGG CTTTCTGTTC
ATTTCTGAAA AGGAATCGTG GGACTCGATT AGACCTGGAA AGTACTACAC TGTCAGAAAC
GGCTCTGCTT TAGCCGCGTT CATAGTTGGT GAACACTGGA AGCCCGCTAA GGGTGTTGGA
ATCATTGGTT CGCACATCGA TGCGCTTACA GTGATTTTAA AGCCCAACTC TACCAAAGCC
AAAGTAGAAG GTTACGAGCT TTTGGGTGTA GCCCCATACG CCGGTACTTT GGGCGATGTA
TGGTGGGATA GAGACCTTGG GGTAGGTGGT AGATTGCTTG TGAAAGACAG TACTGGCAAG
GTTGTGCCAC AGCTTGTAGA CTCTACCCCT AACCCTATTG CCCATATTCC TACTTTAGCA
CCTCACTTTG GAGCTCCAGC TGTGGGTCCT TTCAACAAGG AAACGCAGGC CGTTCCCGTG
GTTGGTTTCT CCACTGAAGA CCCAGAAGAA CCCACCGAAG AAGAAAAGTC CGCTCCTTTA
TTTGGCAAAC ACCCTATGAC CTTGTTGCGT TACATTGCCA AAAAGGCCAA CGTCAAGGTT
TCGGACATTG TCCAATGGGA CTTGCAATTG TACGACATCC AGAAGGGTGT CAAAGGAGGT
TTGAATAAGG AGTTTGTTTT TGCACCCAGA ATTGACGACA GAGTCTGTTC CTTCGCAGCC
ATCAACTCGT TGATTGAAGT CGATAACGAC CATTTGCTCA AGTCGGACTC GTTCTCGCTC
GTGGGTCTTT TCGACAACGA GGAAATCGGT TCGGCAACTC GTCAAGGTAT CAAGGGTGGT
TTGACTGAAT CTGTCATCAC TAGAGTCATC TCGTCTAACT ACTTCAACCC TAAATCCTAC
GATGTTCAGG AACAAATCCG TTTGACTTAT GCCAACACCA TCATCTTGTC AGCTGACGTC
AACCACTTGC TCAACCCCAA CTTTGCCAAC GTTTATTTGG AGCACCACAA GCCTGTTCCC
AACACTGGTG TCACCATTGC CTTGGACCCT AACGGACACA TGGCTACCGA CTCCACTGGT
TTAGCTCTTG TTGAAGAATT GGCCAAACTC AACGGCGATA CGTTGCAATA TTTCCAGATC
AGAAATGACT CCAGATCTGG TGGTACCATT GGCCCTTCAA TTTCGTTGCA GACCGGAGCC
AGAACCATCG ACTTAGGTAT CCCTCAACTC TCGATGCATT CTATCAGAGC TACTGTTGGT
ACCAAGGATA TCGGTTTGGG AGTCAAGTTC TTTGCTGGTT TCTTTTCCAA TTGGAGAAAG
ACCTACGACA GCTACAAGGA CTTGTAA
 
Protein sequence
MLKKLSTAIE ATSAGSSSSS TASTASAPAP TPVSTSTSKY SDDYYEAHAD EYINFTYKSP 
TIYHVVDYFG SKLEHAGFSF ISEKESWDSI RPGKYYTVRN GSALAAFIVG EHWKPAKGVG
IIGSHIDALT VILKPNSTKA KVEGYELLGV APYAGTLGDV WWDRDLGVGG RLLVKDSTGK
VVPQLVDSTP NPIAHIPTLA PHFGAPAVGP FNKETQAVPV VGFSTEDPEE PTEEEKSAPL
FGKHPMTLLR YIAKKANVKV SDIVQWDLQL YDIQKGVKGG LNKEFVFAPR IDDRVCSFAA
INSLIEVDND HLLKSDSFSL VGLFDNEEIG SATRQGIKGG LTESVITRVI SSNYFNPKSY
DVQEQIRLTY ANTIILSADV NHLLNPNFAN VYLEHHKPVP NTGVTIALDP NGHMATDSTG
LALVEELAKL NGDTLQYFQI RNDSRSGGTI GPSISLQTGA RTIDLGIPQL SMHSIRATVG
TKDIGLGVKF FAGFFSNWRK TYDSYKDL
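
The record lists translation table 12, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates how that affects the translation of short fragments copied from the gene sequence above. It is a hedged illustration, not the database's own pipeline: the helper names (`build_table`, `translate`) are hypothetical, and table 12 is modelled here simply as the standard table with CTG reassigned to S.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(aas: str) -> dict:
    """Map each of the 64 codons to its one-letter amino acid."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, aas))


TABLE_1 = build_table(STANDARD_AAS)
# Assumption: table 12 differs from table 1 only in reading CTG as serine.
TABLE_12 = dict(TABLE_1, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate in frame 0, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "ATGTTGAAGA AGTTGTCCAC GGCAATTGAA"  # start of the gene sequence
ctg_fragment = "GCTGGCTTTCTGTTC"               # codons 76-80, contains CTG
last_27 = "ACCTACGACA GCTACAAGGA CTTGTAA"      # final line, ends in the stop codon TAA

print(translate(first_30, TABLE_12))      # MLKKLSTAIE
print(translate(ctg_fragment, TABLE_12))  # AGFSF
print(translate(ctg_fragment, TABLE_1))   # AGFLF
print(translate(last_27, TABLE_12))       # TYDSYKDL
```

With the CTG reassignment, residues 76-80 come out as AGFSF, matching the protein sequence listed above; the standard table would give AGFLF instead.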