Gene PICST_46961 details

Gene Information

Locus tag: PICST_46961
Symbol:
ID: 4839104
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 628569
End bp: 629585
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 12
GC content: 38%
IMG OID: 640390419
Product: predicted protein
Protein accession: XP_001384787
Protein GI: 126136527
COG category: [R] General function prediction only
COG ID: [COG1310] Predicted metal-dependent protease of the PAD1/JAB1 superfamily
TIGRFAM ID:
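
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 628569
end_bp = 629585
gene_length_bp = 1017
protein_length_aa = 338

# Assumption: both coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# as a residue, so the protein has one residue fewer than codons.
codons, remainder = divmod(gene_length_bp, 3)
coded_residues = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", remainder == 0)
print("codons minus stop matches protein length:", coded_residues == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 629585 - 628569 + 1 = 1017 bp, and 1017 / 3 = 339 codons, one of which is the stop.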


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGCAC CAACAGCAAG TGAATTAACA TTCTTGAGTA AAACCGTTTC TGTTTCACCA 
TTGGTGTTAT TATCAGTTGT CGATCACTTT AACAGAGTTG CCAAAGACTC GAAGAAAAGA
GTAGTTGGCG TTATTTTAGG CGACAACTCC ACTGATTTGA TCAAAGTCAC AAATTCATAC
GCCATTCCCT TTGAAGAGGA CGAGAAAAAC CCCAGTGTTT GGTTCTTGGA CCAGAATTTT
ATAGACTCCA TGGGCGATAT GTTCAAAAAG ATTAATGCAA AGGAAAAATT AATCGGTTGG
TATCACTCTG GTCCCAAGCT AAGACCATCA GACTTGAAAA TTAATGATGT GTTCAAGAAA
TATACTTCAA ATCCATTACT ACTTATCGTG GATGTACAGC CAAGAGAAGT AGGTATTCCT
ACTGATGCTT ACTTCGCAGT AGATGATATT AAGAACGATG GTTCCGCTGC TGAAAAGACA
TTTGTTCATG TTCCATCGCT TATTGAAGCA GAAGAGGCTG AAGAAATTGG TGTTGAACAC
TTATTGAGAG ACATCAGAGA TCAAGCGGCA GGAAACTTGT CCTTGAGAGT TACACAGACA
TATCAATCCT TGTTGGGATT GCACCAAAAG CTTAAAGAAA TTGCCAATTA CTTGGACAAA
GTCTACCAAA AAAAGCTCCC TATAAATCAT ACCATTTTGG GAAAATTGCA GAACGTGTTC
AACTTACTAC CAAACCTATC TAATTCCAAC TTGGTTGGAG GCGAAGGCGT TGTAGATTCA
CAAACACCAA GCCAATCAAG TAATCCTTTG TCGGCAGCAT TTACGATTAA GACGAATGAC
GAGTTAATGA TCGTCTATAT AAGTACACTT GTCAGAGCAA TCATTGCTTT CCATGATTTG
ATTGAGAATA AGCTTGAAAA CAAGAAGCTT AACGAAAAGA AATCGTCCTC TGAACTTGAA
ACTGGCGTTA TTTCTCTCTT AAGTAATGAA GAAAAGGGTG AAAGTACACA AGAATAG
 
Protein sequence
MSAPTASELT FLSKTVSVSP LVLLSVVDHF NRVAKDSKKR VVGVILGDNS TDLIKVTNSY 
AIPFEEDEKN PSVWFLDQNF IDSMGDMFKK INAKEKLIGW YHSGPKLRPS DLKINDVFKK
YTSNPLLLIV DVQPREVGIP TDAYFAVDDI KNDGSAAEKT FVHVPSLIEA EEAEEIGVEH
LLRDIRDQAA GNLSLRVTQT YQSLLGLHQK LKEIANYLDK VYQKKLPINH TILGKLQNVF
NLLPNLSNSN LVGGEGVVDS QTPSQSSNPL SAAFTIKTND ELMIVYISTL VRAIIAFHDL
IENKLENKKL NEKKSSSELE TGVISLLSNE EKGESTQE
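
The record lists translation table 12, which is NCBI's alternative yeast nuclear code: it matches the standard code except that CTG is read as serine rather than leucine. The sketch below is an illustration rather than the database's own tooling. It shows one way to translate a gene sequence formatted as above with that table and to compute a GC percentage. The helper names (`strip_layout`, `translate`, `gc_percent`) are hypothetical, and only the first display line of each sequence is embedded to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# NCBI translation table 12, amino acids listed in TCAG codon order.
# Identical to the standard table except that CTG encodes S instead of L.
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def strip_layout(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 with table 12, stopping at the first stop codon."""
    dna = strip_layout(dna)
    residues = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = strip_layout(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First display line of the gene and protein sequences above.
first_gene_line = "ATGTCAGCAC CAACAGCAAG TGAATTAACA TTCTTGAGTA AAACCGTTTC TGTTTCACCA"
first_protein_line = "MSAPTASELT FLSKTVSVSP"

print(translate(first_gene_line) == strip_layout(first_protein_line))
```

Passing the full 1017 bp gene sequence to `translate` and `gc_percent` in the same way would allow the Protein Length and GC content fields to be compared against the sequence itself.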