Gene PICST_4493 details

Gene Information

Locus tag: PICST_4493
Symbol:
ID: 4838661
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 934969
End bp: 936315
Gene Length: 1347 bp
Protein Length: 449 aa
Translation table: 12
GC content: 48%
IMG OID: 640389976
Product: predicted protein
Protein accession: XP_001384483
Protein GI: 150865320
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
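
The coordinates and lengths in the record above can be cross-checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive; the record does not state its coordinate convention, so treat that as an assumption. The variable and function names are illustrative only.

```python
# Values copied from the Gene Information record above.
start_bp = 934969
end_bp = 936315
gene_length_bp = 1347
protein_length_aa = 449

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1


def lengths_consistent(span, gene_len, protein_len):
    """Return True if the span equals the gene length and the gene
    length is exactly three bases per protein residue."""
    return span == gene_len and gene_len == 3 * protein_len


print(span_bp)  # prints 1347
print(lengths_consistent(span_bp, gene_length_bp, protein_length_aa))  # prints True
```

Under that assumption, the 1347 bp span corresponds to exactly 449 codons, which matches the listed protein length.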


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0320757
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GCCAACGCAT TGGTTATCCC TGATGTCAAC TCATTCTTCA ACTTCAACCA GTTTGTGCCT 
CTGACTGTTG CTGTCAATGC TGCTCAACAA CAACCACAAC AGCTCTGGAC AGCTAACAAG
AACGAGGCTC CTGTTCTTGC AGCCGGTTCC AAAAGCGTAG CCATCCCAAA CAGATACATT
GTTATCTACA AGGAAGACGT TACCGAGGCC CAGAGAAACC ATCACAAGAA GTGGTTGATT
GCCGAACATA CGGAAATGGT TGCCACAGCC GGAATTCGTC CTTCGGTAGG TGTTCTCGAC
TTCTTCGATG TCGACACGCT ACTCCTGGGC TACTTCGGCT ACTTCACTCC CGAGATGCTC
CGCAAGATCC AGAAGGACCC TCGCATCAAG TTCATTGAGC AGGATACCGT AATGAAGGTC
AATGAGTTCG ACGTCGAGAA AGATGCCGAA TGGGGTTTGA GCAGGATTTC ACATCGTGAA
AGCAGCCCTC AACTCGAATA CCTTTACGAT AATGAAGGTG GCAAGGGTGT CACTGCTTAC
GTCATTGACA CCGGTATCAA AGTTGAACAC GAAGAATTCG AAGGCAGAGC CCTGTGGGGT
GAAGCCGTGG CTTTCCCCAA GTTGAAGATT GATGGACACG GCCATGGAAC CCACTGTGCT
GGTATCATTG GATCCAAGAC GTATGGTGTA GCTAAGAATG TTGAATTAGT AGCTGTAGGT
GTTATGAACT TGTTGGGTAG TGGTACGACC TCAGACATCA TCAAGGGTGT CGAATTTGTT
GTCGGCGACC ATAAATCAAA CTTCCTGGCA AAGAAGAAGG GCTTCAAGGG CTCCACAGTC
AACATGTCTA TTGGTGGAGG AGAATCTGAA GCTTTGGACT TGGCTGTTAA TGCTGCTACC
AAGGCTGGCT TGCATGTAGC TGTAGCTGCT GGTAACGACA ATGCTGACAC TTGTACTTTT
TCTCCAGCAA GAGCCAGCGG ACCAATAACC GTAGGAGCTT CTGATATCAA CGACAACAAG
GCTGAATTCT CCAACTGGGG TTCTTGTGTA GACATCTTCG CACCTGGGGT TGACATTGTT
TCTACATACA TCTGGAGCAA CACTGCTTCT ATGTCTGGTA CTTCAATGGC TTCTCCTCAC
ATTGCTGGAT TGCTTTCGTA CTACTTGTCG CTCTACCCTG AGCCTGAATC CGAGTACAGC
GTAGCTGTAT TGGACCCAGC AACCTTGAAG GACAAGGTGA TCAAGTATGC CACCAAGGGC
GTTATAAAGG GCTTGAAGAA TGACGGTTCG CCTAACTTAT TGGCCTTCAA TGGTGCTGGC
GCCAATATCA CCGACTTCTG GAGCTTA
 
Protein sequence
ANALVIPDVN SFFNFNQFVP STVAVNAAQQ QPQQLWTANK NEAPVLAAGS KSVAIPNRYI 
VIYKEDVTEA QRNHHKKWLI AEHTEMVATA GIRPSVGVLD FFDVDTLLSG YFGYFTPEML
RKIQKDPRIK FIEQDTVMKV NEFDVEKDAE WGLSRISHRE SSPQLEYLYD NEGGKGVTAY
VIDTGIKVEH EEFEGRASWG EAVAFPKLKI DGHGHGTHCA GIIGSKTYGV AKNVELVAVG
VMNLLGSGTT SDIIKGVEFV VGDHKSNFSA KKKGFKGSTV NMSIGGGESE ALDLAVNAAT
KAGLHVAVAA GNDNADTCTF SPARASGPIT VGASDINDNK AEFSNWGSCV DIFAPGVDIV
STYIWSNTAS MSGTSMASPH IAGLLSYYLS LYPEPESEYS VAVLDPATLK DKVIKYATKG
VIKGLKNDGS PNLLAFNGAG ANITDFWSL
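
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The following is a minimal sketch of how the protein sequence above relates to the gene sequence under that table. The function names `translate_table12` and `gc_percent` are hypothetical helpers written for this illustration and are not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of NCBI translation table 12, listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...). It differs from the standard code in
# one codon: CTG encodes serine (S) instead of leucine (L).
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate_table12(dna):
    """Translate DNA in frame 1 using table 12.

    Whitespace (such as the 10-base grouping above) is ignored and a
    trailing partial codon is dropped. Ambiguous bases such as N are
    not handled and raise KeyError.
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE_12[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(dna):
    """Return the percentage of G and C bases in a non-empty sequence."""
    dna = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 90 bases of the gene sequence above.
fragment = (
    "GCCAACGCAT TGGTTATCCC TGATGTCAAC TCATTCTTCA ACTTCAACCA GTTTGTGCCT "
    "CTGACTGTTG CTGTCAATGC TGCTCAACAA"
)
print(translate_table12(fragment))  # prints ANALVIPDVNSFFNFNQFVPSTVAVNAAQQ
```

The 21st codon of the fragment is CTG, and it appears as S in the listed protein sequence (`STVAVNAAQQ`). The standard code would give L at that position. Passing the full gene sequence to `gc_percent` would give a value to compare against the listed GC content.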