Gene PICST_33594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33594 
Symbol 
ID4840757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp1059319 
End bp1060602 
Gene Length1284 bp 
Protein Length427 aa 
Translation table12 
GC content43% 
IMG OID640392072 
Productpredicted protein 
Protein accessionXP_001386208 
Protein GI150866565 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.258342 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAACA TCAGTTTCTC TCCAGATAGG AATGAACTGC CATTCACCCT CAAGCTTTAC 
CCTAATTCCA AGACAGACTC TTCTCCCAAT TCTAACACAC TTGAATCTGC TAACGATTCC
GATTCCAAAA CTTCCTATCC TTGGCTGTCT GTCACAGAGG GAGCAGGTGA GCTGTCGCCT
GGTTTCATCA GAGGAGTGTT CAAACTCAAA AGAAGAGCAT CTACTACACT ATCTCCTCCT
CCTTCACCTC CACATATTGA GGAAAAGCTT GAAGAGCCTG AACCAGTGCA TTTGGAGTTG
TACAATAAGT TCAAACAAGA ACCATTAGGA ATTTCTTTTC CTGTAGTTAA CTATGATTCT
GTCGGTGATC CAGGTGGATT GGGTATCAAC TTTGACCACC CCAGTCTTGA CTTCGACATC
AAACTTCCAA AGTTCGAGCT GTCGTCTACG ATTCTTAGCA GTGCAGGAAC CCCACCATCC
TCCAAGACAC CCACAAAGCA AAAATCGTCT TCTCCTTTTG AAAGGGTCAA ACAGTTGCTT
TACAGAAACC ATTGGCTGCA CCATGATATG AACAAACAAA GTGCTGAGTT TTCTGTGGCT
TCTTCCGAGT TCAGGTCTCC TGTGAACATG AGGATTCCTG TTAAAGAAGA TTGGAAAACA
GGAGTTGGGG TGCCAATAGA CATCTCGATC TTTCAGAACA ACGGTTTTGG ATCTCCCATA
AGCGAACAAT GTGTACAAGA TAATAAGAGT ATCAAGTTCA AACAATACGT GGATATGCTA
GTTTACAACA ACAAATTTCC TTACAGTTCA TCTACATCCT CCGAGAAAAT AGATCGCTCC
TGTGCGAGTC CAGTGTCAAA CGATTCATAC CAAGACAACT CCAATTGGAG CAATGCCTGG
CCAAAGAAGA GACTTCGCAA GAGTAAATCT GCGCCATTCA AGCGCATGGT TCCATTAGAG
TCCAGCTATG AGGGACTGGA GAACCCTAGT ACTAGAAGAA AAAGATCGAT ATCGTTATCG
CTTACGAGAA GAACTATCAA GAACATCTCC AATATTCCCG AGACACCAGC AAGTCGTCCC
GTAAGCTCAA TATTGAAAAA CAAGATAAAC TCCAATTTCG ACATCGAGCG CCAAAACACG
GTTCGGGATG ATACCGTCAA TATAGAGGCA TTCTTGAAAA ACTTTGAGAG ATTCGAGATG
GAAAAATACG AGAAGGAATC GCAATTGGAA CACCTCCGCC ATCAGCAGAT CCAGCACTAC
TACGAAAACG AGGAATTTGC TTAG
 
Protein sequence
MMNISFSPDR NESPFTLKLY PNSKTDSSPN SNTLESANDS DSKTSYPWSS VTEGAGESSP 
GFIRGVFKLK RRASTTLSPP PSPPHIEEKL EEPEPVHLEL YNKFKQEPLG ISFPVVNYDS
VGDPGGLGIN FDHPSLDFDI KLPKFESSST ILSSAGTPPS SKTPTKQKSS SPFERVKQLL
YRNHWSHHDM NKQSAEFSVA SSEFRSPVNM RIPVKEDWKT GVGVPIDISI FQNNGFGSPI
SEQCVQDNKS IKFKQYVDML VYNNKFPYSS STSSEKIDRS CASPVSNDSY QDNSNWSNAW
PKKRLRKSKS APFKRMVPLE SSYEGSENPS TRRKRSISLS LTRRTIKNIS NIPETPASRP
VSSILKNKIN SNFDIERQNT VRDDTVNIEA FLKNFERFEM EKYEKESQLE HLRHQQIQHY
YENEEFA