Gene PICST_47790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_47790 
Symbol 
ID4840410 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1292416 
End bp1293486 
Gene Length1071 bp 
Protein Length169 aa 
Translation table12 
GC content40% 
IMG OID640391725 
Productpredicted protein 
Protein accessionXP_001385944 
Protein GI126138842 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5078] Ubiquitin-protein ligase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00101457 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTACAC CAGCCAAAAG AAGACTCATG CGTGATTTCA AGGTAAGTTC TACTTTTGTA 
TCCCGAAAAA GGCCCGAGTC CAAAGGTAAC TTTTGCCATA ATTAATGGTC TTCTGACCAG
CGGCAACAAG GGTTTGTTGC CAGAAAGTTA CAGTGAAAAT GAAAAATAAG GCCATAATAC
TAACACCAAC TAGCGCATGC AACAGGATGC TCCCTCCGGT GTCAGTGCTT ATCCGTTACC
GGACAATGTC ATGACATGGT ATGTGATTAC AAATGAGAAA GGATAAAAAG AGAAAGGACA
TTAAAAGATT CATGAGGAGT GGGAGATATT GAGTTATGAA TAAATAAGAA ACATGACGAG
ATGAAAGTCA TATGAAGAGC AGGAGAATGA ACTTAATATG GAAAATGAAT GGATGACTAA
TGAATGAAAT TTATCCGGAA GAACAAGACA CGAATGAAAA ACTAAAGAAA TGATAGTCAT
TATCTACATG TTTATAAGGT ACGCAACATC CAAAATGATA TGATCCTAGA TTTTCGTATC
CCTGAGACGC CAATCCACAT TCACTTTCAA TACTATAAGC ATCACAGTTG AAGAGAAAAA
TATTCATCCC CCCCCCATAT CTTTTTTTGC TTTCTTCTAC TTATTCCATT ACTTTACTAA
CCTATTAGGA ATGCTGTTAT CATTGGTCCT TCAGATACAC CCTTTGAGGA TGGAACATTC
CGTCTTGTGC TCCAGTTTGA CGAGCAATAT CCCAACAAGC CTCCTACCGT CAAGTTTATA
AGTGAGATGT TCCATCCCAA CGTATATGGA TCAGGAGAGT TGTGTTTGGA TATTCTTCAA
AACAGGTGGT CTCCAACTTA CGATGTATCA TCAATCTTGA CGTCTATCCA GTCGTTGCTT
AACGACCCCA ATATCAGCTC ACCGGCTAAC GTTGAAGCTG CCAATTTGTA TAGGGACCAT
AGGTCGCAGT ACGTGAAGCG AGTGAGAGAA ACTGTAGAGA ATAGCTGGAA TGAAGATGAC
TTTGATGACG ATGACGATGA CGATGACGAT GAAGACGACG AGATCGAGTA G
 
Protein sequence
MSTPAKRRLM RDFKRMQQDA PSGVSAYPLP DNVMTWNAVI IGPSDTPFED GTFRLVLQFD 
EQYPNKPPTV KFISEMFHPN VYGSGELCLD ILQNRWSPTY DVSSILTSIQ SLLNDPNISS
PANVEAANLY RDHRSQYVKR VRETVENSWN EDDFDDDDDD DDDEDDEIE