Gene PICST_32792 details


Gene Information

Locus tag: PICST_32792
Symbol:
ID: 4840028
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 648024
End bp: 650330
Gene Length: 2307 bp
Protein Length: 768 aa
Translation table: 12
GC content: 44%
IMG OID: 640391343
Product: predicted protein
Protein accession: XP_001385474
Protein GI: 150866015
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
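
The Gene Length and Protein Length fields follow from the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the last codon of the CDS is a stop codon. Both points are assumptions made for this sketch, not statements from the record, and the function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted as an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Coordinates taken from the record above.
print(cds_lengths(648024, 650330))  # (2307, 768)
```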


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.980986
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0911162
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGGCT TGTTAGACGC AGAAGCCAGT GAAGATGACG AAGAAGATGA GGAAAATAGT 
GAAGATGATG AAGAGGAAGA CTCTGACAAG GAGTTGAACG AGTTGTTGGG AGAGGAAGAA
GACCCAAGCG ACTACGACTC AGAAAATTTC TCCGATGAAC CTCAGGAATC TGATACTAGA
TCTATAACCG ATGCCATCTC TGGTGTAAAG ATCAGATCTC TTTCAGACAT TTCTTCCCAA
GACAAACAAG AATTCCACAC CAAGTATTCT GACGGTTCGG AAAGAATCAT CAAGCCGGAA
ATAGAGCCTG TGTATGACAG TGACGACAGT GATGCCGAAA ACTTCAATAC CATTGGGGAC
ATTCCCTTGT CTGCCTACGA TGAAATGCCA CATTTGGGTT ACGACATCAA CGGGAAGAGA
ATAATGAGAC CAGCCAAGGG TTCTGCTCTC GATCAATTGT TGGAGTCTAT TGACTTGCCT
CAGGGCTGGA CCGGACTTTT GGACCAAAAC ACTGGTTCAT CTTTGAACTT GACAGATGAA
GAGTTGGAAT TGATCAGAAA GATCCAACAA CAGGAAAACA CAGATGCAAA TATCGATCCA
TACGAAGCTA CCATCGAGTG GTTCACATCT AAGGTGGAAG TAATGCCCTT AACAGCAGTT
CCTGAGCCAA AGAGAAGATT TGTTCCTTCC AAACACGAAG CCAAGAGAGT TATGAAGATT
GTCAGGGCCA TCAGGGAAGG AAGAATTATT CCTCCAAGTA AGGTGAAGGA GCAGATTGAA
GAAGAAAGAC TCAACTATGA CTTGTGGAAT GACGATGACA TCGCTGTTGA AGACCATATC
ATGAACTTGA GAGCTCCTAA ATTGCCTCCT CCAACCAACG AAGAATCCTA CAACCCCCCT
GAGGAATACC TTTTGACCGA AGAAGAGAAA AAGCAATGGG AACTGTTGGA TCCTGCTGAC
AGAGAAAGAA ATTTCCTTCC TCAGAAGTTT GGAGCTTTGA GAAAAGTTCC TGGCTACCAA
GAAAGCGTGC GTGAAAGATT CGAAAGATGC TTGGACTTGT ATTTGGCTCC TAGAGTTCGT
CACAATAAGT TGAACATTGA TCCCGAAAGT TTGATTCCTG AATTACCCTC TCCAAAGGAT
TTAAGACCAT TCCCTATCCG TTGTTCTACC GTCTACCAGG GCCATACTGA CAAGATCAGA
ACTATTTCTA TTGATCCTCA AGGCTTGTGG TTGGCCACTG GTTCAGATGA TGGTAGTGTC
AGAATTTGGG AAATCTTGAC AGGAAGACAA GTGTTCAATG TTCAGTTGAT CAACAAAGAA
ATAAACGACG AAGACCATAT CGAGAGTTTG GAATGGAACC CAGACTCCCA AACCGGGATT
TTGGCTGTCT GTGCTGGTGA GAACATCTAC TTGGTTGTTC CACCAATTTT CGGCTTTGAT
ATCGAAAACA TGGGTAGATT GAGAATCGAA TCCGGTTGGG GTTATGACAC TTTTGGTAAC
AAGACCAAGG AGGAAAAGTT CAAGAATGAC GAGGGCAATG AAGATGAAGA TGACGAAGAT
GATAGTGCCA CTTCCACTGC TGTCAAGAAG GACGTAGCCA GATGGTTTCC TCCAAATCAG
GAACAGACCA AGCTCGGTAT ATCTGCCATT ATCCAGTGTC GTAAGACTGT CAAGAAGGTG
TCGTGGCATA GAAAAGGAGA CTACTTCGTC ACCGTGTCTC CAGATAGCAA GAACACAGCC
GTATTGATTC ATCAATTATC CAAGCATTTA TCCCAATCTC CATTCAAGAA GTCCAAGGGT
ATCATCATGG ACGCCAAATT CCATCCATTC AAACCACAAT TGTTTGTAGC CTCGCAACGT
CAAGTGAGAA TCTACGACTT GGCCCAACAA GTATTGGTCA AGAAGTTGAT GCCAGGTGTG
AGATTGTTGT CTACCATCGA TATACACCCT AGAGGTGACA ACTTGTTAGC ATCTTCTTAC
GACAAGAGAG TATTGTGGCA CGACTTGGAT TTGAGTGCCA CTCCTTACAA AACTTTAAGA
TACCACGAGA AGGCAGTCAG ATCAATCAAG TTCCACAAGG GTAACTTGCC GTTGTTTGCA
TCTGCCTCTG ACGATGGTAA CATTCATATT TTCCACGGTA CCGTGTACGA CGACTTGATG
ACTAACCCAT TGTTAGTGCC TTTGAAGAAG TTGACTGGTC ACAAGATTGT GAACAGTATT
GGTATCTTGG ATTTGATTTG GCATCCAAAG GAAGCCTGGT TATTCAGTGC CGGTGCTGAT
GGAACCGCTC GTCTCTGGAC AACCTGA
 
Protein sequence
MDGLLDAEAS EDDEEDEENS EDDEEEDSDK ELNELLGEEE DPSDYDSENF SDEPQESDTR 
SITDAISGVK IRSLSDISSQ DKQEFHTKYS DGSERIIKPE IEPVYDSDDS DAENFNTIGD
IPLSAYDEMP HLGYDINGKR IMRPAKGSAL DQLLESIDLP QGWTGLLDQN TGSSLNLTDE
ELELIRKIQQ QENTDANIDP YEATIEWFTS KVEVMPLTAV PEPKRRFVPS KHEAKRVMKI
VRAIREGRII PPSKVKEQIE EERLNYDLWN DDDIAVEDHI MNLRAPKLPP PTNEESYNPP
EEYLLTEEEK KQWESLDPAD RERNFLPQKF GALRKVPGYQ ESVRERFERC LDLYLAPRVR
HNKLNIDPES LIPELPSPKD LRPFPIRCST VYQGHTDKIR TISIDPQGLW LATGSDDGSV
RIWEILTGRQ VFNVQLINKE INDEDHIESL EWNPDSQTGI LAVCAGENIY LVVPPIFGFD
IENMGRLRIE SGWGYDTFGN KTKEEKFKND EGNEDEDDED DSATSTAVKK DVARWFPPNQ
EQTKLGISAI IQCRKTVKKV SWHRKGDYFV TVSPDSKNTA VLIHQLSKHL SQSPFKKSKG
IIMDAKFHPF KPQLFVASQR QVRIYDLAQQ VLVKKLMPGV RLLSTIDIHP RGDNLLASSY
DKRVLWHDLD LSATPYKTLR YHEKAVRSIK FHKGNLPLFA SASDDGNIHI FHGTVYDDLM
TNPLLVPLKK LTGHKIVNSI GILDLIWHPK EAWLFSAGAD GTARLWTT
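
The record lists translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. That explains why the gene fragment AAGCAATGGG AACTGTTGGA TCCTGCTGAC corresponds to KQWESLDPAD in the protein sequence. The following is a minimal Python sketch of such a translation, not the tool that produced this record; the names `codon_table`, `TABLE_12` and `translate` are hypothetical, and the only difference from the standard code that it models is the CTG reassignment.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(overrides=None):
    """Build a codon -> amino acid map, optionally reassigning codons."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AA))
    table.update(overrides or {})
    return table


STANDARD = codon_table()
TABLE_12 = codon_table({"CTG": "S"})  # CTG -> Ser instead of Leu


def translate(dna, table):
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


fragment = "AAGCAATGGG AACTGTTGGA TCCTGCTGAC"
print(translate(fragment, TABLE_12))  # KQWESLDPAD
print(translate(fragment, STANDARD))  # KQWELLDPAD
```

Applied to the final line of the gene sequence, GGAACCGCTC GTCTCTGGAC AACCTGA, the same function returns GTARLWTT, which matches the end of the protein sequence.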