Gene PICST_81785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_81785 
SymbolCHS4 
ID4837165 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1950154 
End bp1952782 
Gene Length2629 bp 
Protein Length613 aa 
Translation table12 
GC content47% 
IMG OID640388480 
Productchitin synthase regulatory factor 
Protein accessionXP_001383145 
Protein GI150864366 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.757769 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACAC ATCCATACCG CCAGAAGGCG CCTTCATCGG CTCCGTACCC CGTAGAGGGC 
GATTTCGCCA CTACCGGTGA CTACGCTCTT CCCCATATCC TGACCGCGTC TGAGGCAACA
TTGGCTCCTA CCAAAAACGT CTCGTCTTCT TCTGTCAACA AATACAACGG AAGCAACACA
AATCTTGCCA GCGACACGCC TACCATCGCC GATTCCACGA ACTCATCTTC CCACGGCTCT
TCTTCGAACT ACCAGACCCC TGTAGATGAA ACTGCTGCAA ACGTAGAAAA CGTAAATGTA
GGAAGTGTGG CCAGTAACGG CCTTTCTCAC CACAATAGTG GTTCCATTCT CCAGTCGCCA
AAGCTCCACA ATTTGAGCAG TATCACCAAC AATACTAGCA ACACCAGCAG CATCCATGGA
AAATCTCCCA TTCCTGCCAG TGTCGCAACT CCATACCCCC AGTACGACTC GCTGACTCCT
CCTCCTGTAT TCGCATCCAA CACTGCTGTA GTTTCACCTC CATCCAATCC GCTTTCAGCT
ACTTCGTCTG TCACCAACTT CAACTTGAAC CAGCAGACAC CAAACGTCGA TCTGATGTTC
TCCAAGTCGG AAAACAACTT GCCTATGCCA CAGCAAAAGT ATGGCCACAG CCGTTCAGTA
TCGTCTACTT CATCTTTCTT CTACGACAGA GACAACGCGT CGATGGTCGA TTTCAGCCAG
AACATAATCC AGCTGTACCT CGGCTCAAAC TCGACGCATC TCATGCCGCG AATCAAAACA
ATTGAGTTAT ACCGAAAAAA CGCCAAAAAG TCCAACGACC CTACCGTTTT GTTCCAGTAC
GCTCAATACA TGCTTCAAAC AGCCCTTTTG TTGGATGCAG AACCTTCTAA TTTAGGTGTA
GGAGGGGGTA GTAGCAATCA TGGCTCTCAG GGTTCCCAGG GCTCCTCTAG TCAGGGCAAC
ACTCCCTCGC AGTCGATAGA AAACCTGCCT AGGAAAGAGC TCTTCAACAA GAACCTGAAT
GGCCTGTCTC TTTCGATCTC AAAGACTCAC AAGAAGTCGA AATCGGATAC TCTTGATCTC
GGTACCGAGT TGGGGGACGG TTCTACCGTA GACGACAAGC GTTTAAAGCG TGCTCTCTTA
AAGGAAGCCG TTCACTACTT GAAAAAGTTG TCGGACAAAG GTTATGTAGA TGCTCAGTAC
TTGTTGGCTG ACGCGTACAG TTCTGGTGCT TTGGACAGGG TTGAGAATAG AGAAGCATTT
GTGTTGTTCC AAGCAGCTGC TAAACATGGC CATATCGAGT CGGCTTACAG AACCTCCTAC
TGTTATGAAG AGGGTTTAGG AACAGGCAGA GACTCCAGAA AGTCCATCGA GTACTTGAAG
ATGGCAGCCT CTAAAAACCA CCCGGCATCC ATGTACAAGT TGGGTATCTA CTCGTTCTAC
GGCAGAATGG GTATGCCCAA CCATATCAAC ACCAAGAAGC TGGGTATTAA ATGGTTGGAA
AGAGCATCTA ATGTAGCAAA CGAGCTTGTA GCTGCGGCTC CGTTTGAATT GGGTAAGATC
TACTACAACG GGTTCCAGGA CATTGTCATT GCTGACAAGA AGTATGCCTT GGAATTGTAC
TCGCAGGCTG CAGCTTTGGG CCATGTACAA TCTGCTGCTC TATTGGGTCA GTTCTACGAA
GTTGGAGAAA TTGTCCCACA AGACAATAAC TTGTCTATTC ACTACTATAC ACAAGCTGCA
TTGGGCGGCG ACCCTGAATC TATGTTGGCT ATGTGTGCCT GGTACCTTAT TGGCAGCGAT
CCTTTTTTGC CTAAAGATGA GAATGAAGCC TTTGAGTGGG CCAAGAGAGC AGCCGTCTGC
AACTTGCCAA AGGCACAATT CGCATTGGCT AACTTCTACG AAAAGGGCAT TGGTTGCATC
AAGGACGATG AAGAGGCACA GACTTGGTAC AAGCGTGCAG CAGAAGGTGG TGAGGAGAAG
TCGCTAGAAA GGATTTCAGA TCAGGCTGCG GCTACGAAGT TGCGCCGACA AATCGGCAAG
AAAAGGTCAG GTACTGTGGG CAGTAGGAAT GTCAATTCTG TGGAAGGAGG AGTAGCCAAA
GGCAGTGCTG CGCAAGAGAA GGATTGTGTC ATCATGTAGA GCTACAAATC TAACAGTATG
CTGACAAGCT GCATAAACAG CGACCTAACT AGCTAACCGA GGCTAAAATT ACCATTGGCA
TATTGCAGGA GTTACATTAT TGCAAGTGCC AATCTGGAAC GGGATCAAAA TTTTATACAC
TACAACGTGT TTTACCATCA TCGAGAGCTT GACTTCGAAA GCTAGGTTTT CATTCGGGGG
ATTAATAAAT TCACTAATTT CATATATGGA TCAGGCATAT ACAGTTTCAT TCACTTGCCA
AAGAACAAGG GATAGCAATC CTTGGTAGAT TGTGAAACAT ACTGCACACT CAGAGCGGTG
CCTGGAGAAA CAGGAATGAG ACACGTCTAC CAAACGGAAC AAAAATAGAT AAAAGCTTGT
TTGATTTTGC AAAATCTGGA TTCTCGTACT TCTCACCAGT CAGTAACCGA ATTGTGTCTG
TCTGATATTC ATATACTCAT CTTCTTAGTT AATATATTTA TTTTGTTGC
 
Protein sequence
MSTHPYRQKA PSSAPYPVEG DFATTGSVAS NGLSHHNSGS ILQSPKLHNL SSITNNTSNT 
SSIHGKSPIP ASVATPYPQY DSSTPPPVFA SNTAVVSPPS NPLSATSSVT NFNLNQQTPN
VDSMFSKSEN NLPMPQQKYG HSRSVSSTSS FFYDRDNASM VDFSQNIIQS YLGSNSTHLM
PRIKTIELYR KNAKKSNDPT VLFQYAQYML QTALLLDAEP SNLGGSSSQG NTPSQSIENS
PRKELFNKNS NGSSLSISKT HKKSKSDTLD LGTELGDGST VDDKRLKRAL LKEAVHYLKK
LSDKGYVDAQ YLLADAYSSG ALDRVENREA FVLFQAAAKH GHIESAYRTS YCYEEGLGTG
RDSRKSIEYL KMAASKNHPA SMYKLGIYSF YGRMGMPNHI NTKKSGIKWL ERASNVANEL
VAAAPFELGK IYYNGFQDIV IADKKYALEL YSQAAALGHV QSAALLGQFY EVGEIVPQDN
NLSIHYYTQA ALGGDPESML AMCAWYLIGS DPFLPKDENE AFEWAKRAAV CNLPKAQFAL
ANFYEKGIGC IKDDEEAQTW YKRAAEGGEE KSLERISDQA AATKLRRQIG KKRSGTVGIA
KGSAAQEKDC VIM