Gene PICST_30636 details


Gene Information

Locus tag: PICST_30636
Symbol:
ID: 4838250
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 565684
End bp: 567384
Gene Length: 1701 bp
Protein Length: 512 aa
Translation table: 12
GC content: 43%
IMG OID: 640389565
Product: predicted protein
Protein accession: XP_001383387
Protein GI: 150864535
COG category:
COG ID:
TIGRFAM ID:
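
The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that happens to reproduce the reported 1701 bp) and that the protein length excludes the stop codon. The helper name `gene_span_bp` is hypothetical. Under those assumptions the genomic span is longer than the coding sequence needed for 512 residues, which is consistent with the record marking the gene as spliced.

```python
def gene_span_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


# Values copied from the record above.
start_bp, end_bp = 565684, 567384
protein_aa = 512

span = gene_span_bp(start_bp, end_bp)   # genomic span of the gene
cds_bp = 3 * (protein_aa + 1)           # codons for 512 residues plus one stop codon
non_coding_bp = span - cds_bp           # bases not accounted for by the CDS

print(span, cds_bp, non_coding_bp)      # prints: 1701 1539 162
```

The 162 bp difference is what one would expect to be intronic sequence, although the record itself does not list exon boundaries.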


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.931484
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATACGA TTCCCGTCTT TGGATCCTCG GTTCAAGAGG CCGTACAGGC CACAGTTCGT 
CTCGGTAAAC CACTCTTTGT CTTCTTATCG GTGAACTCGG AAGAGAATCT GGCTACGTTC
CTCGAGCAGT TCTTCCACAG CCAGGAGGCA ATAGATAGCG AAATAGGACA ACTTGTCACG
GAGTCATTTG TGACATTGAA GCTTGTAGAA GACACGGTAG AGTTTGGCTA TTTCCAGCAG
ATATTTTCGA ACTTGATTGT TCCTAGCTTC TATATTATCC AGAATGGAAA ATTGCTAGAC
GTGATTTCTG GAGAGACCAC TGAAGCACAA TTTGTCGAAA AAGTAACCAA TGTAATTATA
GCTGAAACAA ACCAAATCTC AACAGGTCCA GAAACATCAC AAACTGATAC AAATGGTGCA
AATGCCGCTA ATATTTCTAA TTCTTCCATT TCATCGGTTC CTGCTCCTAA CACAGCCACA
CAAATACATT CAGATCCAGT ATCAGATACA TCTGCAACAA ATTCGACAGT CGCAAATGTA
GATCTGCCAA CAGCTTCTTC ATCTTCGGAT AACCAGAGAC ATTCTCAAAG CCCGGTTGGA
CAAGTGGAAA CTGTCAAATC TACCAAGCCA CAATCAGCCC ATGACAGAAC AGCTTCGGAG
TACCATAAAC AATATCTAGC TTCTAGAAAG AAGCAGGAAG AAGAGAGACT CAGGCTTCGA
GTACTCCTTC AGGCGGATCA AAAAGAAAGA CTCTCGAGAC AAAGAGAGAT GGACGAAATT
CTTCATGGTT CAGAATCAAC TTCTCCACAG CCCAAATCAC AATCTCCAGC ACACCCAGCA
CAACATGATG TGTGTTTTCT CTCAATAAAG CTTTTCGATG GAAGTTCATT GAAGCACGAG
TTTCTGTCAT CAGATACATT GAATACTGTA AGGGAGTGGT TGGATAAAGA AACAGAAATA
ATACCTCCCA CAGACTCCCT ACCATCTTTT GCAAGCTCTT CGTATCCGCA GCCTACAAAT
TACGCCTTTC ACCGCCCGAT ATTACCGAGA GAGACTTATA CAGATGAGCA AGAGTTCCAG
AAGCTTGTTG ACCTTGGATT GTGCCCTAGA TCTGCATTGA TCTTGAAGCC TATTTATGAC
GATAAGTACC TGAGTTCGTA TCCTACCAAT AAGACTTCAG GAGGTATATT GAGGGGTGTA
GGCGGAACTT TAGCCAGAGT AGGAAGTGCT TTATATTCGT TCTTTGATTA TGGGGTAGAT
GACACTCAGG AACATCAGCA TCAAGACTAT GATGAACCGG ATGGTTCCAG AAGTCCACGT
GATCCCACCA GTCCTTCCAG ACCTTCAGCT ACAGCATCTG GTTCGTCTCG TGTAGATTTC
CCTGTGAGAC CACCATTGTT TTCAATCGAC AATAACGTGC CTTCATCGTC TTCTCTCATC
AACATCTCGG AACCTGCGAA CAACAATTCG TCTTCTTCTT TACAACAGGA AGGGCCTACG
TCGTTCTTCA TTGACGAGTC TAATAATCCT TCAGTATACA ACAGTAGAGC CTCAACACCC
AAACCGTTAG GATTGTCGCT GATTAGTAGA GTCCAGACTA TTCATGATGA GCAAGATGAT
AAGGACAAGA AAGACGTGGA TACATATAAT GGTAACTCAG TGAACCTTCG TGGAAAGGAT
GATGAAGATA AGAGAGGTTA A
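
The GC content field can be recomputed from the gene sequence. This is a minimal sketch: `gc_percent` is a hypothetical helper, and how the database rounds its reported percentage is not stated in the record. The example runs on the first row of the sequence only; passing the full 1701 bp text works the same way, because whitespace is stripped before counting.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a DNA string; whitespace is ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGGATACGA TTCCCGTCTT TGGATCCTCG GTTCAAGAGG CCGTACAGGC CACAGTTCGT"
print(f"{gc_percent(first_row):.1f}% GC in the first 60 bp")
```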
 
Protein sequence
MDTIPVFGSS VQEAVQATVR LGKPLFVFLS VNSEENSATF LEQFFHSQEA IDSEIGQLVT 
ESFVTLKLVE DTVEFGYFQQ IFSNLIVPSF YIIQNGKLLD VISGETTEAQ FVEKVTNVII
AETNQISTGP ETSQTDTNGA NAANISNSSI SSVPAPNTAT QIHSDPVSDT SATNSTVANP
QSAHDRTASE YHKQYLASRK KQEEERLRLR VLLQADQKER LSRQREMDEI LHGSESTSPQ
PKSQSPAHPA QHDVCFLSIK LFDGSSLKHE FSSSDTLNTV REWLDKETEI IPPTDSLPSF
ASSSYPQPTN YAFHRPILPR ETYTDEQEFQ KLVDLGLCPR SALILKPIYD DKYSSSYPTN
KTSGGILRGV GGTLARVGSA LYSFFDYGVD DTQEHQHQDY DEPDGSRSPR DPTSPSRPSA
TASGSSRVDF PVRPPLFSID NNEGPTSFFI DESNNPSVYN SRASTPKPLG LSSISRVQTI
HDEQDDKDKK DVDTYNGNSV NLRGKDDEDK RG
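
The record lists translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below illustrates the effect on the first two rows of the gene sequence, which translate in frame to the first 40 residues of the protein. The codon table is built from the standard code with that one substitution. The function names are hypothetical. Translating the full genomic sequence this way would not reproduce the protein, since the gene is marked as spliced and intron positions are not given in the record.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg_as_serine: bool) -> dict:
    """Standard code, optionally with the table-12 reassignment CTG -> Ser."""
    codons = ["".join(c) for c in product(BASES, repeat=3)]
    table = dict(zip(codons, STANDARD_AA))
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(dna: str, table: dict) -> str:
    """Translate in frame 1, ignoring whitespace and any trailing partial codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(table[seq[i:i + 3]] for i in range(0, usable, 3))


# First two rows of the gene sequence, copied from the record.
excerpt = (
    "ATGGATACGA TTCCCGTCTT TGGATCCTCG GTTCAAGAGG CCGTACAGGC CACAGTTCGT "
    "CTCGGTAAAC CACTCTTTGT CTTCTTATCG GTGAACTCGG AAGAGAATCT GGCTACGTTC"
)

print(translate(excerpt, codon_table(ctg_as_serine=True)))
# MDTIPVFGSSVQEAVQATVRLGKPLFVFLSVNSEENSATF  (matches the listed protein)
print(translate(excerpt, codon_table(ctg_as_serine=False)))
# MDTIPVFGSSVQEAVQATVRLGKPLFVFLSVNSEENLATF  (standard code gives Leu at CTG)
```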