Gene PICST_34942 details


Gene Information

Locus tag: PICST_34942
Symbol:
ID: 4837287
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 777944
End bp: 779767
Gene Length: 1824 bp
Protein Length: 471 aa
Translation table: 12
GC content: 44%
IMG OID: 640388602
Product: predicted protein
Protein accession: XP_001382920
Protein GI: 150864195
COG category: [K] Transcription
COG ID: [COG5576] Homeodomain-containing transcription factor
TIGRFAM ID:
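
The start, end, and gene length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it is the reading under which the listed values agree); the function name `gene_length` is hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values taken from the record above.
print(gene_length(777944, 779767))  # 1824
```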


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.442314
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTCGG ATGAAAACAG CACAACCTCC AACAAGAGGA CTAGAGCATC TGGTGAGGCT 
CTCGATTTCT TGCTCCAGGA ATTTGAAAAA AACCCCAACC CTTCCACAGA GCAACGGAAA
GATATCTCCA GCAAGACGAA CATGTCCGAA AAGGCCGTTC GGATCTGGTT TCAGAACCGT
AGAGCCAAAC TCCGGAAATT TGAACGGTTG AACCGTTTGC AGACTGGAGG TTCAAGCATT
CACTCGTCCC GTTCCAATAG CATCAGCAAT ATAAGCCCTA TCCACCCCAA CTATGGAAAC
CAGGCTATTC CCATCGAGAT CAACGAAAAG TACTGCTTTG TTGACTGCAC TTCACTTAGT
GTAGGCTCCT GGCAGAGGAT TAAGTCAGGT TACCATGACG AAAGACTGCT CCGTAACAAC
CTCATCAACT TGTCGCCCTT CACGATTAAC TCGGTAATGA CCAGTGTAGA TTTGCTAGTG
ATTTTGTCTA AGAAAAATTG CGAAATTAAC TATTTCTTCC TGGCCATCTC CAACAACTCC
AAGATCCTCT TCCGCATATT TTATCCTATA TCTTCTGTTG CTACCTGTTC TTTGCTCGAT
AATAATATCA CCAAAGAGAA CAGCGAGCTT CGTGTTAGTT TGACTCACCA GCCCAAGTTT
TCGGTGTACT TCTTCAACGG AATCAACTCA CAAGCTAACC AGTGGTCCAT CTGTGACGAT
TTCAGCGAAG GTCAACAAGT CAGTCAGGCT TACACTTCAG AAGGAGGTAC GTCCATTCCT
CACGTGTTGG TCGGAATCAA GAGCTCTTTG CAGTACTTGA ACTCGTTTAT AGCTGACAAC
AATAACCTGA CCTACTCACA ATTCCCGACC TCGGTAACAC CATCATTCCA ACAACCATTC
CAAGAAGACA ATACAAGCAG AAATTCCAAC ATCAATAATA CCAACAATAC CAGCAATATC
AATACTACTA AAATCAATAA TAATAATCAT GATTTCTTCA GTACGGAAGA TTTGCTCTGG
GATGAAACCT CATCTTTGGC TCCTACTAAT ACCAACAGAT CCAATACACC ATTGCCATTT
CCACAATCTT CCTCTGGTTC TATCAGCAAC GGCAGTCATT TGAGAAATGC CCAGATGTCT
GGATTCAGCC CGTTGGCCGA TTTCAACTCT GACACTTCTC CTAACTCCAT AGGTAGTACC
AACAGTAATC AAATCAATGT CAAGAACTCG AGCTCTCATA CTCCTGCCTT ACAACAACAC
CATCTGCTCC AGCTGGTGCC TCAATACCAG CCACATACCT CCACGTTCAA TTCTAGCGCG
AATACACCAC ATAGAATATA TTCGCGTCAT TCCATACCAC AGCATAACTC CCATAGCAAT
ATACCTGAAA CCAGTTCTGT TGATGGTTAC GACGTTTTCA ACACAGCAAA CACCCCCGAC
TTCTTCACTA CACTTAGTGG AGATGGCGGT CAGACTCCTT CCAATATGTT AAACCACGAG
AACAGCCCCT CCATGAACTC CCAAGGCCAT TCTAATGGTG TTAATAGTAA CACTATCTCT
ATTCATGCCT ATACACACCA GAACAATAAC CACAACGATT TCTTGGACTC AATACAAACG
TTTCCATCGT CGCATGACTT CGAATTTGGA CTTGGTGGGG ATTTGGCTTC CAACGGTGAA
ACTCCTTTGG GTGGAGCTTT TGATGGTGCT CCTAATGGTG GAAATGCTGC TGGTAACAAT
AATAGCAGTA ACAATAACAA TAATGCTACT AGTGGTGGCA CTTCCAGCAA CGTCGATAGC
TTTATAGACT TTGGCAGCCA CTGA
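
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be checked against the fields in the record. A minimal sketch follows; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the record does not say how its GC percentage was rounded.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# Only the first and last lines of the listing are used here as a demo;
# paste the whole listing to check the full gene.
demo = clean_sequence("""
ATGTCTTCGG ATGAAAACAG CACAACCTCC AACAAGAGGA CTAGAGCATC TGGTGAGGCT
TTTATAGACT TTGGCAGCCA CTGA
""")
print(len(demo))
print(round(100 * gc_fraction(demo)))
```

Applied to the complete listing, `len` should be comparable to the Gene Length field and the rounded percentage to the GC content field.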
 
Protein sequence
MSSDENSTTS NKRTRASGEA LDFLLQEFEK NPNPSTEQRK DISSKTNMSE KAVRIWFQNR 
RAKLRKFERL NRLQTGGSSI HSSRSNSISN ISPIHPNYGN QAIPIEINEK YCFVDCTSLS
VGSWQRIKSG YHDERSLRNN LINLSPFTIN SVMTSVDLLV ILSKKNCEIN YFFSAISNNS
KILFRIFYPI SSVATCSLLD NNITKENSEL RVSLTHQPKF SVYFFNGINS QANQWSICDD
FSEGQQVSQA YTSEGGTSIP HVLVGIKSSL QYLNSFIADN NNSTYSQFPT SMSGFSPLAD
FNSDTSPNSI GSTNSNQINV KNSSSHTPAL QQHHSLQSFS NTPDFFTTLS GDGGQTPSNM
LNHENSPSMN SQGHSNGVNS NTISIHAYTH QNNNHNDFLD SIQTFPSSHD FEFGLGGDLA
SNGETPLGGA FDGAPNGGNA AGNNNSSNNN NNATSGGTSS NVDSFIDFGS H
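
The record lists translation table 12. To our understanding this is the NCBI alternative yeast nuclear code, which differs from the standard code in reading CTG as serine rather than leucine. The sketch below builds both tables by hand (no external library) and translates two in-frame fragments copied from the gene sequence above. The names `build_table`, `STANDARD`, `TABLE_12`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(aas: str) -> dict:
    """Map each of the 64 codons to its one-letter amino acid."""
    codons = ("".join(triplet) for triplet in product(BASES, repeat=3))
    return dict(zip(codons, aas))


STANDARD = build_table(STANDARD_AAS)
# Assumption: table 12 equals the standard code except CTG -> Ser.
TABLE_12 = dict(STANDARD, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


first_line = "ATGTCTTCGG ATGAAAACAG CACAACCTCC AACAAGAGGA CTAGAGCATC TGGTGAGGCT"
ctg_fragment = "TACCATGACG AAAGACTGCT CCGTAACAAC"

print(translate(first_line, TABLE_12))    # MSSDENSTTSNKRTRASGEA
print(translate(ctg_fragment, TABLE_12))  # YHDERSLRNN
print(translate(ctg_fragment, STANDARD))  # YHDERLLRNN
```

Only the table 12 reading of the second fragment matches the listed protein (YHDERSLRNN). Because the gene is marked as spliced, translating the full genomic sequence end to end is not expected to reproduce the 471 aa protein; the intron coordinates are not given in this record.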