Gene PICST_34653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_34653 
Symbol 
ID4851872 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp3059189 
End bp3060559 
Gene Length1371 bp 
Protein Length431 aa 
Translation table 
GC content43% 
IMG OID640393580 
Productpredicted protein 
Protein accessionXP_001386915 
Protein GI126275870 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.236544 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTTCA GAAAGAGAAG TGACATCATC GGAGGCGGTT CATCTGCTGT TCCAGGCAGA 
GCCCCTCTGG GAATTGGACG AGCCCCCGTC GGTTCTGGAG CAATACCAGG CAGAACGCCT
GCGCATCCTA GACTTTCAAT TAATTCTAGA ATACCAGTAA ATCCAAGAAC TCCGAATGCT
TCTGTTTCTA GAGATTCTGG ATTTATAAAT GGAAGAACTC AGTCTCCACA GGTTGAAGTT
GAAGAAATTT CTGCTTTGAA GAATCCTTCT GTTAGACCAT CCTTAATCTC ATCTCAGCCG
ACAGTATCTA CTGGTACCGG TGATTTGGAC AAACTACTTC TTCACCAAGG ATTGCCATTA
GGACATTCTT TGCTCGTTGA AGAGTCTGGA ACGACTGATT TCGCATCCGT AATATTACGA
GCTTTTGTCT CTCAAGGAAT TATGCACAAC CGGATTAATA AAGACCAGAT TAACTCTCAT
GTGATAGCTG TAGGGATTTC CACCCAATGG ACTGCAAACT TACCTGGTTT GTACAAGGGC
TCCTCTAAAG ATCAAAAGAA AGCTAAGATC CTTGCCAATG AGTCTAAAGT CAGTGTTTCC
AACTTGGCAA CGTCCACTGC TGGTGTGACT TCTAGAGTTG ACAATGACTT GAAAATCGCA
TGGAGATATG GAGTGAATAG CAAGCAGAAA TCGGCATCTC CAGAACCTTT TGAAAACAGT
GCATATGAAT ACTACATCAA CCAATTCGAT ATCACCCAGA AACTTGCCCC TGGTCCAAAT
GCCCAAGATA TTTCGTTTGT TCCTGTAGGT CTTAGTCATA TTCAATTAAT CCAACAGATC
CAGAGCATCA TCCAACGTCA TGTCAAGTTA AATCCAGCTA TTGTGATAAG AATCGCTATC
CCTGGACTTC TTAATCCTAC AGGCTACAAT CCATTGAGCT CTTCGCCTAC ATTTTTATAT
CCGTTTGTTC ACTCCTTGCG AGCCATACTT AGGCAATATA GCCAGAATGT GGTTCTTGTT
GCATCGCTAT CTTCAGATCT CTATCCTCGA GATTCGAACG TAGCCCATGT ACTTGAATCG
TTGGCCGATT CGTGCATTCA CCTTCAGCCA TTCAACCAAG AGATGACCCA GTTGATCGAA
AGAGCCTACA AGAATGAACC ATCCAAGATC CAGCAGGGTC TTGTCAATAT CGTCAAGTTG
CCTGTTCTCT CGGAGAAAGG AATGATGATG ATTCATGAAG GAGAATACGC ATTCAAGAAT
GGAAGAAAGA AATTCGAAAT AGAAGAGTGG GGCATTCCAG TTGAGGACTC TGAAAAAGAG
GAACATACTA CTGCCGAAGG TGGCACTACT AAAAAGAACC TCGACTTCTG A
 
Protein sequence
MSFRKRSDII GGGSSAVPGR APLGIGRAPV GSGAIPGRTP AHPRLSINSR IPVEVEEISA 
LKNPSVRPSL ISSQPTVSTG TGDLDKLLLH QGLPLGHSLL VEESGTTDFA SVILRAFVSQ
GIMHNRINKD QINSHVIAVG ISTQWTANLP GLYKGSSKDQ KKAKILANES KVSVSNLATS
TAGVTSRVDN DLKIAWRYGV NSKQKSASPE PFENSAYEYY INQFDITQKL APGPNAQDIS
FVPVGLSHIQ LIQQIQSIIQ RHVKLNPAIV IRIAIPGLLN PTGYNPLSSS PTFLYPFVHS
LRAILRQYSQ NVVLVASLSS DLYPRDSNVA HVLESLADSC IHLQPFNQEM TQLIERAYKN
EPSKIQQGLV NIVKLPVLSE KGMMMIHEGE YAFKNGRKKF EIEEWGIPVE DSEKEEHTTA
EGGTTKKNLD F