Gene PICST_59093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_59093 
Symbol 
ID4838774 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1299651 
End bp1301381 
Gene Length1731 bp 
Protein Length416 aa 
Translation table12 
GC content38% 
IMG OID640390089 
Productpredicted protein 
Protein accessionXP_001384205 
Protein GI150865118 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0602507 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TACTATGCTG CTAAGGGATT GAAATATGAC GATAAAGCTG CTGCAATACG AGAATTGAAG 
CTGGTCGTAG ATAAATCGGA GGATAATGAA GAGAACAATG AATGGAGATT TAAAGCATGC
AAGCAGATCA TGAAGATCAG TGTAGATATT CAGGATTACG ATGGAGCATT GCAGCAGCTC
AGCAAGTTGA TTGAACTACT TCCAAAAGTC AGCAGGATTT ATTCTGAAGA GTCTTTAATC
AAGATAGTCA TGAATTACTC CATCGTGGGT GATAATCTGT TTGTGACATC TTTGTATGAT
ATGATAACGA AATATACTCT GGAATCGAGT TCTGGAAGTA ATGATAGATT GTTCTTGAAG
ATTTCTCTTA GCAAACTCAA CTACTTTCTT GAAAATGGTG ACTATGCTAA ATGTCCACCA
TTAATAAAGT CTATCAATGA GAAACTCGCC CAAGTCTCAG AAGCAATGAT GAAGTCATAT
GTGCTAGAAG CTATTGCTTG TGAGATTGAG TATGAATCGC ACATGTCAAA TGTCAACCTC
CTCAAATTAA ACCAATTATA CCGAAAGAGT TTGAAGATAA CAACAGCTGT AACACATCCA
AAGATTCTTG GGACGATACG AGAGAGTGGA GGAAAAGTCT CGTTTTACAG AGGAGACTAC
GAAAAAGCTA GAACTGAGTT TTATGAATGT TTTAAGAATT ATGATGAAGC TGGATCTTCC
AAGAAAAAGA AAATACTAAA GTATCTAACC TTATGCTCCT TATTGACAGG GAATGAATTC
AATCCGTTTG AGTCTCAAGA AACACAAACA TATGCACAGC TTCCAGAATT TTCCAATTTA
TTGCTATTAA TGCAATCTTA CGACGATATG GATTTGAAAG GAACTAAACA GATTATAGAA
CATATCCTAA TATCCAAGGA CGAGTTGCTG AACGACGATA TCTTCTTGAA TGCCCATGAG
AAGATCTTGC TTAATTTGAA ATCAAAAGCT ATAATGAACC TCTTCAGCGC CTTCAGAACT
ATCAAGTTCG AGTCGATAAG GCAAGCTGTA GATTTGAGCC AGGAAGACCT TGAAACACAT
ATAATGAAAC TAGTAAATTC TGGAAAACTC ACCCATATCA AGATCGATTT CGTCAATGGG
TATGTAGAGT CTACGAGTGA AAGAAACCTG ATTTTCCCAT TGAGTTTGCG TTCTGAAGAT
ATATACTACA ACTTGAAGGC CATCAATATG TTGGACTTCA ATAGCAATAG CGTTGGAAAT
CCAGGCGATG CAAACAGTGC ACACGAAGAT AGAATGGATG TAGACAATGA AAGAGAAAGG
GAATACCTGC AACCTCTAGA TGAGGCCAAC AAACAAAGCA TTCTCACCAA ATTTTTGTTC
GCTTCAGATT ATCAGAGCGA CAAAGATTGG TTGAAAGCAA TTGACTCCTG GTATCGTTAT
TTGGTCTGTG CTATACCGCC GGCAGTGAAG TCTGAATTGA GTCAGAAAGA TCAGATTTTT
TCGGAGCAAA GGGCTGAAAA ATCAGTCAAC CATCCAGGAA ATACAAAGAC TGACATCGAA
AACGATGTTG CCGACCAGAG TACAAATGCG GGCATCTTGA GTTCTACAAT CAACGGTGAC
ATTGATGATG GTGAAGACGA CGAATACTGT GAAAACGTAA GTAAAGTAGA CCTCCTCACC
AGTTGGTACA AAGAGCTACA AAAGTACTAC AATAGCATCG CCAGTAAATA G
 
Protein sequence
YYAAKGLKYD DKAAAIRELK SVVDKSEDNE ENNEWRFKAC KQIMKISVDI QDYDGALQQL 
SKLIELLPKV SRIYSEESLI KIVMNYSIVG DNSFVTSLYD MITKYTSESS SGSNDRLFLK
ISLSKLNYFL ENGDYAKCPP LIKSINEKLA QVSEAMMKSY VLEAIACEIE YESHMSNVNL
LKLNQLYRKS LKITTAVTHP KILGTIRESG GKVSFYRGDY EKARTEFYEC FKNYDEAGSS
KKKKILKYLT LCSLLTGNEF NPFESQETQT YAQLPEFSNL LLLMQSYDDM DLKGTKQIIE
HILISKDELS NDDIFLNAHE KILLNLKSKA IMNLFSAFRT IKFESIRQAV DLSQEDLETH
IMKLVNSGKL THIKIDFVNG YVESTNDEYC ENVSKVDLLT SWYKELQKYY NSIASK