Gene PICST_76461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_76461 
Symbol 
ID4836724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2679606 
End bp2681086 
Gene Length1481 bp 
Protein Length442 aa 
Translation table12 
GC content46% 
IMG OID640388039 
Productpredicted protein 
Protein accessionXP_001382748 
Protein GI150864061 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGCAGACGTC AAAAAGCATA CACACAGCGC CCATTGACTC AAAAACAACT GGGTTATACT 
ACCAGGAGTA CTTGAATCAG CATATAGTCC TTTTTTGCAG CCATGGCGTC GCAGATCCAT
TTTCTAGACA TCAAGGGTAA ACCCTTGCTT TCTCGAGATT ACAAAGGAGA TATCCCCACC
AACACCATCG AGAAGTTCCC GCTCCTACTC TTGGAGCTTG AGAACGCCGC TGATGATGGT
GACTTCAAGC CGTTTGTACA CAGTCAGGGA ATCAACTACA TCTTCATCAA CCATAACAAT
CTCTATCTCT GTGCCTTGAC ACGGAAGAAC GAGAACATCA TGGCTATCAT CGTATTCTTA
CTGAAGTTAA TAGAAGTGTT GACGCAGTAC TTTAAGTCGC TTGAAGAGGA ATCAATACGC
GACAACTTCG TGATAATCTA TGAGTTGTTG GATGAAATGA TGGACTACGG TGTACCCCAG
ACCACAGACA CCAAGATCCT CAAGGAGTAC ATAACCCAAG ACTACTACAA GTTGGTGAGA
AGCACACCTT CGCATTTGGT CCAGCCACCC AATGCCGTCA CAAATGCGGT GTCGTGGAGA
AAGGACGGCA TTTTCTACAA AAAAAACGAA GCTTTCTTGG ATGTGGTGGA ATCTATCAAC
ATGTTGATCA ACGCCAGTGG ACAGGTGTTG AACAGTGAGA TATTGGGAGA GGTCAAAATC
AAGTCCCACT TGAGTGGAAT GCCAGATTTG CGCTTGGGAT TGAACGATAA GGGCATTTTC
AGCTCTAGTT CCGATCTTGA GGCCGGCGAA CAAACTGCCA ACGCTAAAGG TATCGAGATG
GAGGATATCA AATTCCACCA GTGTGTCAGA TTGTCCAAGT TTGAAAATGA ACGTATCATT
ACTTTCATTC CACCCGATGG TGAATTTACT CTCATGTCGT ACCGGTTGTC GTCGGCGCAA
TACTTGATGA AGCCGCTTCT CTTGGTCAAC TGTAAATTCA AAGTGCACAA ACACCTGAGA
ATTGAGATTC TCTGCTCCAT CAGGGCTCAG ATTAAAAAGA AGTCAACTGC CAACAATGTT
GAAGTCATCA TCCCCATCCC AGAAGACGCA GATACCCCCA AGTTTGTGCC TGAGTACGGA
ACTGTTAAGT GGATTCCTGA AAAATCGTGT GTCATCTGGA AATTGAAGAC TTTCCCCGGA
GGCAAGCAGT TCCATATGCG TGCCGAATTG GGATTACCTG CGGTGACAGA CCCCGAAGAC
ATACTCCTGA AAAAGCCCAT CAAGGTCAAC TTCTCCATCC CGTACTTCAC CACGAGTGGC
ATCCAGGTCA GATACTTAAG AATCAACGAG CCCAAGTTGC AATACCAGTC ATACCCCTGG
GTCCGCTACA TCACCCAAAG TGGAGACGAC TACACCGTCC GGACAAAGTA GAGTTCATAG
AGAATAGGAA GTATAGGAGT TGACATAATT ATGATGTAAC T
 
Protein sequence
MASQIHFLDI KGKPLLSRDY KGDIPTNTIE KFPLLLLELE NAADDGDFKP FVHSQGINYI 
FINHNNLYLC ALTRKNENIM AIIVFLSKLI EVLTQYFKSL EEESIRDNFV IIYELLDEMM
DYGVPQTTDT KILKEYITQD YYKLVRSTPS HLVQPPNAVT NAVSWRKDGI FYKKNEAFLD
VVESINMLIN ASGQVLNSEI LGEVKIKSHL SGMPDLRLGL NDKGIFSSSS DLEAGEQTAN
AKGIEMEDIK FHQCVRLSKF ENERIITFIP PDGEFTLMSY RLSSAQYLMK PLLLVNCKFK
VHKHSRIEIL CSIRAQIKKK STANNVEVII PIPEDADTPK FVPEYGTVKW IPEKSCVIWK
LKTFPGGKQF HMRAELGLPA VTDPEDILSK KPIKVNFSIP YFTTSGIQVR YLRINEPKLQ
YQSYPWVRYI TQSGDDYTVR TK