Gene PICST_78790 details


Gene Information

Locus tag: PICST_78790
Symbol:
ID: 4840121
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 271432
End bp: 273346
Gene Length: 1915 bp
Protein Length: 405 aa
Translation table: 12
GC content: 42%
IMG OID: 640391436
Product: predicted protein
Protein accession: XP_001385390
Protein GI: 126137734
COG category:
COG ID:
TIGRFAM ID:
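
The reported gene length agrees with the start and end coordinates if they are read as 1-based and inclusive. That reading is an assumption here, since the record does not state its coordinate convention. A minimal check in Python (the function name `span_length` is illustrative, not part of any database tool):

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp as listed in this record.
gene_length = span_length(271432, 273346)
print(gene_length)  # 1915, the same as the reported Gene Length
```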


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.372401
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.242751
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCCCCAC TTGTGCTCGC AACTCAACCA GTACTTGCGC CGACTTGCCA GTCGACCTCA 
TCCTTAGAGG CTCCGTACCA ATTTGCCAAA TCTATATAGG TTCCGTTAGC TAATTTTCAT
ATCTTGTGAA TTTGACATCT GTGTTTCCAA CTCTGAACAC GGACTTTCGC TATAGCTTCC
ATCTTTGCCC TGCAAAAGTT AAGTGTTGTC TGTTGTTGGC CAATCACCTT TTTTCGCTGA
TTTGGTCATA AGAGTGTTAA CATAGTGGGA ACAAAACTGT TCATATAGAT TTGATTTTAC
ACTGAACTTT ACCCAGAACA TTTCCATTTT ATTTCGGCAA AGGCATATAG ATTACTCCAT
TTTCTCTTCT TGAACATTTA CCTGTGTCCT TTACTCATTG TATCTAAAAA TGTCTGTCCT
ATTCCTAAAG TACGCCCTAA AGCCGTTCCT ATTCGTTTTC CAGTTGCTAA ACAGAATTTT
CTGGTCCGGA TTAAACGGCC GTACCGTTTT CCAGCTAGTG CTAAACTTCT GGCTCAACTT
TTCACCAGTC TTTATTTGGT TACTTATGTT TAAGAATGCC GGAATTATAC CCAAGGAAAT
CCGTCCTAAG ATTTACGTCG CATTGGCTAT GCATGTTGAC GACTATATGT TCAACTTCGT
CGGTCATCCG CTCATCTCCA CTGTAGCTCT TGTGAGCTTA GTGTCTGGGG CTTGGTTGAT
CTACTACGTG TTTTATAGAA CCCCCACCTC CAAAAAGCAA GAACAGTCAT ATTCGGCTCT
TTCCAATGTT TACAAAAATG AACTCCATAA TGGACATTCC ATCGATTCCG ACGATCCTAC
AGCTGTCGGA TCGTCGTCCG AGACTTCTTC CGATTTAGAA GATTTTAATG AATACGAATT
AACAGATATG AATTCAAGTT CGTCTGATCT CGGGGACGTG TCTATTTTCA ATTCTCCAGG
AGATTGGCAA GACGACACCC ATCACAGTTC GATTGAGTTT TTCAAAAATT TATCGTCCAA
TGCCATTTCT ACACAGACCT CAGAAACCAA CCGTAGGATA TGGAGAACTA TCAAAACCAG
AGGATACGGG CCATTAAACT GCTGGAACTT GTCTCCACCA ATTCTTATGG CATTGAGTTG
GTTCCTACTT AACATTGACT ACTGGTTCAA GGACCCAATT AACACTCCTA AGGACTTACT
TGCATGGACT TCTTATGTTT TGTTTCATTT CTTTGTTCCT TTATTCACTG CCATATGGTT
ATATGTATTC CACGCCCCTG GAGCTTTGAG ATTGTTTTCA TTTGGACTTG GAATGCAAAA
TATAGCAGGT GTTTGCACCC ACTTGCTTTT CCCCAATGCT CCACCTTGGT TCATCCACTT
ATACGACGAA GATGCGGAAG CAACTTATGA CTTGCCTGGT TATGCCGCTG GATTAACCAG
AGTCGATATG GCCATGGGAA CCCATCTCAA TTCCAACGGT TTCCATGCTT CACCCATTGT
GTTTGGAGCT TTGCCATCTT TGCATTCAGC CATGGCAGTG ATGGCTTTCT TCTTTGTCTC
GTACTACTCG AGATGGACAA CCCTAAAATT GCTTGCCGCA TCTTTTGTAG CATTACAATG
GTGGGCGACA ATTTACTTGG ACCACCACTG GCGTTTAGAC TTGGTTGTTG GCATGTTGTA
TGCGATTACC AGCTTCACGT TGTTATATTG TTGGCCCAGG GGAATTAAAA AAGTTGATTC
AGATTTCATG AAAGCTAGAC TACGATTTGA TTTCAAGAAT GGATCGACTA TGGGAATGAG
AGTTTTCAGG AATACCCGCT TACAGAACTT TTTCGATCCT TTAGCATAGA CATATAATAC
ATTTACCTAC GCCTTTAATC TACGAATGCA TATCGGTCTA CGATCGATCT TATAA
 
Protein sequence
MSVLFLKYAL KPFLFVFQLL NRIFWSGLNG RTVFQLVLNF WLNFSPVFIW LLMFKNAGII 
PKEIRPKIYV ALAMHVDDYM FNFVGHPLIS TVALVSLVSG AWLIYYVFYR TPTSKKQEHS
IEFFKNLSSN AISTQTSETN RRIWRTIKTR GYGPLNCWNL SPPILMALSW FLLNIDYWFK
DPINTPKDLL AWTSYVLFHF FVPLFTAIWL YVFHAPGALR LFSFGLGMQN IAGVCTHLLF
PNAPPWFIHL YDEDAEATYD LPGYAAGLTR VDMAMGTHLN SNGFHASPIV FGALPSLHSA
MAVMAFFFVS YYSRWTTLKL LAASFVALQW WATIYLDHHW RLDLVVGMLY AITSFTLLYC
WPRGIKKVDS DFMKARLRFD FKNGSTMGMR VFRNTRLQNF FDPLA
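
The sequence blocks above are printed in groups of ten characters, so the whitespace has to be stripped before anything is computed from them. The Python sketch below shows one way to recover the length and GC percentage of a pasted block so they can be compared with the reported Gene Length and GC content. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database rounds its GC value is not stated, so small differences would not be surprising.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace from a pasted sequence block and upper-case it."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence in this record, used as a small example.
example = "TTGTCCCCAC TTGTGCTCGC AACTCAACCA GTACTTGCGC CGACTTGCCA GTCGACCTCA"
seq = clean_sequence(example)
print(len(seq))          # 60 bases once the spaces are removed
print(gc_percent(seq))   # GC percentage of this one line only
```

Pasting the full gene sequence block into `clean_sequence` in place of `example` gives the values to compare against the record.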