Gene PICST_33812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33812 
Symbol 
ID4840814 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp538910 
End bp540269 
Gene Length1360 bp 
Protein Length354 aa 
Translation table12 
GC content39% 
IMG OID640392129 
Productpredicted protein 
Protein accessionXP_001386497 
Protein GI150866785 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.680191 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.108909 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATTGGC GAAAATTGGG CTTCAGTGCT GTAAGTAGTC ACACGAATGT TAATTGTGAT 
ACCTAATTTG CGAACATCAA AGAATATGAA ACTTTGAGCT TCATAACTTA ATACACTCAT
ACTATAATAC TGGAGATACG ATATATCATA CAAGGCTGAA TGGTAGCACA GTACTAGTAC
GATTCAATAC TGTTTTGATA GTAGATTCTC CCTTTATGCT GTGCTGTATT CCACTATGAA
GTATCGTATG ATCTATATCG TTTCCATATT GTACACCTTC CGATGATGTT CTGTGATATC
TTCCTCAATT ATACTAACAT ATCCAGTTTG GCACGTTTCT AGGCGTCTCA GTTTACAAAC
GGCTAGAGAT TAAGAATAAC ATCTACATAC AATCACCCAT TAATTCCTAT GGAAGCATCC
AGAGCTTGGC CAGCAAGGTG TTTTCAACGT CTACAAGCAC TTCGTCAAGG TCGTCCTTCA
ACTTCAAGCT TTTCAATCTT GGAATCTTGG GATTGCTTCT CTTCATTAAC TTTGTCAAAT
GGGCCATCTT TGGTAAACTT TCACCTACCG AAATCAGAAA TCTCAAACAC AAGATCAACT
ATACCATCTG GGAGTTTGCC TTTGGATTCA TGATCTTCTA TGTAAAGTCA CGATCGATCG
GATTGCAAGT AATTCAGAAC GAGTTGTTCA AATTTGCTGG CCTCTTCTTT TCTGTGCTTT
TGCTTAAATG TTTTCATTAT CTTTCTATAG ACAGAGTCAG CTCCATCTTC AACACAAATT
CCAACTCGCG GGCCGAGGTG AAGTATCAGG GACTAAGACT CTTCGTTGGG CTCATAATTT
TGGCATTTAT TGACAACTTG TTGATCTCTC GTTTCTTGTA CGAAGTGTAT CAGAACTACT
ACTGGTCAGA TAAAATGATC GAGATGTCGA AAGTAACACT CCAGGAAAAC ATTTTGACAG
CTATCTTTGG ATTTGAGATC TTGCACATCG GGCCGTTAAT TTTCTTGACA ATCTTGAAGT
ATTGCTTGGA TTTCTACGAA TATTTCCACT TCCATCTGGT GTGGCCCGAG GGCAATGCTC
CACTTACTAC AGAATTGGAG TTGAATACCT GGAAAGAAAC AAAGATGAAG ATTATATATG
TGACAGAGTT CGTAGTGAAT TTGTTACGTT TCACCATGCT CTGCATATTT TCCATCGTCT
TTTTATCGCT TCACACTTTT CCCTTCCATA TCTTGCCATC TTCGTACTTG AGTTTGAGAG
TTTTAGTGGT GAAAACAAGA CAGTTGATCA ACTTCAAAAA GAAGCAGTTC ACATTGAAGA
AACTTACGAT TCCCGCTACA CTCGAAGACC ACCTGGAGCA
 
Protein sequence
MYWRKLGFSA FGTFLGVSVY KRLEIKNNIY IQSPINSYGS IQSLASKVFS TSTSTSSRSS 
FNFKLFNLGI LGLLLFINFV KWAIFGKLSP TEIRNLKHKI NYTIWEFAFG FMIFYVKSRS
IGLQVIQNEL FKFAGLFFSV LLLKCFHYLS IDRVSSIFNT NSNSRAEVKY QGLRLFVGLI
ILAFIDNLLI SRFLYEVYQN YYWSDKMIEM SKVTLQENIL TAIFGFEILH IGPLIFLTIL
KYCLDFYEYF HFHSVWPEGN APLTTELELN TWKETKMKII YVTEFVVNLL RFTMLCIFSI
VFLSLHTFPF HILPSSYLSL RVLVVKTRQL INFKKKQFTL KKLTIPATLE DHSE