Gene PICST_52783 details

Gene Information

Locus tag: PICST_52783
Symbol:
ID: 4851518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 2039891
End bp: 2041126
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table:
GC content: 45%
IMG OID: 640393226
Product: predicted protein
Protein accession: XP_001387628
Protein GI: 126274728
COG category:
COG ID:
TIGRFAM ID:
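
The start, end, gene length, and protein length values in this record are mutually consistent, and the arithmetic can be checked with a short sketch. The function names below are illustrative and not part of any database API. The sketch assumes 1-based, inclusive coordinates and an unspliced CDS whose length includes one terminal stop codon (the record lists the gene as not spliced).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(2039891, 2041126)
print(length)                  # 1236
print(protein_length(length))  # 411
```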


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.423874
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCTCAT CCTCTAGCAT CTCCAGACGA AACTTGTCTT TTTTCTGGAA GATGTTGGCT 
CTCGATAAAC CCGAGTCCAA GGAGTCCGAA CGTCCGCTCT ACAACTGGGA CGAATCACCT
TACGAAGATA TTAGAACACG TGCAGCGTTC ATAAGAGCCA AAGCCCTCTG TCCGGTCACC
AAGAAGCCAG TCAACTTCGT TTGTCCATAT TCAGGAATTC CTACTCACCA TTCGAGAGAA
GCCTGGGAAA GCGATACTGA ATACCACAAG AGAAAAACCT ACGAGTTGTT GAAGAAGGTC
AACTTGTACG AGCACGATGT CAGATCTGGT AGAAAATTCG ACGAATTTGT ATTTCCCCTC
GAACAAAATA ATGATTACAT GGTCAACTTG TCTAGTTGGG ATTCGTTCTT CTACACGAGA
GATTTCGCTC CCATGAACAC AGAATTCAAC TTGGCTGCTG CCACTAAAGT ATTGACATAT
CCCATGACAA TTGCGGCTAT TATTAACAAA TATTCACCCT ACGAACCACA GCCTAAGGGA
CCAGTAACTG TAGAAGGTCT TCGTTCCTTG GCAGCTTTGA AGTATACTTT ATATCCTCCA
TACACTAAAT CCACTGACGC TGTCACTTTC AAAGAAAGAC CCATGAGAAT TTTCATTCTC
GGCGCTAAAA TGGAATCCAT GTTGCCTGGT TACGTATGGA AACAGTTTGG TTATCTTTTC
CCAGAAACCA AGTTCGAAAT CCATTTGGTA GGCCCGGAAG CTTATTTTGA TAAGGAGACC
AGATCGTTCG GCCCTACAAA TGAGCCTCAT GGCCGTGCCC TAGTCAAAAG ATTTGACGAG
CAAATCACTC TTCACTACCA TACGAGATAC TTCCACGAGC TCTACGACAT GGGTGACTTG
TTCCCATTCG ACCCATACTT GGATATTTTC TTTTTGTTCC ATCCCGGGTT CGGCACGGCT
GACTCCATTT ACTGGGACAA AGCCATGAAG GGATTGCTAG AGTCCAAATG TCCCATCTAC
GTCAGTGGGT ACCACGACAA GGACATGAAG CGAGAAATAC AGTGGTTGGA AAATCACCCC
TTGCACGACG AAATGGATGT GTTGATGACT CAAACAGACA ACAAGTTTGC CTGTACCAAG
ATCGACTTGG TGGACATCAA CCCCACGGAA ACATTTAACT CCAATAGTCA ATTATATGCA
TTCAGAGGTA AGAGATACCA CGCCATTAAG ACCTAA
 
Protein sequence
MGSSSSISRR NLSFFWKMLA LDKPESKESE RPLYNWDESP YEDIRTRAAF IRAKALCPVT 
KKPVNFVCPY SGIPTHHSRE AWESDTEYHK RKTYELLKKV NLYEHDVRSG RKFDEFVFPL
EQNNDYMVNL SSWDSFFYTR DFAPMNTEFN LAAATKVLTY PMTIAAIINK YSPYEPQPKG
PVTVEGLRSL AALKYTLYPP YTKSTDAVTF KERPMRIFIL GAKMESMLPG YVWKQFGYLF
PETKFEIHLV GPEAYFDKET RSFGPTNEPH GRALVKRFDE QITLHYHTRY FHELYDMGDL
FPFDPYLDIF FLFHPGFGTA DSIYWDKAMK GLLESKCPIY VSGYHDKDMK REIQWLENHP
LHDEMDVLMT QTDNKFACTK IDLVDINPTE TFNSNSQLYA FRGKRYHAIK T
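
The sequences above are laid out in space-separated blocks of 10 characters, 60 per line, so they need to be joined before any downstream use. The following is a minimal sketch of that step, with a simple GC-fraction helper that could be compared against the GC content field of the record. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGGCTCAT CCTCTAGCAT CTCCAGACGA AACTTGTCTT TTTTCTGGAA GATGTTGGCT"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(seq[:3])   # ATG
```

Passing the full gene sequence through `join_blocks` and checking that its length equals the listed gene length is a quick way to detect copy or extraction errors.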