Gene PICST_28339 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_28339 
Symbol 
ID4851116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp967388 
End bp969067 
Gene Length1680 bp 
Protein Length559 aa 
Translation table 
GC content38% 
IMG OID640392824 
Producthypothetical protein 
Protein accessionXP_001387823 
Protein GI126274102 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.908779 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0915689 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTCAA TTCCTTTAGA ATTGCTACAG TTTCCCGAAC ATATCATTCA AACGATAATT 
GACGCACTTC CCTTCAAGTT GGTGCTTTGT CTAGCTTCTA ACGGCAGTTC TATATTCCAC
AAGCAATTGG TAAATCGAGT CTTTAGAACT GTTGAACTTG GCCCTTTTAC ATCAAGCAAC
CGTGCATTGA AGGTAGAATG GGATTCCTCT ATTCTCTGGG AAAGACCCTA TTATACTATG
CTTAAGGAGA TAGAAGCAAA GTACATGAAC TTTACAAGTA TCAAGATATT AGACCTTTTC
TTCGAAGAGA ACTCAGAAAT CCATGTCAAT AACCTTATAG TGCGTGATAC ACCAAATCAA
GAGGTCTTGA ATCAAATACA TCGCTTAACT AAGAAGATGA AGAGGGTACT GTTAAAGTGC
GAGACTGATT CCACAAATCA ACAAGTTGAT AAAGAACTCA ATCTTCCTGA GGGTCTTTAT
GGCTTTTCTT CTAAGTACTT TCCTGACATG AGAGTATCAT TTCCTGAAAA TCTTCAGAGG
CTTGAACTTG AGGTTTTTGA CTCGGTGAGT ATTTTGGATA AGTTACCTTC AAGATTGACG
CATCTTCAAT TGTATTCAGT TGAAGAACTA CCATTAATAG CACTCCGAGA CTTTTGTCTT
TTTCCTAGGA CCTTGAAACA TTTGGAACTA GGTAATTGTA TTGATTTCGG AAATGAAACA
GAGGTTAAGA TAGACTTACC TCTCTCTTTA GAGAGTTTCC ATGTTTCGTC AGAAGTCAAC
CCCTCTGTGT GTCTAGACAT TTCCCACTTG ACGAATTTGA GAAAAATGAC GGTTAGTTTC
TTCGGAGATT CTGCTCCTAT GTCAATATTG AAGTTTCCTA TTCTGATAGA AGAACTATCT
TTACAATCAA CCAGTTTTCC ATTTGAAACG GAAGTGATTC TGGGACTAGA AAGATTGAAA
CTGTTTGGTC AGTTGGTATT ATATCCACTG CATTGGATTC CCAGTTGTAA ACTCTCACTT
CCTGATTCGG TTGAGAGCAT ATCAATTGAC TGCGGCGGGC ACTCTTTTGA GAGTGTTGTT
GAATATCCAA AATATCTAAG GACCATATTC TTAACTAGAT GTGGATCTAT AAACCCTCTT
GGAATACTTG ATAATCTAAT AGTTTTATCA ATTGACAATT CAGAGTTTGC ATACTATGCT
CACCCTATGG TACAACCCAT CGAGAACAAC TGGCGTAACT TGGACATGTT AGAATCGCTA
AAGGAGTTAT CTATTATTGA TAGTAGTATG GAAAACATCC CAAAGCTACC TCCCTTTCTT
CGTTTCCTTG ACCTCCGATG CAATGCACTT GAAAAGATAG ACACCCAACT TCCAGACACT
CTAGTGGTGA TTATTGTGTC AAAGAACAAA CTAGTTCAGT TTGATGGATC GGGTTACAAG
CAATTGAGAA AGTTGGACCT TTCCGATAAT TTAATTAGTG AGCTATCATA CATCACTTTG
AAGTTACCTC CTGGACTACG TGAGTTATCG CTAAAGGGAA ATCCAATCAC TTGTGTGGCA
CCTGGGTTTG CAATCTCAAA AAGTTTGCAA CTAGATATAA GAGATGAAAT GCAATTTGAG
GAAGCCAAAA TATTAATGCA GCACGGAAAA AACCTTTACG AGAGTATGGC TTACGAGTGA
 
Protein sequence
MGSIPLELLQ FPEHIIQTII DALPFKLVLC LASNGSSIFH KQLVNRVFRT VELGPFTSSN 
RALKVEWDSS ILWERPYYTM LKEIEAKYMN FTSIKILDLF FEENSEIHVN NLIVRDTPNQ
EVLNQIHRLT KKMKRVLLKC ETDSTNQQVD KELNLPEGLY GFSSKYFPDM RVSFPENLQR
LELEVFDSVS ILDKLPSRLT HLQLYSVEEL PLIALRDFCL FPRTLKHLEL GNCIDFGNET
EVKIDLPLSL ESFHVSSEVN PSVCLDISHL TNLRKMTVSF FGDSAPMSIL KFPILIEELS
LQSTSFPFET EVILGLERLK LFGQLVLYPL HWIPSCKLSL PDSVESISID CGGHSFESVV
EYPKYLRTIF LTRCGSINPL GILDNLIVLS IDNSEFAYYA HPMVQPIENN WRNLDMLESL
KELSIIDSSM ENIPKLPPFL RFLDLRCNAL EKIDTQLPDT LVVIIVSKNK LVQFDGSGYK
QLRKLDLSDN LISELSYITL KLPPGLRELS LKGNPITCVA PGFAISKSLQ LDIRDEMQFE
EAKILMQHGK NLYESMAYE