Gene PICST_28190 details


Gene Information

Locus tag: PICST_28190
Symbol:
ID: 4850969
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 584270
End bp: 586303
Gene Length: 2034 bp
Protein Length: 677 aa
Translation table:
GC content: 41%
IMG OID: 640392677
Product: predicted protein
Protein accession: XP_001387753
Protein GI: 126273930
COG category:
COG ID:
TIGRFAM ID:
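
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the terminal stop codon. Both are assumptions, although the listed values are consistent with them. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(584270, 586303)
print(length)                              # 2034
print(expected_protein_length_aa(length))  # 677
```

Both results match the Gene Length and Protein Length fields of this record.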


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.458855
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.78983
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCCCA TAAGTCGAAA ACCGAGGGCA GTGTCCCGAA AGATTCTGAC TCCCTCACCC 
AGATTTGCAA GTCTAACACA AGCCAACTCA ACTTCACAAC AGCAGCCATT GCAGGCCGTA
GACAAACAGG ATCTTCTTCA GAAATACCGG GAGGTAGGAA ATGGCCACGA AGCAGGTGAA
GACATATTTG GTGGAGCTGA AGAGTCGGAT TTTGGCTTGG GTGACGATTT CAAAACTTTG
AAAATTTCGA GCCTGGATCT CAATACGATC AGACAAAACC ATGCGGAGGC CAGGAAAGCG
ATAAAATTCG ATGATGAACT TGGGCTACAA GACCAGGGCA CAATTTTTGG CAAGAATGGC
TACTCTCCAC TAGACTCATT GTCCACCTCT TCACCAAAAC AGTCTGTTTT CACACCAGTT
TCCCGTAATT CAGAGTCTAC GGATCGAAAT AATGGCATCA GAAGAGTGAC AAAGGAATCT
TTATCGGATT TCAGTGAAGG AGAAGATACG GACATAACAT CAGAGTTCAA TGATAACGAT
TTCGAAGACT TGGATAACAT CTTTGGAAAT GAAGAAAGCG GCATTTACGA TAAAATGAAC
AAGATTCTTT CCAACAAAAA GCTGGCATTA CAGAAACAGG CTGATTCAGA AGAAATTGAG
CTCAGAAGCC AGTTGGAAAA ACAGCAAGAG CAACAAAGAA ATACGCTCCT GGATGTGAAC
GCTACGCTCA GATTAAGGGA TTTCAACAAA ATTCAGATAG ATGCCCCTAA AAACAACTTG
ACTTCACAGA ATTTAAACAT ACTTGATCAG ATTGAAAATG AGAAGACTGT CAATTACGAA
TATACCAGAG ATGATTTCGA AGAGTTCGAA ACTGGCTTTG AGGATAACTT TGAAAGCAAT
CTCAAGAACA GCAGATCTGT GGGACCAAAA GCCACAAATA TGGCTACGGT TCGATCCAAA
GCATCTATGC CTATTCTCAG CAGAAATAAC TCTTCTTCAG TAAGGCGATT CAAGTCAAAT
ATGGATCTAG TAGGGAGTTA TGGCTTTGAG AACATTGATG AAGAGCTGAT GCATAATGAA
CCAGAGTTCA ACTACAACAA CAACGTAATC CGTAAGTTAG ACAGAATACC GTCGTTCTAC
AACAGCAACA GCAACAGCCG AAACAGCGAG CTTTCTTCAC GAAAATCACA GCTTCTTACC
AAATACAAGG AGCAAGCACT TTCTGAAAAA GAGAAAAAGA GACAAAGTAG ACTTGCGAGG
GCTGGACCTG AACAAAGCAA GCATCCCAAA CTAGGATTGG TGAAGTATTT AAATAATAAC
TCGGTAATTA AAAACCCTTC TATACCGACA AACAACAAGA TGATGCGGTA CAACTCTGTA
CGACAAGAAT GGGAAGGAAA CGAACACGAT CTTCTCCGAT TTGATAGCTT GAGCAAGCCT
TCGCTTATAA CGATGAATGA GCTCCAAGAT CCGATTGATG ACGATTCCCT CAAACCGAAA
ATAGGAAAGT TAGACGTCAA GGACAGTAGA AATCCTCATA TGGTGTACGA CAACGAGAAC
AGAAGATGGA TCAACTTGAG GGAGGAAGAC GACTCCATTT TCAACGACAT TGAAGATCTT
GTTGAGGATA ATGGAAATCT TGCCAAAAAG GAATACGTGT TGGCATCACC TCCACGCCAA
ATTAAACCAA ACACACTAGC ATTCAAGGGT TCGATCTCTC CTTTTGTAGT ACCAAATATT
ACGCATTTGC AATCACCCAT CACCACTCGG GGAATAAGTC AGTTCACCCA GAGGACAGCT
TCCAGCAATA CAAATTCGTC TGCTACAGAA AGCTCAGAAG AGGAAGTAGA TGAAGCGTTC
AAACTTTCCG CCAAGCTTAT AGACAAGTTC TACAAAGAAG AAGTGAAGAT TATCAAGAAA
ACCCAGCATT GGTTCAATGC CAACGAGGCT TACGATTACA ATATCAAGAA AATGAATTTC
ACAGATACCG AGTACTACTG GGAAATCCGC AAGATGGTTA TGGAGAATGA ATAA
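
The gene sequence above is displayed in blocks of ten bases. As a rough sketch, the blocks can be joined and a GC percentage computed as follows. `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any database tooling, and the sample string is only the first displayed line of the sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace and uppercasing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


first_line = "ATGAATCCCA TAAGTCGAAA ACCGAGGGCA GTGTCCCGAA AGATTCTGAC TCCCTCACCC"
seq = clean_sequence(first_line)
print(len(seq))                    # 60 bases in the first displayed line
print(round(gc_percent(seq), 1))   # GC percentage of that line only
```

Applied to the full sequence, the result can be compared with the GC content field, bearing in mind that the database's rounding convention is not stated here.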
 
Protein sequence
MNPISRKPRA VSRKILTPSP RFASLTQANS TSQQQPLQAV DKQDLLQKYR EVGNGHEAGE 
DIFGGAEESD FGLGDDFKTL KISSLDLNTI RQNHAEARKA IKFDDELGLQ DQGTIFGKNG
YSPLDSLSTS SPKQSVFTPV SRNSESTDRN NGIRRVTKES LSDFSEGEDT DITSEFNDND
FEDLDNIFGN EESGIYDKMN KILSNKKLAL QKQADSEEIE LRSQLEKQQE QQRNTLLDVN
ATLRLRDFNK IQIDAPKNNL TSQNLNILDQ IENEKTVNYE YTRDDFEEFE TGFEDNFESN
LKNSRSVGPK ATNMATVRSK ASMPILSRNN SSSVRRFKSN MDLVGSYGFE NIDEELMHNE
PEFNYNNNVI RKLDRIPSFY NSNSNSRNSE LSSRKSQLLT KYKEQALSEK EKKRQSRLAR
AGPEQSKHPK LGLVKYLNNN SVIKNPSIPT NNKMMRYNSV RQEWEGNEHD LLRFDSLSKP
SLITMNELQD PIDDDSLKPK IGKLDVKDSR NPHMVYDNEN RRWINLREED DSIFNDIEDL
VEDNGNLAKK EYVLASPPRQ IKPNTLAFKG SISPFVVPNI THLQSPITTR GISQFTQRTA
SSNTNSSATE SSEEEVDEAF KLSAKLIDKF YKEEVKIIKK TQHWFNANEA YDYNIKKMNF
TDTEYYWEIR KMVMENE
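
The protein sequence can be related back to the gene sequence by translating codons in frame. The sketch below assumes the standard genetic code (NCBI translation table 1), since the Translation table field of this record is blank. Some yeasts use an alternative nuclear code in which CTG is read differently, so a full-length translation under this assumption may disagree with the listed protein at such codons. The `translate` helper is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (assumed), amino acids listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    residues = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three displayed blocks of the gene sequence (30 bases, 10 codons).
print(translate("ATGAATCCCA TAAGTCGAAA ACCGAGGGCA"))  # MNPISRKPRA
```

The output matches the first ten residues of the listed protein sequence.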