Gene PICST_37839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_37839 
Symbol 
ID4851006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp695196 
End bp696662 
Gene Length1467 bp 
Protein Length488 aa 
Translation table 
GC content43% 
IMG OID640392714 
Productpredicted protein 
Protein accessionXP_001387771 
Protein GI126273967 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.350145 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.85819 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGGAAC TTGAAGGCGA CGACAAGGCA ATGGCTAGAA AGATTTACCT TGTTAACAAC 
GCCTTGGACG AGATAGGGTT CACCTGGTTC CATGTTAAAT GCTTTGTCGT TGCTGGCTAT
GGTTATGTTG CAGATTCTCT CTTGGGTATG GCCCAGTCGA CTGTTGCAAC TTACGTGAAT
TTGCAATTCA ACCAAACATA TCCCTTGAGC ACACAAGTTC TCTACATTGG CCTCTTTTCG
GGATGTGTCT TCTGGGGGTT GAGTGGTGAT ATCATTGGAA GAAAGCTTGC TTTCAACTTG
ACCCTATTCC TCTGTGCCAT CTTAAGTTTT CTTGTTGGTG CCATGAGCAG TTTCCCCATG
TATTGTTTCA TGTTGGCGAT TAGTTCATTT GCTCTAGGCG GTAACTTAGC TATTGATGCT
ACAGTGTTTT TGGAGTTCTT GCCATTCAAC TACCAGTGGT TGACGACTTT CTTTGCCTGT
TGGTGGTCTC TTGGCCAAGC AGTTGGATAT GGTGTGGCTT ATGCCTTTGT TGTTCCAGAA
AAGTGGCATT GCACCAGTGC CGATAACTGT CCCTCTGAAA GTAACAGAGG ATGGAGGTAT
GTGTGGTATG TCGATGCAGG TATAGTATTC TTCTTTGCCG TTATCAGATT GATGCTCAAG
TTAGAAGAAA CCCCAAAGTT CTTGGTAACC AACAACAGGG ATGCTGAATG TGTAGAGCAG
TTGCAGGCAA TTGCTAAGAA GTACAACAGA ACTTGTTCCT TGACCTTAGA AGACTTGCAG
GCTTGTGGTG AAGTAAAGAA AAATGACTTT AAGATGAGCG ACCCTAAGTT GAAGGACTTC
TTCAGCAGTA GTATAAAAAA TAGTAAAGCA TTGTTCAGCA CTAAGAAGAT GAGCATTAAC
ACCTTGATGT TATTCATGTC TTGGTTTGGT ATTGGTATCG CTTATCCACT TTGGGGTACT
TTTTTGCCAG TTTACATTGC TTCTAAAGGT GGTCATACCT CTGCCGACGA CGCTGCTGGT
GTTTATGGCG ATGCTTTGCT CTCCACTTGT TTGTCTTTCT TTGGTCCAGT AATTGGTGGT
CTTCTCATTT TAATTCCTCG GGTAGGAAGA AGAGGTACTC TTTGTATTGG TGGTATAACT
TCTATGATCT TTTTTATGGC TTATACGACA GTTAGAACAA GACCAGGTGC TCTCGGTTTC
TCAACAGCAG CCTACATCTG CATCTATATC TACTACGGAT GTCTCTACGG TTACACACCA
GAATGTCTTC CAAGTTACTG CAGAGCTACT GGCTCCGGGT TGGCATTTGT CTTCAACAGA
ATAGCGGGCC TCATTGTCCC AGTGATTGCT TATTACGCTA AACCTACCAC TAGTGTACCA
ATTTGGGTGT GTGCTTCTTT CATTGGTCTA ATTGGAATTG GCTCCTTGTT TTTTCCATTC
GAGCCTTCAA GACAAAGATC TGTCTAA
 
Protein sequence
MLELEGDDKA MARKIYLVNN ALDEIGFTWF HVKCFVVAGY GYVADSLLGM AQSTVATYVN 
LQFNQTYPLS TQVLYIGLFS GCVFWGLSGD IIGRKLAFNL TLFLCAILSF LVGAMSSFPM
YCFMLAISSF ALGGNLAIDA TVFLEFLPFN YQWLTTFFAC WWSLGQAVGY GVAYAFVVPE
KWHCTSADNC PSESNRGWRY VWYVDAGIVF FFAVIRLMLK LEETPKFLVT NNRDAECVEQ
LQAIAKKYNR TCSLTLEDLQ ACGEVKKNDF KMSDPKLKDF FSSSIKNSKA LFSTKKMSIN
TLMLFMSWFG IGIAYPLWGT FLPVYIASKG GHTSADDAAG VYGDALLSTC LSFFGPVIGG
LLILIPRVGR RGTLCIGGIT SMIFFMAYTT VRTRPGALGF STAAYICIYI YYGCLYGYTP
ECLPSYCRAT GSGLAFVFNR IAGLIVPVIA YYAKPTTSVP IWVCASFIGL IGIGSLFFPF
EPSRQRSV