Gene PICST_49223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_49223 
SymbolGRP3.4 
ID4840508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp1060895 
End bp1061905 
Gene Length1011 bp 
Protein Length336 aa 
Translation table12 
GC content43% 
IMG OID640391823 
Productprotein induced by osmotic stress 
Protein accessionXP_001386209 
Protein GI150866566 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.376808 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACTG TCTTTGTCTC CGGAGCCAAT GGGTTCATTG GACAGCATAC TGTTCAGCAA 
TTGTTAGAAG CTGGTTACAC TGTGATTGGA TCCGTAAGAT CGCAAGAAAA AGGTAAAAAG
TTGCTGACGG CATTGAAAAG CGACAAGTTC TCGTTCGTAG TCATTCCCAA CATTGCTGAC
GTTGGAGCTT TTGACAAAGT GTTGGATGAC AATAAGCAAA TCACCACATT TTTGCATATT
GCTTCTCCCT TCAGATTTAA TGTTCAAGAC ATTGAGAAGG AAATATTGAT TCCTGCAATT
GAAGGTACCA GAAACGTCTT GACCTCTATC AAGGACCATG CTCCACAGGT TACGAAGGTT
GTTGTTACAT CTTCTGATGC TGCTGCCAGA GAAAACGACG ACAAGAACCC AGACCTCACT
CTTGACGAGT CGGTATGGAG CAAGGCTACT TACGAAAGTT CTAAACACGA CCCAGTAGCT
GCATATCTTG GCTCCAAGCC TTTGGCTGAG AAATTGGCGT GGAAGTTTGT TGAAGAGGAA
AAGCCAAACT TCAAGTTGAT CACCGTGTTG CCAAGTTACA CCTTTGGCCC TCAGATTGAT
GATTCCTTAG TGTTGAAGGA CTTGAACTCG TCCTCTAAGG TGTTTGAAGA AATCATTACG
CTGAACCCAG ACTCTCAATT GTATACTCAC AATGGTAGCT TTGTTGATGT GAGAGACGCT
GCCAAAGCTC ATTTAGTAGC CTTCCAAAAT GACGAAGCCA TTGGCAAGAG ATTAATCTTG
TCCAGTAATA GATTCACCTC ACAAACTATT AGAGATATTC TTCTCAAGGA ATTCCCACAA
TTCAAGGGAC AGATCTTCGA AGGTGTGCCT GGTGAAGACA TTGAAGATAT CAAGCAAATG
CCTGTGTTAA ACTACTCACA AACCAACAAC ATCTTGGGCT TCAAATTCAG AGACATCAAG
ACATCCTCAG TTGATGCGGT AGCACAACTC TTGAGAGTTA GAGATGCTTA A
 
Protein sequence
MTTVFVSGAN GFIGQHTVQQ LLEAGYTVIG SVRSQEKGKK LSTALKSDKF SFVVIPNIAD 
VGAFDKVLDD NKQITTFLHI ASPFRFNVQD IEKEILIPAI EGTRNVLTSI KDHAPQVTKV
VVTSSDAAAR ENDDKNPDLT LDESVWSKAT YESSKHDPVA AYLGSKPLAE KLAWKFVEEE
KPNFKLITVL PSYTFGPQID DSLVLKDLNS SSKVFEEIIT SNPDSQLYTH NGSFVDVRDA
AKAHLVAFQN DEAIGKRLIL SSNRFTSQTI RDILLKEFPQ FKGQIFEGVP GEDIEDIKQM
PVLNYSQTNN ILGFKFRDIK TSSVDAVAQL LRVRDA