Gene PICST_82104 details

Gene Information

Locus tag: PICST_82104
Symbol: SOL2
ID: 4837337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 1402240
End bp: 1403221
Gene Length: 982 bp
Protein Length: 260 aa
Translation table: 12
GC content: 46%
IMG OID: 640388652
Product: 6-phosphogluconolactonase-like protein
Protein accession: XP_001382491
Protein GI: 150863868
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0363] 6-phosphogluconolactonase/Glucosamine-6-phosphate isomerase/deaminase
TIGRFAM ID: [TIGR01198] 6-phosphogluconolactonase
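
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, which the record does not state explicitly. Under that assumption the span matches the listed gene length, and the coding region implied by the protein length (plus one stop codon) is shorter than the listed gene, so the listed sequence appears to include some bases outside the coding region.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1402240
end_bp = 1403221
gene_length_bp = 982
protein_length_aa = 260

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Coding region implied by the protein length, plus one stop codon.
cds_bp = (protein_length_aa + 1) * 3

# Bases in the listed gene that fall outside that coding region.
flanking_bp = gene_length_bp - cds_bp

print(span_bp, cds_bp, flanking_bp)  # 982 783 199
```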


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.650309
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.457187
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
AACATGTGTG CCAAAGTATA CTCGTTTGCC GAGTCGGTCG AGGTGGCGAA TTCCGTGGGA 
AAGCACATCT TGAAGCTCCA AAAATCGGCT ATCGCCAGTT CCGACTCGTT CAAGATTGCA
CTTTCCGGAG GCTCACTTGG CAATGTTTTG AAGAAAGCTT TGATTGACAA CACATCTATT
GCTTCTGAAG TTCAATGGGA CAAATGGGAA GTGTACTTCA GCGACGAGAG GTTGGTGCCG
TTGGATCATC CCGACTCCAA CTACGGTTTG TTCAACGAGT TGGTGTTGAA GAACTTGCCT
TCTGGCACCA AGCATCCATC GTTGCACGTG ATTGACGAAT CACTTTTGAC CGGTAAAGAT
GGGCAAGCCG CAGGTGCAGA TCTGGACAAG GACAAGCAGA TTGCCGAAGC CTACGCAGCC
AGCTTGCCAC AGGATGCAAA ATTTGACTTG ATCTTGTTGG GATGTGGTCC AGACGGCCAT
ACCTGCTCTC TTTTCCCTGG CCACAAGTTG TTGAATGAAC GTGATCAGCT CATTGCTTAC
ATCAGCGACT CGCCCAAACT TCCACCAAGA AGGATCACCT TCACCTTCCC AGTGTTGGAG
AGAGCTACAG CCATTGCTTT TGTGGCAGAA GGTGCCGGAA AGGCTGCGAT TTTGAAAGAC
ATTTTTGGCG ACGAGCTGTC CAAGTTGCCT TCTAAGTTAG TCAACGACAT ATCTGGGGTT
GAGGTGTCGT GGTTCGTAGA CAACAGTGCT GTCGAGGGAG TAAGAGTAAT CACTTCCAAG
TACTAAAGGT ATACGGTAAT GCTTGAACGA GGATTGCCGG GTATGGGTGG TTGAAATCCA
TAACTATATA TAAGTGTATC GTAGCTGTCA TGGCTATCAT GGCAATTCAA GCTGTCGTAT
ACGTATACAT TTGTAGTTCA TCTCAGTGTG AACCAGATAT TCATATATAA CTAAATGTCT
TATATATACA ATATATGAAT AG
 
Protein sequence
MCAKVYSFAE SVEVANSVGK HILKLQKSAI ASSDSFKIAL SGGSLGNVLK KALIDNTSIA 
SEVQWDKWEV YFSDERLVPL DHPDSNYGLF NELVLKNLPS GTKHPSLHVI DESLLTGKDG
QAAGADSDKD KQIAEAYAAS LPQDAKFDLI LLGCGPDGHT CSLFPGHKLL NERDQLIAYI
SDSPKLPPRR ITFTFPVLER ATAIAFVAEG AGKAAILKDI FGDESSKLPS KLVNDISGVE
VSWFVDNSAV EGVRVITSKY
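
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates how that affects this gene. It is a minimal standard-library example, not the database's own pipeline. The function names and the `ctg_as_serine` flag are hypothetical, and both test strings are short excerpts copied from the gene sequence above.

```python
BASES = "TCAG"
# Standard genetic code, one letter per codon in the order TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_codon_table(ctg_as_serine=True):
    """Return a codon -> amino acid dict; optionally apply the table 12 CTG change."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AA))
    if ctg_as_serine:
        table["CTG"] = "S"  # the only difference used here versus the standard code
    return table


def translate(dna, offset=0, ctg_as_serine=True):
    """Translate from a 0-based offset until a stop codon or the end of the input.

    Only handles A, C, G and T; any other symbol raises KeyError.
    """
    table = build_codon_table(ctg_as_serine)
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(offset, len(dna) - 2, 3):
        amino_acid = table[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First bases of the gene sequence; the ATG begins at 0-based index 3.
head = "AACATGTGTG CCAAAGTATA CTCGTTTGCC GAG"
# In-frame excerpt starting at base 361 of the gene sequence; it contains one CTG codon.
excerpt = "GGGCAAGCCG CAGGTGCAGA TCTGGACAAG GACAAG"

print(translate(head, offset=3))                 # MCAKVYSFAE
print(translate(excerpt))                        # GQAAGADSDKDK
print(translate(excerpt, ctg_as_serine=False))   # GQAAGADLDKDK
```

With the CTG change applied, both outputs match the listed protein sequence: it begins MCAKVYSFAE, and the excerpt covers residues 120 to 131 (G QAAGADSDKD K). Under the standard code the same excerpt would carry a leucine where the record shows a serine.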