Gene PICST_51621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_51621 
Symbol 
ID4850776 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp43659 
End bp44840 
Gene Length1182 bp 
Protein Length372 aa 
Translation table 
GC content40% 
IMG OID640392484 
Productpredicted protein 
Protein accessionXP_001387662 
Protein GI126273520 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0524] Sugar kinases, ribokinase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0478079 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATAGCACAAA TGAAGGACAA AGTTCATATG AACGATTCCA ATATTGCTGT AATAAAAGCG 
TCAGTGGGAG GTGTTGGACA TAATATTAGT TTAGCCTCTC ACTATTTCTT GTCGTCCAAA
ACAGCATTGA GAAAGTCAAA GTTTGTATCA ATCGTCGGTA ACGACTTTGC TGGTAGGGCT
ATTCTCAATC AACTCAAATC AAGCGAGTTT GATGCCTCTG GTATCTTGGT AACTTCTAGC
GATAAAGCAA CTGCTCAATA TCTGGCAACA CATGACCTGG ATGGCAACCT CGTTGTTGCT
GCAGCTGATA TGTCCATTAT CGAAGAGGAC TTTTCCGAAT TCGTCATTGA TCAGGTAAGT
CCACAAGTTT CCAACATTGT ATTTGATTGC AATCTTTCTC CAACTCTTGT AAATAAAGTG
ATGAATTCAC TACAGGCTTC AAATAAAGTA GTAAATGTGA TAATTGAGCC AACGTCGTTA
CCAAAATCGT CACGGATTGC GAAATTGATA AACCGTCCTT TCCCAAACAA CTTCATCAAG
TTGATAACTC CAACAGTGCT GGAACTAGAA GCTATTCATG CTTCTTTCTA CAACAACGAC
AAATACGATG ACTACGACGA ATGGTTCCCA GTATTGGATT CATTAGGTGT TGACTCTTTC
TTCAGGGACA AACTCAATCT TCTTGCTAGC AAAAGATTTC CAGTTCTCAA AGACTTGATG
GATCGAGGTA CCTTGCAGCA AAGTTTTCAG CTTGTCCCTT TCTTCCAGAA TATATTGATT
AAGATTGGAA GCAAAGGAGC CATAATGGTT AGTCTTTCAG AGAACGTTAA TGACTACAAA
TCCATTCCTA CAACTTCGAA GTACAAGCCT GCATTCATCC TCGCATCGGA GGGCAGGGTT
ACCAGTGACA ACAAGCGTAT GGGTGTGGTA ATAGAGTATT TCCCAATACC AATGGAGAAT
GAAAACTTAG ATATCATTAA TGTCACTGGA GCTGGTGATT CCATGCTTGG AACTCTAGTA
GCTCAATTGC TGAATTCTCC ATACAATTGG TTGGCAACAG AAATAAATAG CGTTGAACAA
GAATGGAGCA AATGGGAACA TATCTACAAA GCTCAATTAG CTAGTGGGTT GACCCTTACT
TGTGAATCGG CCGTTAGCAG TGGCATACAA CTTATTGAGT GA
 
Protein sequence
IAQMKDKVHM NDSNIAVIKA SVGGVGHNIS LASHYFLSKS KFVSIVGNDF AGRAILNQLK 
SSEFDASATA QYLATHDLDG NLVVAAADMS IIEEDFSEFV IDQVSPQVSN IVFDCNLSPT
LVNKVMNSLQ ASNKVVNVII EPTSLPKSSR IAKLINRPFP NNFIKLITPT VLELEAIHAS
FYNNDKYDDY DEWFPVLDSL GVDSFFRDKL NLLASKRFPV LKDLMDRGTL QQSFQLVPFF
QNILIKIGSK GAIMVSLSEN VNDYKSIPTT SKYKPAFILA SEGRRMGVVI EYFPIPMENE
NLDIINVTGA GDSMLGTLVA QLLNSPYNWL ATEINSVEQE WSKWEHIYKA QLASGLTLTC
ESAVSSGIQL IE