Gene PICST_30682 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30682 
Symbol 
ID4837843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp668510 
End bp669529 
Gene Length1020 bp 
Protein Length339 aa 
Translation table12 
GC content42% 
IMG OID640389158 
Productpredicted protein 
Protein accessionXP_001383411 
Protein GI150864551 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.673385 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0672265 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATCAA CCAAAGCTCT TCCATATCCA GATGAAGTCA ATATTGACAA ACTACAAGGT 
ACCAAGTACG ATTTCGCCAG AGATTTGATT GGCTATGGTG AAAAGTCACT AGATGCTAAA
TGGCCCAGTG GCAAGAAGGT TGCTATTTCC TTTGTGTTGA ATTACGAAGA AGGTGGTGAA
AGGTCGCTAT CCTTGGGAGA TGACACTCAA GAATTCACCT TGACTACTCC ATCAAAAGGT
GTTCCTATTC CATTTAGACT GTTTGATCTT GAATCCGAAT ATGACTATGG TTCCAGAGCC
GGTGTGTGGA GAATCTTCAG ATTGTTCAAG AAGTACAATT ACCCGCTCAC TGGATACATT
GTTGGTAAGG CTGCTGAAAG AAATCCAGAG GTTATGAAAG CATTTCTCAG GGATGGCCAC
GAAATCGCTT CTCATGCCTA CCGCTGGATC CCTTACGCTG GATTGGAACC GGAAGTGGAG
AAAGGATATA TTATCAAGCA ATTGCAAGAA CTCAAGAACA TCACTGGTGA ATACCCCAAG
GGCTGGTACT ACGGGAGACT TTCCACTCAT GCCTTGGGTT TGGTTACTGA AGTATACAGA
GAGCTCGGTA TTCCTCTTGA ATACATCAGT GACTACTACG GTGACGATGT TCCAAGATGG
ATCGAAGTTC CTGCGGAAAA AGATTTACCA AAAGAAGAAA AGAAGGGTTT GTTATTGGTT
CCTTACTCTT ATGACTGTAA TGATTTCAGA TTCTTGAACC CTAATGGTTT CAGATCCGAT
TCAGCTTTCT TGGAACACTT GATCAATGCG TTCACGACCT TGTATGAAGA AGCTGACGAA
TGTGGAGCAA AAATGATGAC AGTTGGTCTT CATTGCCGTA TTATTGGTAA GCCAGGCTAC
TTCCAATCGT TAAAAAAGTT TATTGAACAC ATTAGTCAAT TTGAAGACGT GTGGGTTTGT
CGTAGAATTG ACATTGCCAA TCATTTCAAG GAAACCTTCC CATATTCTCC TTCAGAATAA
 
Protein sequence
MQSTKALPYP DEVNIDKLQG TKYDFARDLI GYGEKSLDAK WPSGKKVAIS FVLNYEEGGE 
RSLSLGDDTQ EFTLTTPSKG VPIPFRSFDL ESEYDYGSRA GVWRIFRLFK KYNYPLTGYI
VGKAAERNPE VMKAFLRDGH EIASHAYRWI PYAGLEPEVE KGYIIKQLQE LKNITGEYPK
GWYYGRLSTH ALGLVTEVYR ELGIPLEYIS DYYGDDVPRW IEVPAEKDLP KEEKKGLLLV
PYSYDCNDFR FLNPNGFRSD SAFLEHLINA FTTLYEEADE CGAKMMTVGL HCRIIGKPGY
FQSLKKFIEH ISQFEDVWVC RRIDIANHFK ETFPYSPSE