Gene PICST_29233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_29233 
Symbol 
ID4851961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp3306489 
End bp3308411 
Gene Length1923 bp 
Protein Length640 aa 
Translation table 
GC content37% 
IMG OID640393669 
Productpredicted protein 
Protein accessionXP_001387202 
Protein GI126276149 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.116558 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAAAG AAGACTTGGA GAGTGAAGTA GTTGAAAAAA ACACCTACTG CTTCGAAGAT 
AATGAATACT GCAAATTAGC TTTAGATGAC TCTAATCAGC TTTTACCTTA CAAAGAAAAG
ATTTCTAAGA AGTGGGTCCT CACTGTCATC TTTAGTATTA TTCTATCATT GTTTTCGTTT
TGCTTATATT TGTTTGTTAG ATACAGCTAT TCAAACGATT CCGTCAAATC CTTTGGACAT
AATGTAGATG AATGGGTGAC AATGACTGCA ATTGATGGTG AACCTATGCA AGTAATCGAA
AACAATTTTG AAAGGATTCC TTTGGAGGCA AAAGTCATGA ATACTACGGT CTACGAGAAA
CAGAAATTAG TAAATGGTTT GAGTGCTACT TTCTCCAAGT TTGAAGACTT CGAATTTGAC
TCTGTTTTTT TGACATTGAA CTTCACCAAC AATGATAACG ATACCGAAAA TGTCAATGTA
ATTGAGATCT CCATCGATGG TCATCCAGTA TGGAGAACTT CCCCACCACT TTCTAAGGTA
GGCACCACAA CCTACTCAAG CACCAGTAAA GACATTTCCA AGTACATTTC ACTCTTTGGT
AAAAGTTCCA ACCAATTCAA AGTGCAAATC TTGGAAGGGA GTCATGACAA AATCAGCTTT
GCTCTTGCTC TTACATTAGC GAACTGCGGA AAAGAAAAGC CTGTTATAGG TTCTCCAATC
ACAGTGAGTT CGCTTTTCAA TTCAACTGAG CCTGCAAACG ATATAATAGC ATTGACCAAA
TCAAATGGAG AAGTGTTTGA CTTCTCGAAG AACGACAAGT TCTTAGTTGA ATTACCAAAG
TTCAGTGGAA AGACATTTGC TGCAAAGCTA GAATTATTTG TCTCTGCTGG AAAGTCTGAC
GTTGAATTTT TCATTCTTAA TAGAAATTCT CCGTTTCGTC TTTTGAACAT TTTCATTAAT
GAACAACTAA TTGCTACTAT CTCACCAAAC CCTATTTTGT TTCACTCGAA CTCTATCATT
CCCAGCGCGT ACTCTGGGCC TATTGCTCCT TTTGGTAGTT TTACCGGATT TAGCTATGAA
GTAAATTTGG CCAACGTTTT GCCAATTTTG TGGGGTCAGA AATCCACCTT AGAAGTTCAA
CTTGTTTCTC CTGTCAACGA TATGTTCCCC TTGGAGATTG AAAACGTATT TTCCCAAAAG
GATAATTATG GTGTCAACGA TTTTAGATTT TCATCCCCTG CAAAGCCAAT AATGAAGGGC
GACAATCAAG CCAATACAGA ATGGTACGTG TCTGGCAATA TCTTTCTGTG GGAAAACAAA
GACATTTTGA CTTCTCAAGG TGAGATATTA GGGTCTGGAA TAACCGAAAC TTATAGTAGT
AGTTTACGAT ATGACCCTCG CAAAATTGCT ACTAAAATTT TAACTGTTTC AGACGATTTT
TATTCTTCCC ATTCGTCGAC TCTTCGATTT TCTTTAAGGA ACCAAACTCT GTTAAACTTC
ACTGTAAATC AAAAGGGTTC TACTAAGAGT TATTTAACTG AGCACCAGAT TAATGGCAAA
ATTTCACTTG ATGTCGTTGG TCACAAACTG AGTATGTTTG GCCACATGTA TACATACTTA
ATTGTTCTTG ACGGAAATGC CACTAACGAG TTATTACACG AATATCGTTT CTTTTCATGG
TCCATAATAA ATGAGGTCAT TATATCTGGT GAAGATGCTG GATGCTTAAA TTCAGCTAAA
ATCTACTCCG GCGCCAAAGT GGGTAAAGTT ACCAATCGCC TATGCGAGTC TGAGTCAACC
AAAGCATTCG ACTCATATTC AATGGACAAT AGCATAGTCA AGCTGAGGTT TGAATCTTTC
CGTGATTCTG AAAGGCATTC AAGATTATTC ACATTCCAGC CTAACAAAAG CCAAATGTTC
TGA
 
Protein sequence
MSKEDLESEV VEKNTYCFED NEYCKLALDD SNQLLPYKEK ISKKWVLTVI FSIILSLFSF 
CLYLFVRYSY SNDSVKSFGH NVDEWVTMTA IDGEPMQVIE NNFERIPLEA KVMNTTVYEK
QKLVNGLSAT FSKFEDFEFD SVFLTLNFTN NDNDTENVNV IEISIDGHPV WRTSPPLSKV
GTTTYSSTSK DISKYISLFG KSSNQFKVQI LEGSHDKISF ALALTLANCG KEKPVIGSPI
TVSSLFNSTE PANDIIALTK SNGEVFDFSK NDKFLVELPK FSGKTFAAKL ELFVSAGKSD
VEFFILNRNS PFRLLNIFIN EQLIATISPN PILFHSNSII PSAYSGPIAP FGSFTGFSYE
VNLANVLPIL WGQKSTLEVQ LVSPVNDMFP LEIENVFSQK DNYGVNDFRF SSPAKPIMKG
DNQANTEWYV SGNIFLWENK DILTSQGEIL GSGITETYSS SLRYDPRKIA TKILTVSDDF
YSSHSSTLRF SLRNQTLLNF TVNQKGSTKS YLTEHQINGK ISLDVVGHKL SMFGHMYTYL
IVLDGNATNE LLHEYRFFSW SIINEVIISG EDAGCLNSAK IYSGAKVGKV TNRLCESEST
KAFDSYSMDN SIVKLRFESF RDSERHSRLF TFQPNKSQMF