Gene PICST_50942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_50942 
Symbol 
ID4840840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp151770 
End bp152825 
Gene Length1056 bp 
Protein Length351 aa 
Translation table12 
GC content45% 
IMG OID640392155 
Productpredicted protein 
Protein accessionXP_001386426 
Protein GI126139808 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1635] Flavoprotein involved in thiazole biosynthesis 
TIGRFAM ID[TIGR00292] thiazole biosynthesis enzyme 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.920544 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCCTC CAACTAGAAT CGAAACTACC ACTTCAGTTG TAGAAGTGAA CCTTGCTAAA 
GTCAGCAAGA AATCTATCAA ACTTGAGTCT CAAGCTGACA ATGCCGAAGT CACATTCGCA
GACTGGGAAA ATTTCAAATT TGCACCCATC CGTGAGTCTA CAGTTTCCCG TGCTATGACC
AAACGTTACT TTGCTGACTT GGACAAATAC ACTGAATCTG ATGTTGTTAT CGTTGGTGCC
GGTTCTGCTG GTTTATCTGC TGCTTACGTT TTGGCCAAGA ACAGACCAAA CTTGAAAATT
GCTATCATTG AAGCTTCTGT ATCTCCTGGT GGAGGGTGTT GGCTCGGTGG ACAGCTTTTC
TCGGCCATGG TGTTGAGAAA GCCTGCCCAT CTCTTCTTGG ATGAATTAGA AATTCAATAC
GATGACGAAG GAGACTATGT TGTTGTCAAA CACGCTGCTT TGTTCATGTC CACTTTGTTG
TCTAAAGTTT TGCAATTTCC TAATGTCAAG TTGTTCAACG CTACTGCAGT TGAAGACTTG
ATCACCAGAA GAGATGAGAA CACTGGTGAA TTGAGAATCG CAGGTGTGGT GACCAACTGG
ACTTTGGTTG CATTGAACCA CGACACTCAA TCTTGTATGG ATCCTAATAC CATCAACTGT
AACATTGTAT TGTCTACTAC TGGCCACGAT GGTCCATTTG GTGCTTTCTC AGCCAAGAGA
TTGGAAGAAC TCGGTAAGGC TCCTAAGGAC ATCACCCAAG GCTTCAGACC TCAAGAACGT
GCACAACCTG TTGCAGCATC TGCTGATGGT TTCCAATTGG GAGGCATGAG GGGCCTTGAC
ATGAACAAGG CTGAAGATGC CATTGTCAAG GGTACCAGAG AAGTTGTTCC AGGATTGGTC
ATTGCTGGTA TGGAATTGGC TGAAGTTGAC GGTTCTAACA GAATGGGTCC TACTTTTGGA
GCCATGGCTC TTTCTGGTGT CAAGGCTGCT GAGTCTGTGT TAAACGCTTT TGACTTGAGA
AAGAAGCAAA ACGAAACTTG CTATGGTGCC CAGTAA
 
Protein sequence
MAPPTRIETT TSVVEVNLAK VSKKSIKLES QADNAEVTFA DWENFKFAPI RESTVSRAMT 
KRYFADLDKY TESDVVIVGA GSAGLSAAYV LAKNRPNLKI AIIEASVSPG GGCWLGGQLF
SAMVLRKPAH LFLDELEIQY DDEGDYVVVK HAALFMSTLL SKVLQFPNVK LFNATAVEDL
ITRRDENTGE LRIAGVVTNW TLVALNHDTQ SCMDPNTINC NIVLSTTGHD GPFGAFSAKR
LEELGKAPKD ITQGFRPQER AQPVAASADG FQLGGMRGLD MNKAEDAIVK GTREVVPGLV
IAGMELAEVD GSNRMGPTFG AMALSGVKAA ESVLNAFDLR KKQNETCYGA Q