Gene PICST_67724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_67724 
Symbol 
ID4838430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1786206 
End bp1787431 
Gene Length1226 bp 
Protein Length381 aa 
Translation table12 
GC content43% 
IMG OID640389745 
Productpredicted protein 
Protein accessionXP_001384296 
Protein GI150865184 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.579873 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AACATTGGCG ATGTTGGATG AAGGTGATCA CAGGACAGAA CTTCCTCGCG TATTATCTAG 
TGCAGATTAC CAGAGAGCTG GAAAGCACCA ATTTGGAGAT GGGTTTAAAT TCCTTCCACT
ACAGGAAAGA GCAATCAGGG ATATGGCATC CTCTCACACG GTGGTGAAGA TTGACAAAGG
CAAGACGGAA TGTTGCATTC TTCTAATGCT TGCTGACAAG ATCTACATCG ACACCTGGGG
AATCAAAGAC CACTGTGTCA GTTTGTTGGT AGTGCCGTGT TGGTCTGAGC TCAACTCCAC
TGTTGCTAGA ATTGAACAGG CAGGATTAAA GGTCCGCTAC ATCGAGGATG GGGTCAACGA
TCCTCTAGAA TATGACGTTC TTGTCTTGCA ACAGACGGCA AATGATTTTG GGCAAATCCA
TGAGCTTACC AAACAAATTC GGAAGAGCGG AGAATGCTCG GCACATGTTC GTCGCGTTAT
CATAGAAGAT GCTCACTACT TGCAGATGTC ATCTATAAGC GATAATTCTA TCAAAGATAT
ACCGTATGCT TTCATGTCGT CTTTCCTTGC TCCAGAAGCA ACATTGATGG CAAACATGAG
CATTGACAGC TACAATGTAG TGGAAGACGA GGACAGACAA ATTCGACGCG TCGAAATGCA
AGTTGACGTT AAGGAATCAG AGACTGATGT GACCAACAAT GTAATGCTGT ACCTCGGCAC
TTTAGTACTG TACAATTCCT TGATTATTGT CGACAATGAT GCTAGAGTTG AACACTTGAT
TGAATTTCTC GACGATCTTG TGCCGTGCAT AGGAATCGTA GGATCGTCAA CACAAGAAAG
GAGAGAAAAA GCCAATGAGA TCAAAGAGGG ATTGATGGAT GAAAATGTGT GTGTAGTTGC
TACTGCCAAG TCATTGGTAG GTCTTGACTT CGAAGTGGAG GAGGTCATAA TTGCGTACGC
GGTCACGTCG GAGATCTCAT TGTTGCTTGC GACCAAGATG ACCGATAGCT TGGTCTATTT
GTGCTTGGTA AAGGACAGAA ATAGTGACTA CCAACGAAAA TGTCTCTATA GCATCATGCA
AGAATATTTG CTATTGAAGG CTTCAACTTG TGCCAACGAA GGTAGTAATT ACTGCTCCAA
TTGTGAGGCT AGTTAGGTAT TTAGAAATAA AGTAGACAAA AAGAGGTCTT TAACATGATC
TTCTTGCTGT CACATATTTG GATTTT
 
Protein sequence
MLDEGDHRTE LPRVLSSADY QRAGKHQFGD GFKFLPLQER AIRDMASSHT VVKIDKGKTE 
CCILLMLADK IYIDTWGIKD HCVSLLVVPC WSELNSTVAR IEQAGLKVRY IEDGVNDPLE
YDVLVLQQTA NDFGQIHELT KQIRKSGECS AHVRRVIIED AHYLQMSSIS DNSIKDIPYA
FMSSFLAPEA TLMANMSIDS YNVVEDEDRQ IRRVEMQVDV KESETDVTNN VMSYLGTLVS
YNSLIIVDND ARVEHLIEFL DDLVPCIGIV GSSTQERREK ANEIKEGLMD ENVCVVATAK
SLVGLDFEVE EVIIAYAVTS EISLLLATKM TDSLVYLCLV KDRNSDYQRK CLYSIMQEYL
LLKASTCANE GSNYCSNCEA S