Gene PICST_56149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_56149 
Symbol 
ID4837200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2010402 
End bp2011514 
Gene Length1113 bp 
Protein Length370 aa 
Translation table12 
GC content45% 
IMG OID640388515 
Productpredicted protein 
Protein accessionXP_001382614 
Protein GI150863957 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.462379 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.700451 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCTACA TCAAGAATCA GTATATTCGT TATATACGAA AAAAGCCTTT ATCATTGATA 
GCACCGATTT CCGTGTTATT GCTCGTGTAT TTCTACTTCT TTGCGGCACA TGGCTCTTCC
TCCTCATCTT CTGGCAACAA ATACAGCTAC AAGAAGAAAT CCCGAGGTTT GTTTGCAAAG
AACAGAGACC TGGTGATTCT TAAAAACTTG CCTAAGAATC ACATCAGCCA CTACGACTTG
AACAAGTTGT CCACTTCTGC CGATTCGCTT GCAAAGAAGG AGGAGGTGTT GATTTTGACG
CCCATGTCAC GTTTCACGCC ACAGTACTGG GATAACATCC AGAAGTTGAC GTATGAACAC
AGCTTGATTC TGTTGGGATT CATTTTGCCT CGTAACAAAG ACGGTGATGT AGCACTTAAG
CATTTGGAAG AAGCAATCAA AGACGCCAAA GCGGCCAACC AGTTGAAATA CAAGAAGATC
ACCATATTGA GACAAGACAC GAACTCTCTT AACTCGCAGT TGGAGAAGGA CAGACATGCA
CTCAATGTGC AGAAAGAAAG AAGACTGATG ATGGCTCTTG CCAGAAATTC GTTACTTTTC
ACGACTATTG CGCCAACTAC GTCTTGGATT TTATGGCTAG ATGCCGACAT CGTAGAAACT
CCTGCTGGAT TGATTCAGGA TTTGACGTCA CACAATAAAC CAGTTATTCT GGCCAACGTG
TACCAGAGAT ACGAAGACGA ATCGACACAA CAACCATCCA TCAGACCGTA TGACTTCAAC
AACTGGGTAG AATCAGAAGA AGGCTTGAAA ATCGCTGCAG GTTTGGCAGA CGACGAGATT
GTAGTTGAAG GTTACGCTGA GATGGCTACC TACAGACCGC TCATGGCTCA TTTCTATGAC
GCCAAAGGTG ACGTCCATAC CGAAATGCAA TTGGATGGTG TTGGAGGAGG TGCTGTCATG
GTCAAGGCTG ATGTCCACAG AGATGGAGCC ATGTTCCCTT CGTTTCCATT CTACCATTTG
ATAGAAACAG AGGGTTTTGC CAAGATGGCT AAACGCTTGG GCTACGAGGT GTTTGGTTTG
CCCAACTACT TGGTATACCA CTTCAACGAG TGA
 
Protein sequence
MVYIKNQYIR YIRKKPLSLI APISVLLLVY FYFFAAHGSS SSSSGNKYSY KKKSRGLFAK 
NRDSVILKNL PKNHISHYDL NKLSTSADSL AKKEEVLILT PMSRFTPQYW DNIQKLTYEH
SLISLGFILP RNKDGDVALK HLEEAIKDAK AANQLKYKKI TILRQDTNSL NSQLEKDRHA
LNVQKERRSM MALARNSLLF TTIAPTTSWI LWLDADIVET PAGLIQDLTS HNKPVISANV
YQRYEDESTQ QPSIRPYDFN NWVESEEGLK IAAGLADDEI VVEGYAEMAT YRPLMAHFYD
AKGDVHTEMQ LDGVGGGAVM VKADVHRDGA MFPSFPFYHL IETEGFAKMA KRLGYEVFGL
PNYLVYHFNE