Gene PICST_33275 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33275 
Symbol 
ID4840753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp189957 
End bp192086 
Gene Length2130 bp 
Protein Length709 aa 
Translation table12 
GC content38% 
IMG OID640392068 
Productpredicted protein 
Protein accessionXP_001386052 
Protein GI150866446 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCAA CTAAGATCAT TATTCGAAGA TTCAGTGCTT CAGTTCGAAT ACGACAAAAG 
TCGGAACCAG TTCGTAACCG ACTAGAAGCG GCGGTAGCCA ATTTCATCTC AAAAGTTCAG
AAAAGCCAAT CCAAAAGCTC AGGATCTATT CTCAGTGCCA TAGATCTCCT CATAGACAAA
CACAACGAGA CTCTCGCTGC ACACTTCGGC ATTCCTCATG AAACCAACGA CTACTTAAAG
GTGAAACAGC AGATATGGGA CAAACTTTCT GTAGGGGACA TTTCCCTAGC CCAAGATCAG
GAATCTGTAG AAGCACAGCG AAGTAACTTG TTGAATCTGC ACTCACTTAG ATTGTCAAGC
TCATCCAGTA CATCACTCTC TAAATTTGTA AAATGTTTAT ACCCGTTGCA TAAATCGACA
TCTCCAAGTC TCAGCTCCAG CAACAGTAAC AAAGTGCTTG ATATACCTTC AAGTTTGGGT
AAGTATGAGT TAATAAACCA TAACCAGCTC CACAGAACCT TTTATGAATT GCCGTCCCCG
CAGCCTTTCT ACCTTAGCCC CCAGCATTTG AATGACTTTT TGCATCGGTT TCTATTCAAT
AAGCGTGACT TCATCAAGTC AAACATCGTA TCATTATCCA ATTTGAACCA TTTTGCACCC
TATCGTTTCT TTGAATACTA CAACCAGATG CTTTTTAAGA GACAAGAATA CGTATCTATG
GTCACAAAAA TCTTGCAAGA TCTTCAAACA TATGGATTTG AGACTTCACT CACCGAACAG
AATCAAATTA TCTTCTATAC TTTCTTCAAA GACAAAGCTC CTATAGTACA TAAAATCAAT
GAAGCTGTGG GGAAACTCAA GGAACTAGGA ATGTCAGTAA CACAGGATGC TTACCCAACG
TTTGATCTTG ACACTTATAA CCAATTAAAA GAGTCCATGA TTGCCAAGTA TGGTAAATTG
CATATCAGCA GCTTGAACAT ATTCTTGCTA CATGCGATTA GACACAGCCA AGAGCTGGTA
ATTGTAGATG TGCTACAACA AATTGGTTTT GCTGAACTTG TTGGTGGCAA CAGCCTTAGT
GAGAGATTAT CGACACCTGA TGAAACTACT TTCGCACTAT TATTCGAGTA CTTTTCATCT
TCAAGCTATG CATCGAGTAA AGGATTATCG CTGTTTGTAG AAGTGTTTGA GAAAGCAGTT
GCGGACAAGT CGTTAGTTTT TGATGTCAAA CTTATCAATG TATTGCTTAA GTGTTTATGT
GAGAATGAAG AGTTGAATTT TGCAGAAAAC ATCGTCTCTA AACTTTTTTT GCAAGAAGTT
CCTAAATTAC AAGTTGACGA ATCGGAAGCT GTTGTACTCA AACAAATGAG TCCAAGAGAT
AAGTTGGCTT ACAAGAAATT GGAATTGATT TATAATAATA TCAGAAAAGT GACGAACGAT
TTTGAGATTT CATATACCAT TTTACCAACT GAAAGTACCT TTAAGCCTTT GATTTCGAGC
TACTGTCTTC CATTGTCAAA TACAAAGAAT AGCTTTCGCA AGGTAAATTA CTTAACTAAT
TTCATGGAAA CATACTACAA GTTGCCCATA ACTACTAGAA CATTCAAGTT AATATTTCTG
AAGTTTGAGG CATCTAAGGA AGAAGGATGG GAGTTGTTTG ACCTTATAAA TATCACAAGC
AAGTTGATTT CATTGCATGA CTATTCTTAT AACTTAAGTG AGGATACTTC TTTGTTCGGC
ACTGATGGTC AGTTTTCCAA GTTAGATAGG ATCGAGAAAT CGTCACAATT GACAAACTTC
ATTAATAATC ATTTGGTGAA GCAGATTGAT CTCAATATTC CAATAGCACA AGGCAGCTTT
GTTAAACTTA ATGATAGCCT TATTACTTCT GTGTATCATG CATTTATGGA AACGATAAAG
CGAAGTGCTC CAAACAACAA GGAGGGGGCC GATGAATTAG TAAGACGCAT TCAAAGGCAG
TATGACAGTC TTCGAGAAAG GATAAATAAG TCCAGAGGTC CAGTGGACCA AGAGGGCTTT
ACCACAACTA CACGAGATAT TTACGCATTA GATGAACTTA ACTACATCAA GAAGGCATTC
TTAATAGACT TGATTGATGT AGTTGGCTGA
 
Protein sequence
MKATKIIIRR FSASVRIRQK SEPVRNRLEA AVANFISKVQ KSQSKSSGSI LSAIDLLIDK 
HNETLAAHFG IPHETNDYLK VKQQIWDKLS VGDISLAQDQ ESVEAQRSNL LNSHSLRLSS
SSSTSLSKFV KCLYPLHKST SPSLSSSNSN KVLDIPSSLG KYELINHNQL HRTFYELPSP
QPFYLSPQHL NDFLHRFLFN KRDFIKSNIV SLSNLNHFAP YRFFEYYNQM LFKRQEYVSM
VTKILQDLQT YGFETSLTEQ NQIIFYTFFK DKAPIVHKIN EAVGKLKELG MSVTQDAYPT
FDLDTYNQLK ESMIAKYGKL HISSLNIFLL HAIRHSQESV IVDVLQQIGF AELVGGNSLS
ERLSTPDETT FALLFEYFSS SSYASSKGLS SFVEVFEKAV ADKSLVFDVK LINVLLKCLC
ENEELNFAEN IVSKLFLQEV PKLQVDESEA VVLKQMSPRD KLAYKKLELI YNNIRKVTND
FEISYTILPT ESTFKPLISS YCLPLSNTKN SFRKVNYLTN FMETYYKLPI TTRTFKLIFS
KFEASKEEGW ELFDLINITS KLISLHDYSY NLSEDTSLFG TDGQFSKLDR IEKSSQLTNF
INNHLVKQID LNIPIAQGSF VKLNDSLITS VYHAFMETIK RSAPNNKEGA DELVRRIQRQ
YDSLRERINK SRGPVDQEGF TTTTRDIYAL DELNYIKKAF LIDLIDVVG