Gene PICST_42423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_42423 
Symbol 
ID4837459 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp633953 
End bp636295 
Gene Length2343 bp 
Protein Length726 aa 
Translation table12 
GC content38% 
IMG OID640388774 
Productpredicted protein 
Protein accessionXP_001382884 
Protein GI150864166 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00348392 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTTGGACG AAGAACTCAT AGACGACAAG ATGAGGCTAG CCATCTTGTA CTTCAAAGCC 
GCGGACTTTG AACGAGCGCT CAATTTGTAC AATGAATTGG TCGAGATGGT GGCTTCTATC
CTGGCTGTAG AAGCGCAAAA AATCAGAAAA CACGTCTACA ACTTGGCAGA AAAGCCCATT
GTGGGAGCAT GTGTACATCC GAAGTTGGGA CTGATTCTCG ACCAACGAGC TGCCACTCAC
GAGAAGTTGG ACCAGTTCTC TAAAGCACTA GAAGACTCTC GTAGAATAGT CAAATTAGAG
CCGATTAGTT GTAAAGGATA TCTCAGGGTT GGTAAATTGC TTCTCAAGTT GAAGCAGGAC
GTAGAAGCGT ACAAAACCTA CCAAAGAGGT GTATACATTA TCGAAAAAGT TATAAAAGAG
CATCTGGTTT CAGTACCAGA GAAGCTCTTC TCGCAATTGA AAACTCAATA TAAGCTGTTG
AACAAGACGC TCAAGACTAA AAGACAAAAC AAATCTCAAG AACTGCAATA TCTATCAAGA
GAAGATTCGC ATGAACTGGA TAGTTCTCTA AGAATGAAGT TTTCTCAGGG AAGCAGCAAT
GCCAAGGGCA TAACATTGTC TCAGGAAAGT GGTCAGGCAC TGAGACTCTC TAGTATCAAT
GGCTTTACTA CTGCATTGGA ACATATGCTT CCTTTGAAAA GAACAGCTTC CACCTCTCAA
ATATCAGCCC CAACAACTAG TTCCAAGAGA GCCAAGCGTA TGGGTTCAGT TTCAAAACCG
ATAGCAGACC CCTTCCAGCA TCTCCCGCTA GAAATCATAG AGCTCATATT CCAAAACTTG
GCCTTTAAAC AACTCCTATC TTGTCATCTA GTCAACAAAT TGTGGTACTA TAATTTGACT
AAGATTCCCA ATTTGTATTG TACTCGTGTC AATTTAAAGT CGAACATCAC ACTTTCTGAA
TTCACCCATG GGATTCGATT GGTTAAAAGA GTAGCCCACA AGTCGTATTC ACAACAAATC
AAAGCCCTTA AAATGCGATC CGCAATCAAT GCTTCACAAT TGCAAAAAAT CATAGAAATA
ATCATTAATG AACCAAACTT TTCCATACGA TCGTTGGATA TGTTTGACAA ATATCTTAGT
TTTCAGCTTC TATTGAATAA GCTCAGCAAG TTCAGTTGGA AATTGAACAA CTTGTCCAAT
TTGGAGTACC TTCGCTTAGG AGTGAATTCA AGTCTTCGAT ATGAGAATAT CATATTGGAA
TTATTCAAGA AGTTGAAGAC TCTCTATATT GTAATCCTAT ACTCCGACAT GAGTGGACCT
AATAACCAAA TTCTACCTAC TACAGAAAAG TATTTCAATC GTTTGCACGA GAAATCAGTC
AATAATTCAG ATGATTACGA ATCTATGCTG AATTTGACTC TTGTAAACCA TCCGAAGTTA
CTACCAGGAG AAAGTCAAGT AGCTCCGAAG TTTGAAACTT ACAATCCATA CCCAATTTTC
CTTGATAGAA GCTTTTCAAA TTTGGTAGAA TTGAGTTTGG TCTCTTTTGA TTTCTTTAAT
AGGTTGCCTC TTTTGGGAGA ATTTTTCTGT AAATGCAAAC TGTTGACCAA ATTGATGCTT
GAAAACAATT TTAACTTTCG TATGCTTGAT TTGTTTCAGA TGCTCCAAAA TTATAACCCA
AGTTTCCGGT TGGAGAAATT ACTATTCAGA GAGCCCAAAA TCATTAGTAC GACCACCATG
AATGAATTCT CAACCGATGA CTTACCTCAG TTGAACAGCT TGCAACTGTT GGATGTGTAC
GGCTCATCAT TGACCACAAA GGGGTTGATG AAATTGCTTA GAATTTGCAA CAAAGGAAAT
AAGGAACTTA CCACTTTAGT AATGGGAAAT TCCACATACT TGCATTTCAA GACCGATGCT
TTTCAGACTA GCAATCGGAA CCTATTTTCA TTTGTACAAA TGCTTCGAAT TGCACCCAAC
TTAGAGAATC TATACCTCAA CGAATTAGAT ATAGACAATC AAACTATGAA GCAGCTTAAT
AAGGATATAG AATCTATTGG TTATGTCAAT TGCAAGCTTA AAGTGCTTGA CTTGAGTTTC
TGCAACAGAG TTGAGGGTAT TGGACTAATT GATTTGTTCA AAGCGTTTCC TTCTAATATT
AAACAGATTA ACGAGAATAG TGGAAGTGCA TTTCGGATCC GAGAATTAAT CATAGACGGT
ATAGAGATCA ACATAGCTAC ATTGCGTTTA CTACAGAAGC ACAATTTTGT CAGCACCATC
AAAAATGATC CCAACAAGAA GAGATGGAGA CAGTACGGGG TGAATACTTT GGTTCCAGTT
TGA
 
Protein sequence
MLDEELIDDK MRLAILYFKA ADFERALNLY NELVEMVASI SAVEAQKIRK HVYNLAEKPI 
VGACVHPKLG SILDQRAATH EKLDQFSKAL EDSRRIVKLE PISCKGYLRV GKLLLKLKQD
VEAYKTYQRG VYIIEKVIKE HSVSVPEKLF SQLKTQYKSL NKTLKTKRQN KSQESQYLSR
EDSHESDSSL RMNSKRAKRM GSVSKPIADP FQHLPLEIIE LIFQNLAFKQ LLSCHLVNKL
WYYNLTKIPN LYCTRVNLKS NITLSEFTHG IRLVKRVAHK SYSQQIKALK MRSAINASQL
QKIIEIIINE PNFSIRSLDM FDKYLSFQLL LNKLSKFSWK LNNLSNLEYL RLGVNSSLRY
ENIILELFKK LKTLYIVILY SDMSGPNNQI LPTTEKYFNR LHEKSVNNSD DYESMSNLTL
VNHPKLLPGE SQVAPKFETY NPYPIFLDRS FSNLVELSLV SFDFFNRLPL LGEFFCKCKS
LTKLMLENNF NFRMLDLFQM LQNYNPSFRL EKLLFREPKI ISTTTMNEFS TDDLPQLNSL
QSLDVYGSSL TTKGLMKLLR ICNKGNKELT TLVMGNSTYL HFKTDAFQTS NRNLFSFVQM
LRIAPNLENL YLNELDIDNQ TMKQLNKDIE SIGYVNCKLK VLDLSFCNRV EGIGLIDLFK
AFPSNIKQIN ENSGSAFRIR ELIIDGIEIN IATLRLLQKH NFVSTIKNDP NKKRWRQYGV
NTLVPV