Gene PICST_28340 details

Gene Information

Locus tag: PICST_28340
Symbol:
ID: 4851117
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 969415
End bp: 971529
Gene Length: 2115 bp
Protein Length: 704 aa
Translation table:
GC content: 38%
IMG OID: 640392825
Product: Leucine rich repeat protein
Protein accession: XP_001387824
Protein GI: 126274103
COG category:
COG ID:
TIGRFAM ID:
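
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and an unspliced CDS whose length includes a single stop codon (the gene sequence in this record ends in TAA). The function names are illustrative only and do not belong to any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


gene_length = span_length(969415, 971529)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 2115 704
```

Both values agree with the Gene Length and Protein Length fields reported in this record.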


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.669391
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.295604
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGATT TCTGCAATGG CTTTCAGCGG TTGCCAGATC ACTGTATCGA GTTCATAATC 
AATACTTTGC CACTTTCTAC ATTGCAGAGT TTAATTTCCA ACGGTAGTTC TCCTTTCCAA
GAGCAATTTA AAAGAGCCTT CTACAGAAAT GTGATTGTAA CTAACTACAG TGGGAAAGAG
CCCAGAATCG ATTTAGGTTT CCTCAAAGAA ACTGAGTTAT TCAATTCCTT AGACGTGATC
ACCGCTCCCT TTGACTGTGA GACATTGTAT CAGCACATGG AAAAGTACCC ATATATCCAT
ATTCAAAATG CGACGTTTAA ACTTCACTCG TCTACCAACT CAATCGATTG GGTGCAAAAG
TTGGCTGATA ATATCAACAA AATACACATC AACTTTGATC CGTATAGATA TCATGATTCT
TTTCTTCTGG ACTTGTGCAG AAAGAGGCTA CCTAGCAAGG TTTTTGGTCT TTCCACGAAG
TATGTTAAAG ATATGTTATT TCCATTTTAC TTGCAATCAC TTGAATTGCA CCTAACTAAT
GATGAATTTG ATGTAAAAAG CCTTCCCCAA CACCTTGAAC ATCTTGCTAT TCATGTTTCA
AGACCTATGA CATTACTCTC AGAAAAGGAG TTGAATCTTC TACCCAGACA AATAAGATCT
CTCAGCCTAG ATAAATGCTT CCGTATTGTG GAAGAGTATG ATGGGCTTGT TAAGTTGGAC
TTGCCTACGC ACTTAACGAC ATTGAAGCTT CAGATGGAGT ACTCTGGAAA GTCAGTTATA
GACATATCTC ATTTAAGATC CTTGAACAGG TTTGATTCGC ATCTCTCTGA ATTACAATTG
TCTTATATGA AATTGCCTAT ACAGTTGAAT CGAATAAGGT GCAAAGGCTC ATTCCTCGAT
ACAGAACAAT TTCTTGATGG AATACAACCC AGCCTAGCTA ATTCAGAGCA TCTCGCAAAC
TTGACAAATG TAGACATTTG GGATTTGAAA TCAGAATCTT GCATCACTAT TCCAGACACC
GTCAGGGATG TATCTATTAT TTCAGCAAAC CTTGATTCTC CTTTGCAGCT GGTCATAACT
TTTCCAGAAG GCTTATCCCA GCTTCAACTC GACTATTGCA GACCTGTGGA AATGACACAG
GAACTTAGAC ACTTGACTTC TTTGAAGATC TTTGGTGAAG AAATCATTTT CTTGCCCAGG
ATGGACAATT TGCAGAAGCT AATGATACAT CATCTGAATA ACTTATTTCC AGGATTCTGG
GATTTCTTGG AATCTCTTAA GGAATTGCAA TTCTTAGAAA TCTCTGGATT TGGTATTGAA
ACAGTTCAAT ACTTGCCTCA TAAACTTGAA GAACTTGTTT TGAACGATAA CGTGATAAAA
GAATTCTGCT TTGAGTTGCC TCCCAATATA AAATCTATTT CTCTACGTAT GAACAAATTA
AGCAAGTTCA TTGTTAGTGG AACCAGTAAA TTGAAGAAGT TAATACTTGA TGATAACCTG
TTCAAAGTTT TGACGGCCTC CTGCTTGCAA ATACCAGACA GTATTTGTGA ATTGTCTATG
GACTTTTGCC ATATCACTTC AGTGGATAAG CACTTTAATT TTCCTGAGAG TACGCGCTTG
CTTCGTCTCT CCAACAATAG AATAGAAAAT GTAGAGAACA TTCTATTGTC ATTGCCTCCT
CGAATTGTCT GTTTCAATCT AAGTAGGTAT GACAACAGGT CTAGAACTCC GGTGACAAGA
AAAAGAATCG ACGTCAGAAG CAAGTCACTT TGGAATGTTG ATGTAAATAA TGCTCTCATG
GGTCATGAAG TTGTATGGAA TGGTTGCACC AATTTGCAGT ATATCAACTT GGCTAACAAT
TTGTTGGGAA ACGTCGACAC CAATTGCTTT CCCCTGTCAC TCAAAGGCAT CAAGTTCAAC
CACTCAACTA TAGACAAGCT TGAGGGTGGC TTTGAAAGGT TTGCTGATTT AGAAGAGGCT
GACTTGCGCA ACTGCTCCGG AGAGTTTGCT AAAACCGTAA CTGGACTGGA TGCATTTGGA
CCAAAAATAT TAGTTACTCA GCATTTAGAA GAACCAACGA ATGGAATTCA ATCTTTGAGT
CTTTCGCAGC AATAA
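
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be flattened before any analysis. The following is a minimal sketch, assuming the text has been copied into a Python string. `clean_sequence` and `gc_percent` are hypothetical helper names, and the input shown is only the first line of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied with its block spacing intact.
first_line = "ATGTCAGATT TCTGCAATGG CTTTCAGCGG TTGCCAGATC ACTGTATCGA GTTCATAATC "
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Run on the full gene sequence, the cleaned length can be compared with the reported Gene Length (2115 bp) and the GC percentage with the reported GC content (38%).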
 
Protein sequence
MSDFCNGFQR LPDHCIEFII NTLPLSTLQS LISNGSSPFQ EQFKRAFYRN VIVTNYSGKE 
PRIDLGFLKE TELFNSLDVI TAPFDCETLY QHMEKYPYIH IQNATFKLHS STNSIDWVQK
LADNINKIHI NFDPYRYHDS FLLDLCRKRL PSKVFGLSTK YVKDMLFPFY LQSLELHLTN
DEFDVKSLPQ HLEHLAIHVS RPMTLLSEKE LNLLPRQIRS LSLDKCFRIV EEYDGLVKLD
LPTHLTTLKL QMEYSGKSVI DISHLRSLNR FDSHLSELQL SYMKLPIQLN RIRCKGSFLD
TEQFLDGIQP SLANSEHLAN LTNVDIWDLK SESCITIPDT VRDVSIISAN LDSPLQLVIT
FPEGLSQLQL DYCRPVEMTQ ELRHLTSLKI FGEEIIFLPR MDNLQKLMIH HLNNLFPGFW
DFLESLKELQ FLEISGFGIE TVQYLPHKLE ELVLNDNVIK EFCFELPPNI KSISLRMNKL
SKFIVSGTSK LKKLILDDNL FKVLTASCLQ IPDSICELSM DFCHITSVDK HFNFPESTRL
LRLSNNRIEN VENILLSLPP RIVCFNLSRY DNRSRTPVTR KRIDVRSKSL WNVDVNNALM
GHEVVWNGCT NLQYINLANN LLGNVDTNCF PLSLKGIKFN HSTIDKLEGG FERFADLEEA
DLRNCSGEFA KTVTGLDAFG PKILVTQHLE EPTNGIQSLS LSQQ