Gene PICST_46351 details

Gene Information

Locus tag: PICST_46351
Symbol:
ID: 4839595
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 1377475
End bp: 1380360
Gene Length: 2886 bp
Protein Length: 937 aa
Translation table: 12
GC content: 41%
IMG OID: 640390910
Product: predicted protein
Protein accession: XP_001385271
Protein GI: 150865880
COG category: [R] General function prediction only
COG ID: [COG2234] Predicted aminopeptidases
TIGRFAM ID:
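
The length fields above can be cross-checked against the coordinates with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the gene sequence ends in a single stop codon (the listed sequence ends in TAA). The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1377475
end_bp = 1380360
protein_length_aa = 937

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon that is not counted in the protein length.
coding_bp = protein_length_aa * 3
stop_codon_bp = 3

# Bases in the gene span that are not accounted for by the protein or the stop codon.
non_coding_bp = gene_length_bp - coding_bp - stop_codon_bp

print(gene_length_bp)  # 2886
print(non_coding_bp)   # 72
```

Under these assumptions the span matches the stated gene length of 2886 bp, and the 72 bp left over is consistent with the record marking the gene as spliced.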


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0158707
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CCAAATGGTT TTGTCAAGTT CATAAGATCC ATTTTCGGTT ACAGAAAGAC TTCGTTAACC 
TTATTTGTGA TTTTGACCTA TGTTGCTGTT TTGCTTCTTG CCTACTTGGA CCATTCGCTA
TACTACTCTG TAGACTTACC CACATCTCAC AAGGAACAAG AATTGCTCCA CCAGGCTTGG
GTAGACCTCC AGCACATTGC CAAATACGAG CACGCTTATG GTTCCAGTGG CAATGACTAC
GTCCACGACT ATCTTGAGTC GCGAATCGTT AGTGCTGTGG CACACAAATC GTATGTAGAG
TATGATAACG ATTTGAACTA CACCAACAAC ATCATGTTTG GCAGCCGTTC CGAATTAAGT
GGCAACCTGT TCAACTCCGT GTCATACTAT GAGAGCAACA ACTTGGTTGT TCGCATCAAC
GGAACCGACG AGACCCTACC AGCGTTGCTT TTGAGTGCTC ACTTTGACTC TGTTCCCTCT
TCGTTCGGAG TGACAGATGA CGGAATGGGA ATCGCTAGTC TATTAGGAGT TTTGTACTAT
TATACCGGTA AATCCACAGC TCGTCCTCGA AGAACCATCG TCTTGAACTT CAACAATGAC
GAAGAATTCG GCCTCTATGG AGCCACATCC TTCTTGTCGC ATCCATGGGC TACAGGTGTG
CATTACTTCC TCAACTTAGA AGGCACTGGA GCTGGTGGAA AAGCCATTTT GTTTAGGGGA
ACAGACTACG GTATAACCAA GTACTTTAAG GGAGTCAGGT ACCCTTATGG AACGTCTATT
TTCCAGCAGG GATTCAACAA CCACTTAATT CACAGTGAAA CAGACTACAA GATTTACAAG
GAAAAGGGAG GCTTGAGAGG TTTGGATGTA GCTTTCTACA AACCAAGAGA CCTTTACCAC
ACTGCTGGTG ATAACATCAA GAACATCGAT ATCAAGTCGT TGTGGCACAT GCTTTCCAAC
GCTTTGGATT TCACTGCCAT AGTCACAAAG GGCAAGATTG ACTTGGATGC CGATTCCTTG
GACTCTGAAT CGAGCAAATC CAATACAGAC ACTGCTGTGT ACACATCATT CTTGAACTTC
TTCTTTGCTT TCCCTACTTC TCAAGTGGTT GTAGCAAGTA TTTTGCTTTT GGTACTCATT
CCTGGGATCA GTATCCCGTT CTTGATTATC ATATTTGGGT ATAAAAAGAA CTGGGAATTG
TCCTTCGTCA ATGTCACCAA GTTTCCCATC TCGCTAGCTA TTTCTGCTGC ACTCTTGAAT
CTCTTTACCA ACGGTTTTAT CGTTCCATTC AATCAATTCT TGCCCAACTC GTCTCCCTTT
GCCTTGGTCG CCATTTTGTT CGCAACTTTC TTGTTGTTGA ACTACTTGAT CTTGAATGGT
ATCAATTTGA TTTTTGTTTC ATACAAAATC GTCAACCATG ACGAAAAGTT GATTTCCATC
ATTGAAACTT CCTTCTTGTA TTGGGTTGTT TTGATCTACT CGACTGCTAA ATTGGCTAAC
AATGTTATCG GTGACGACCA CTCAGGTGAA TTCCCTATTA TTTTCCTCTG TGCTTTGCAA
GCTGTAGCCT CCATCTTTGG CTTGATTGGC TGGAGTTTCA AACCCGTTCC AAAAGAACAT
TATGTCGTAG TGCCCCAAGA AGAAGCTGAG CCATTGTTGG GTAGTAGTGA CAACTTCAAT
TATGGCTCTC CTGATGTCGA AGATGACAGA CTTGTTTCTG ACGGTTCGTC TTTGTCGCTT
AATTTCACTG GCGAAAATAG TGCTGAAAGA AAGTTGAAGG ATTTCATCAA AACCTTTAGT
TATGATTGGT CAATTCAATT CTTGACTATT GTTCCAATTT CAACTTACTT GATTTACAAC
TCAGGTTTCT TGGTTGTTGA TGGAATCAAT AAGTCTATCC AAGAGTCGCT TATTTCCCAG
AATTTGATTT ACAAGCTCTT GCAAACTTTT GCTATTTCGT TGTCCATACC CCTTTTGCCA
TTCATCTTCA AGGTTAACAG ATTGTTTGTC TTAGCTCTTT TTTTAATTTC GACCATTGGT
GTCTTGTTCG TCGCCACGGC TGATTCATTC AATGTTGCCA ATCCATTGAA ATTGAGATTC
ATCCAGTACA TTGATCTCGA CAAGTCCGCA CAAGATAGTT TTGTTTCTGT TATCGGACGT
GAAGCTTCAC CATTGCAGTT CGTCTTGAGC GATATTCCTT CTGTGAAAGA TTCAAAGGGA
GCAGTTGCCT GTGTTCCTAC TAGAGATGGA CTTCAAGATT GCTCATATAA GAGTCTGCTT
GATCCAAAAC TTGTGCCAGG AGCTAAAAGC TTTGATGACT ATTTGAAAGT TGATATTCTC
AAGAATTCTT CATCAAATGT CGATTATCCG TTTGGACTCT TAACTGGTGA GATCAGGATT
AGAGTTCCAA AGAACAGAGA GTGTGTTTTG GATTTCAAAC CAAGTGAGTC AACAAAGATA
GTGTCTCCAT TTAAAGATTC TCCAGTAAAA ACTGTCATAG TATACAAGGG TAAGAAGTCG
GCCACTACTA AAGAAGTTGA AGCTGAAAGT ATACCAGAAG GTTTCTCCAA AGACAAAGAT
GGTAATTACG TCTACAAGGA TCTTGTAGGC ATAGATCAAC TTCAGTTGAA CAAGCTTGAC
TGGGATAAAA GTTACCATGT AGGTTTCCAA TGGGTGCCGA ACTTTTCAGA TGTTGATATC
AACATGAAAA AGTCCGCTAC AAACAAATTG AATGTCAGCG TCAAATGCTT CTGGGCTGAA
TTGGGCAAGG GTGAAGAATC AACTATCCCT GCCTATGAAG AGTTGTTGCA TTATAGTCCA
AACTATGTTA GCTGGGCCAA TAGTGCCAAG GGGTTGGTGT CGGTCTCCAA AACCGTCGAA
TTATAA
 
Protein sequence
PNGFVKFIRS IFGYRKTSLT LFVILTYVAV LLLAYLDHSL YYSVDLPTSH KEQELLHQAW 
VDLQHIAKYE HAYGSSGNDY VHDYLESRIV SAVAHKSYVE YDNDLNYTNN IMFGSRSELS
GNSFNSVSYY ESNNLVVRIN GTDETLPALL LSAHFDSVPS SFGVTDDGMG IASLLGVLYY
YTGKSTARPR RTIVLNFNND EEFGLYGATS FLSHPWATGV HYFLNLEGTG AGGKAILFRG
TDYGITKYFK GVRYPYGTSI FQQGFNNHLI HSETDYKIYK EKGGLRGLDV AFYKPRDLYH
TAGDNIKNID IKSLWHMLSN ALDFTAIVTK GKIDLDADSL DSESSKSNTD TAVYTSFLNF
FFAFPTSQVV VASILLLVLI PGISIPFLII IFGYKKNWEL SFVNVTKFPI SLAISAALLN
LFTNGFIVPF NQFLPNSSPF ALVAILFATF LLLNYLILNG INLIFVSYKI VNHDEKLISI
IETSFLYWVV LIYSTAKLAN NVIGDDHSGE FPIIFLCALQ AVASIFGLIG WSFKPVPKEH
YVVVPQEEAE PLLGSSDNFN YGSPDVEDDR LVSDGSYDWS IQFLTIVPIS TYLIYNSGFL
VVDGINKSIQ ESLISQNLIY KLLQTFAISL SIPLLPFIFK VNRLFVLALF LISTIGVLFV
ATADSFNVAN PLKLRFIQYI DLDKSAQDSF VSVIGREASP LQFVLSDIPS VKDSKGAVAC
VPTRDGLQDC SYKSSLDPKL VPGAKSFDDY LKVDILKNSS SNVDYPFGLL TGEIRIRVPK
NRECVLDFKP SESTKIVSPF KDSPVKTVIV YKGKKSATTK EVEAESIPEG FSKDKDGNYV
YKDLVGIDQL QLNKLDWDKS YHVGFQWVPN FSDVDINMKK SATNKLNVSV KCFWAELGKG
EESTIPAYEE LLHYSPNYVS WANSAKGLVS VSKTVEL
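
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below is an illustration only and is not the database's own pipeline. It builds the standard codon table from its usual TCAG-ordered string, applies that single CTG change, and translates two short in-frame fragments copied from the gene sequence above. The function names `codon_table` and `translate` are hypothetical helpers, and the table ignores any other differences (such as alternative start codons).

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg_as_serine):
    """Return a codon -> amino acid map; optionally read CTG as Ser (table 12)."""
    table = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), STANDARD_AAS)}
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(dna, table):
    """Translate a DNA string in frame 1, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 60 bases of the gene sequence (no CTG codon in this stretch).
first_60 = "CCAAATGGTT TTGTCAAGTT CATAAGATCC ATTTTCGGTT ACAGAAAGAC TTCGTTAACC"
# An in-frame fragment starting at base 361 that contains a CTG codon.
ctg_fragment = "GGCAACCTGT TCAACTCCGT G"

print(translate(first_60, codon_table(True)))       # PNGFVKFIRSIFGYRKTSLT
print(translate(ctg_fragment, codon_table(True)))   # GNSFNSV
print(translate(ctg_fragment, codon_table(False)))  # GNLFNSV
```

The first output matches the opening 20 residues of the protein sequence, and the table 12 reading of the second fragment (GNSFNSV) matches the listed protein at that position, whereas the standard code would give leucine there.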