Gene PICST_31848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31848 
Symbol 
ID4839194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp40089 
End bp41738 
Gene Length1650 bp 
Protein Length549 aa 
Translation table12 
GC content41% 
IMG OID640390509 
Productpredicted protein 
Protein accessionXP_001385030 
Protein GI150865703 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGAAT CGTCACAAAA AAGTAAGAGG ACACTAGGGA AACCTAACAG CGAAGAAATA 
ACACATCTCC CTCCTTCTCC TGTTAACGAG CAAGATGCAG CAATTCTCGA GCAACAGCCT
TTGTTGGATC ATAATCATGA ATTTCTTGAT CCAGATGATC CCATAGTTTC TCCATTGAAC
TTGTATAACG TTCAGTTAAT GAAAATTGGT GTTACGGTAC TAATATTTTT CAATGCAATT
GTAGGATTTG CACTTATATT GACGGATTTC ATCTCAATTC CCGGACTTAA CAACCGAGGT
AAGTCTTTTT TGGAGTTGGA CCTTATACTC GTGGCATTGC TAACTAATGC CATCACTCTT
TGGTGCTTCA CGGTTCCAGT GTACTATGAC CGAATCCTAG GCTACATCAC TGGTGGGCTT
CTTCTCTTAG ACTTGCTCGT GATTGGAGTA GTTACATACA CTAGACACCA ATTTGGCTGG
ATAGGAATTA TTATTTTGAT CTGGACTGGA CTCAATGTTT TGGTGAATGC TCTTGTAGAC
TACTGGGTAG AGAGGGAGAA ACGAGTACAG GAGGTAAGAT ACACTGGAAG AGTTGAGAAA
AGATGGTCGT TGTCCGAGTT ATTGATTGCT TTAGTAAAAA TTACCGTCAA ATTATTCTTA
CTATGGGTAG TCTGGTGTAT TAGTCTTACG TTTTGGTTGC AAACGTTTGA TTCACACGAG
AAACCATGGG GAAAAATGGT TGCTGTGAAT GATAATTCTT TCAAAGTTCA CCTCGCTTGT
TTTGGCAATG TTCATAATAA CACGAAATCT AGCCAACCTA TAATTCTAGT CGAGGGTGGA
CAAATGATTG CTACAGAAGT CTTCCAAGAG TGGATTGAAG AACTATATCA CTTGAACAAA
ATCGACCGAT ACTGTATTTG GGACAGACCA GGCTACGGAT TTTCGGATAG TGCACCTTCT
CCGGTTTCAA TAGGAATCAT TACTGAGTAT CTTATTGAGG CTCTCAATAA GGAGGAAATT
GAAGGTCCCT TTTCGTTGGT GGGTTTCGAC ATTGGAGGAC TATATTCGAG AGTGTTTGCT
TCTAGAAACC CGAACAAAGT TCATTCGTTA CTTCTCGTAG ACAGTTGGCA CGAAGATTTG
TTGAAAAGGT GGCCCTTTAG TGGATCCAAC CGAAAGAATG AGAAGTCTAC AGTTTTCAAG
AATATTATTG AGCTAATGGA CAATATCACT GGATTTAAGC TTTGGTTTAG AGGCTTGGTC
TCACCATTGG GGATTGTGTC TAATATCCAT TGGTTTTTGC ACCCATTCAA ACATCTGAGC
AAAAGTCGAA TTTTCGGGTC CGACATGCGT TATCAACCGA AGTATATACG AGCTAGACTA
CAAGAGCAGA TTACGTCTAC ATTATTGTCG TATTCTGAAG TCAAGGAGTC GACTGTGCAT
GACCTTCCGT TGAGTGTGAT CTCATCTGGG TTTATGATCA AGAACTCATT GAACTGGGGC
AAATGGCAGC AGGAGATTAG TAAGATCAGT TCGAACACTG TCGAGTGGGT CATTGCTGAA
AACAGCAACC ACGAAATCTG GAAAAGTCCT CGAGGCAGAG AACAACTCCA GCAGTTACTT
ATGCGTGTAA TAGGAGGGAA GACATACTGA
 
Protein sequence
MPESSQKSKR TLGKPNSEEI THLPPSPVNE QDAAILEQQP LLDHNHEFLD PDDPIVSPLN 
LYNVQLMKIG VTVLIFFNAI VGFALILTDF ISIPGLNNRG KSFLELDLIL VALLTNAITL
WCFTVPVYYD RILGYITGGL LLLDLLVIGV VTYTRHQFGW IGIIILIWTG LNVLVNALVD
YWVEREKRVQ EVRYTGRVEK RWSLSELLIA LVKITVKLFL LWVVWCISLT FWLQTFDSHE
KPWGKMVAVN DNSFKVHLAC FGNVHNNTKS SQPIILVEGG QMIATEVFQE WIEELYHLNK
IDRYCIWDRP GYGFSDSAPS PVSIGIITEY LIEALNKEEI EGPFSLVGFD IGGLYSRVFA
SRNPNKVHSL LLVDSWHEDL LKRWPFSGSN RKNEKSTVFK NIIELMDNIT GFKLWFRGLV
SPLGIVSNIH WFLHPFKHSS KSRIFGSDMR YQPKYIRARL QEQITSTLLS YSEVKESTVH
DLPLSVISSG FMIKNSLNWG KWQQEISKIS SNTVEWVIAE NSNHEIWKSP RGREQLQQLL
MRVIGGKTY