Gene PICST_29646 details


Gene Information

Locus tag: PICST_29646
Symbol:
ID: 4837173
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 766101
End bp: 767994
Gene Length: 1894 bp
Protein Length: 618 aa
Translation table: 12
GC content: 44%
IMG OID: 640388488
Product: predicted protein
Protein accession: XP_001382917
Protein GI: 150864192
COG category:
COG ID:
TIGRFAM ID:
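The listed gene length is consistent with the start and end coordinates if they are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal sketch of the check, using a hypothetical helper name `span_length`:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the gene information above.
gene_length = span_length(766101, 767994)
print(gene_length)  # 1894, the same as the listed gene length
```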


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0379511
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGTAC CACAGCCTCC GTTCAAAGTC AAAACCAAGG TGTCTTGGGC TGGAGAAGAG 
GATGGTGACT TGGGGTTTCT TGAAAACGAG ATTATTGAAG TGTACTCCAT CGTCGATGAA
TCATGGTGGT CGGGAAAGCT CAGACGAAAC AGAGCTGAAG GTATCTTCCC CAAGGACTAC
GTTGAGGTGA TTCCAGAGTT GAATAAATCA GTATCCAGTC ACTCCATTGG TACCCCCAAA
GCACAGACTC CCGTCCAGAC TCCCATCCAG ACTCCCGTCA AAGAAGCTGA CTATGGTAAG
TTCAAACATA GCAGAGCCAG CGGAACTCCT ACGGGAATGT ATAACTCTTA CAACGGTTCC
AAGTCAGTAT CTCCCAAGAG ATACCACCAT GTCAATGTTG ATGCTGACGG CTCATTTGAA
GTCGAAGATG CTGTCAATTG CACCTATGAT GGCATCTACG ATTCTTCTAA TGGCATGTCG
CTGTCAAACC GAAGCTCACC CAAGAAAATC AGATCTGCCA ACAAGCTCGT TTCATCTACC
CAACATCCTC GGTACCGAAA CAGCCATAAT GGGTACCAGC AGCAGCAACA CGAGGATTTT
GACAGAGAAC GCGAAATGGA AAACTTTCGT ACCCTCCAAC AGCAACAGAA CTACCACATC
AAGAAACTGC CTCTGCACAA TATGAAATCG CTGGTTCAAA CCCAACAGCA AGAATTCAGA
CAATCCTCAT ATTCACCGGA AAATAGTAAC TCCATCTATG GAAAACATCA AGGAAAGTCC
AATTCAAGAT CAGGCTCGCA ACAGAACTTG TCTCAATCTG CTGTCCAGGG ACATCCTCCG
CAAGTAAAGT CGCAATCGTA TGTTGATATA CCCTACAGAA ATCAACAGAA TAACACCGTA
GAAAGTTTTG TTTCCGAGGG ATCTCCCTCA CGTCAAGCTA GGAGGGCCAG ATACGACGCT
GATGCCGTGA ATGAATACGA GATCATCTCC CAGAAAAGGG CCCAATTGGA GTTGGAGTTA
CTGCAGTTGA AGCAGTTGGA AAGGTCTACC CAGAAGAAAA GAGTCCACAA CCCCCATCTT
CAACACAAGC TGGGCCAAGT CAGTCGTGGC AGCAATGATG ATTCCTTAGT GAACTCATTA
GAATCTAGCT ATATTAGCGA AGACCTCTTA TCTTCCAAGA AGAACTACTC TTCCAGAGAG
GATCTTGGAA AGAAGCTTGC TAAGTTCGAA AGTGTTGACG ACGAAGATGA TGACTATTTC
AACGATAACG AGTCGTCTCC ACCACCTCCT CCACCAAAAC ATACTACACC TATCAAGGCT
ATTGAAGCTA TCAGAGACAA CGATAGTCCA CTTAGAAAAA GTGGAAACAA AGTTCCTTTT
GATGCTGATG ACTTCAGGTT TTCTGGTTCA AATCGCCTCC ATCAGGGTCA AGTATCCGAA
GAAGATGTCT ACAGAGTGTC CCAGCTTCAG CAGGATGACT TGAAGAACTC GATCAAGTCG
TTGCAAAGTG ATGTCTTAAA TTTATCTGAA CTCAGTGCCA CAAGTGCTGG TTCTTTTATG
AGACACAAGT ACGAGAGAGA CATCCAACAA CAAGAAATGA AGTTACACAG CTTATCCATC
AATGAAGAAG AGGAAAAAAG ATCACCTAAG CAAAACGGGA AAGATGTTAT GGACTCCGTT
TTCCAGGAAA AGAAATCGAG GCATCCTAAT ATTTTCAAGA TGTTGTTGCT GAAGAAGAGC
GACAACGAGA TCAATGTTAT TGAAAGAAAG CTCCAAAAGG ACGAAGAAAT TGACTGGACC
ACTTTCAAAA TGGATCTTAA TCGTATGAAC TCGTTGACTT CTCATGACAA ACAAGCCAGA
ACCAGAAGAG TGGTCAGAGA AGAGGGCTCG TTGA
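A GC content figure like the one in the gene information can be recomputed from a sequence block in this layout. The sketch below is one possible way to do it, not the method the database uses. The helper name `gc_percent` is hypothetical, and the function simply ignores the spaces and line breaks between the 10-base groups.

```python
def gc_percent(sequence_block: str) -> float:
    """Percentage of G and C bases in a sequence; whitespace is ignored."""
    bases = "".join(sequence_block.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base groups of the gene sequence above (12 of 20 bases are G or C).
print(gc_percent("ATGTCCGTAC CACAGCCTCC"))  # 60.0
```

Passing the full gene sequence block as one string would give the value for the whole gene.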
 
Protein sequence
MSVPQPPFKV KTKVSWAGEE DGDLGFLENE IIEVYSIVDE SWWSGKLRRN RAEGIFPKDY 
VEVIPELNKS VSSHSIGTPK AQTPVQTPIQ TPVKEADYGK FKHSRASGTP TGMYNSYNGS
KSVSPKRYHH VNVDADGSFE VEDAVNCTYD GIYDSSNGMS SSNRSSPKKI RSANKLVSST
QHPRYRNSHN GYQQQQHEDF DREREMENFR TLQQQQNYHI KKSPSHNMKS SVQTQQQEFR
QSSYSPENSN SIYGKHQGKS NSRSGSQQNL SQSAVQGHPP QVKSQSYVDI PYRNQQNNTV
ESFVSEGSPS RQARRARYDA DAVNEYEIIS QKRAQLELEL SQLKQLERST QKKRVHNPHL
QHKSGQVSRG SNDDSLVNSL ESSYISEDLL SSKKNYSSRE DLGKKLAKFE SVDDEDDDYF
NDNESSPPPP PPKHTTPIKA IEAIRDNDSP LRKSGNKVPF DADDFRFSGS NRLHQGQVSE
EDVYRVSQLQ QDDLKNSIKS LQSDVLNLSE LSATSAGSFM RHKYERDIQQ QEMKLHSLSI
NEEEEKRSPK QNGKDVMDSV FQEKKSRHPN IFKMLLSKKS DNEINVIERK LQKDEEIDWT
TFKMDLNQPE EWSEKRAR
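
The record lists translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. The sketch below builds that table from the standard code and translates the first 30 bases of the gene sequence. The function names are hypothetical. Because the gene is marked as spliced and the record gives no intron coordinates, translating the full genomic sequence this way is not expected to reproduce the whole protein.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table_12() -> dict:
    """Codon table 12: the standard code with CTG reassigned to serine."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AA))
    table["CTG"] = "S"
    return table


def translate(dna: str, table: dict) -> str:
    """Translate in frame 1; whitespace and trailing partial codons are ignored."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


table_12 = build_table_12()
# First three 10-base groups of the gene sequence above.
print(translate("ATGTCCGTAC CACAGCCTCC GTTCAAAGTC", table_12))  # MSVPQPPFKV
print(translate("CTG", table_12))  # S
```

The first result matches the first ten residues of the protein sequence above.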