Gene PICST_33355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33355 
Symbol 
ID4840527 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp399988 
End bp402255 
Gene Length2268 bp 
Protein Length735 aa 
Translation table12 
GC content40% 
IMG OID640391842 
Productpredicted protein 
Protein accessionXP_001386094 
Protein GI150866474 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0714809 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATTCC TAGCGGAGTT GCTCAATGCA TTTGTTAGAG ATGTTGATAT CAGCATAGAT 
TTCGATATGG ATCTAGAGGA ATTCTCCAAT TGGATCATTA ATGCTACCAA CTCCGGTTTG
CCCAATGACG CATCACTTAG AAAGACAGCT TACGATCTCA ACGCCGAATG CAACAGTCAG
GCATTCTGTC TCCTGTTTTC TGAAAACGAC TCTGCTTCTC CAGAAACCAT CGAGTTGAAA
TTCATACAGG CCAAGACCTT ACAGATCGCG AAATACTTCA ACGATATCCA TGCATTCTCA
GGACTTTTTA AAGAAGTCCA TAGTTATACA ACATTTTTAC CTTGGTTCAA AGGTCTTGTA
GAACTGTACC GTTACTACTG GGTCAACTAT GGCTCTCTCA ATGCTGGCTC TGTAAATTTT
GACCAGTTTC TTCGTTTAGA CTCTTACTCA GACCAGTTCG ACATCATGAT TGAACCGTTG
AATAAAGACG CCTACAGCGA CAAGTTATCT GTTTCCAATT GGTTCAAGAA CGTTATTTTA
CCATTAATTA ACTATAACAA CAACAATTTC GGCCCATTGT TGAACTGGCT TTTTTTGGAA
GAAACTGTTG TACTTTCGCA GCCAGCTTCT AGAAAGTACG ACATTTGGAA CAAGGCTCTT
AAGGCAATTA TAGAGTATCC CAGTATAGGC TTCAATGACT TTAAGGATGT AATTCTGTAT
TTCCTAGCAA GCTGCTATTA TTTTGCACTC TACCACGAGA ACTCTGACCG AATACTTACC
AGTGAAGTCA TCAAGAAATA CGATTTGATC AAAGATACAT TGGCCTTGTT ATGGACAGAA
AAGGAAGCGC CATCATTTGA CATACAGGTG TCTGAGCTTC CTGCCTATGC TTCTTTTATT
GAATTCCTAA AGACAGAGAG CAATCCCTTG AGACCACTAT TTGAACCTAC AGGCAGTTCT
ATCGTAGCAT TAAGTGAAAT CATCGATACT TGCCAATCGT TGTATCCTAT AAACAAGCTT
TCAATTGCAA GATACTTGGA ATTGAAGAAT CCAAGTTCAG CCCACGACGA CAAGGGCAAA
GAAGTGAGGA AAATCATTGC AAATGTAAAT CCTGGCAACT ACAACAGCTT GTTGAGCTCT
GCTAAATTGT TTTATGGCAA ATTTGTAGAA GACAATGATG TAGAAAAAAC AAAGGTCAAG
GAAATAGTCC TTGAAAGATT GCTCTTCGCC AACTTGTTTG AAGTTGTTCT GGAATTGTAC
TCGACCCCAG AGTTCAGATT ACCTGTTAAT AATTACTATA CTCTCATTTC AGACAAGTTT
TGGGATTCGT TCAACAATTC TTCCAGTCTT AACGAAAAAA TCGGAAGATT CAAGGATGCC
AATAATTGTG TTAATCTCTT CGACTCAATT TCTGCAGATC CGGAATTGGA GCCAGAGAAT
AAGGATCAGA TCATTCGCAT CAAGCATTTA TTGAAGGCTA TATTCAACAT CAAGAACTTC
AAACTTTCTT TGGAAAAGGG GAAACCTTTC ACACCATTTC AATTAGTTGA TAAATTTAGA
AACGTGGGCC AGTTTGCAGT GGGAGAAAAG TCCACGCCAC TTGACTTGAT AACAATCATT
TTGGAACAGA ACGCCAAGTC ATACTTGGCT TTTGAAAAGC TCTACAAGAT CTTGAACGAT
TTGTTATTGT TCTTTGAAGA TGGAACACAT GAGTCTGACA CTCACTACTT CAACAAGTTG
AAGTCCGCAT GCATTGAGTC GTCATTGGTT GCTAACGATT TCCAGTTTGC CTATGCACAG
AGTATGGGTT TATTCGATCA CTATGTGAAG AGCGACAGCA ACTTGAACGA TATATGGTTG
ACATTCTACC AGGTTGGAAA GTATGTGTCG CCGTTGTGGT TTGACGATGA CTCTTATCAA
GAGGAAAGGA TCCAGATTCT CTGCAAGCAA CGAGAAATCT TGCTGAGAAC CATACAGATT
ATTCAGCCTA ACAGCTTGAC GTCAGACAAC AGCAAGGTGA TCTTGAGCCA ATGGGAAAGA
GTCAATAGCC AGATCGAAGA GCATTACACT AGTGACTCAG TATCTAAGGA ACTCCAATTC
CTGGAATATA GTGCTCATTC GTTCTCTGAA GTGACAGACA ACCTTGGATA TATGGCTAAC
GAATTGATCA GCGATGCTAC GGCTACGACC AATAAAACCA GCGAAAAGTT GTCCAATCTC
TTTGTTTCCG GATTGGGCTG GGCAATCGGA GCCAATCAAC AGCAGTAG
 
Protein sequence
MGFLAELLNA FVRDVDISID FDMDLEEFSN WIINATNSGL PNDASLRKTA YDLNAECNSQ 
AFCLSFSEND SASPETIELK FIQAKTLQIA KYFNDIHAFS GLFKEVHSYT TFLPWFKGLV
ESYRYYWVNY GSLNAGSVNF DQFLRLDSYS DQFDIMIEPL NKDAYSDKLS VSNWFKNVIL
PLINYNNNNF GPLLNWLFLE ETVVLSQPAS RKYDIWNKAL KAIIEYPSIG FNDFKDVISY
FLASCYYFAL YHENSDRILT SEVIKKYDLI KDTLALLWTE KEAPSFDIQV SELPAYASFI
EFLKTESNPL RPLFEPTGSS IVALSEIIDT CQSLYPINKL SIARYLELKN PSSAHDDKGK
EVRKIIANVN PGNYNSLLSS AKLFYGKFVE DNDVEKTKVK EIVLERLLFA NLFEVVSELY
STPEFRLPVN NYYTLISDKF WDSFNNSSSL NEKIGRFKDA NNCVNLFDSI SADPELEPEN
KDQIIRIKHL LKAIFNIKNF KLSLEKGKPF TPFQLVDKFR NVGQFAVGEK STPLDLITII
LEQNAKSYLA FEKLYKILND LLLFFEDGTH ESDTHYFNKL KSACIESSLV ANDFQFAYAQ
SMGLFDHYVK SDSNLNDIWL TFYQVGKYVS PLWFDDDSYQ EERIQILCKQ REILSRTIQI
IQPNSLTSDN SKVILSQWER VNSQIEEHYT MTDNLGYMAN ELISDATATT NKTSEKLSNL
FVSGLGWAIG ANQQQ