Gene PICST_34117 details


Gene Information

Locus tag: PICST_34117
Symbol:
ID: 4850971
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 592605
End bp: 594725
Gene Length: 2121 bp
Protein Length: 516 aa
Translation table:
GC content: 42%
IMG OID: 640392679
Product: predicted protein
Protein accession: XP_001387344
Protein GI: 126273932
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00012281
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.103056
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGTAA ATAATATTGA ACTCATCAGG AATTGCGCTT CCTGCAGGAG GCTACATCTG 
CTTTGGTCTT TCCTTCTTAA CTATGCTATT TCAGGTTGAT GATGAAATTA CAAACAAAGG
CTAAACACGG TTTGTGGAAT TCAGTCGTTA AAGCGAAATA TCCGAGTCTA ACAGAAGCAC
TGGCAGAATT TTTGACTTTG TATTTTCCCT TTTTTACCAA TTGCTACTTC CTTCTATCTG
CGTCTGTCTC TGTAGCGCTA ATAACCATTC TTCACTGTTT GTGGTCTTCA ATCCAATAGC
CACAATATTC AAGAGTCAGA AGATAGTCAA GAAGAATCGA GTTGTTTTTT CTTGAATTTC
ATAAGATTTC ACAAGAAGGC CATTTCAAAG AATTTTAATA CTACTTTTCC TCAATATAAA
GCTATCATCA CTAGCCTAGA GTTGAAGTTA GACAGAGAAG AGTATAGAAT ACTCAACACT
ACAATTGTCA ACTTATAATT CAGTATGAGC TACAACAATA AGAAGTTCGA TCCTCCTTCC
TACCCTCCTC CGTCAGCAAA CAACTACGAC CCTCAGAACC AACAGGGTTA TGGAGGGTAC
CAACAGCCTC AATACTCGCA GGAGCCTGAA GAGGAATTGT ATGATCAAAC CGACTTGGAA
AATGGCCAAT TCCAGAATTA CAGCGAGAAG CCAGTGAGTT CCGAAAACTT TGAGGAAAGT
TTTAAGATAG AAAAGCCAAA ATGGAATGAC TGGCCATTCA CACTTTTCTT CCTCGCTGTC
GTTGTTGGAT TCGTAGCTGT AGCAGTAATT ACTATCAATG CCTTGAGAGC TAGGTTCGGT
TTTGAAGGAA CAGGCATCTA TGGCTCGTCC AATACCTTCA GTTTGAATAC GAATACGATC
GTCTTGTTCG CTTTTGTCAT TGTCGTGGGG TTGGTATTGT CTACTTTGAT TATGGTATAC
GCTCGTATGG CTCCCCGGAT TTTTATCACT ACTGGTTTGA TCTTGAATGT GGTATTGGGC
ATCGGTACAG CCATTTATTA TTTCGTAGTT CACTATTACT CTGCTGCCAT TGTGTTCTTG
GTGTTCTCGC TCTTCTCGGC CTACTGCTAC TGGAGTTGCC GTAGCAGGAT CCCGTTCAGT
GCGACCGTTC TTGAGATTAC CATTGATGTC ATGAAAAGAT ACCCCTCAAC CTTGGTAGTT
TCCCTCATTG GTATAATTGC TTCGGCTGCT TTCAGTGCTT TGTTTAGTAT TGTAATTGTC
GCTACTTATG TTAAGTTTGA TCCTAACCCA AACAACGAAG GTTGTTCTGT TGGTGGAGGC
AACTGTTCGC AAGCCAAGTT GGTAGGTGTT CTTGTGTTTG TGTTCTTTGC TGGATTCTAC
ATATCTGAAG TTTTCAGAAA TGTCATCCAT GTTGTCATAG CTGGTATTTA TGGTACCTGG
TATTACTTGG CTGGATCCGA TCAGGGTGCC CCAAGAGTTC CAGCTTTAGG TGCTTTGAAG
AGAGCCTTGA CTTACTGTTT TGGTTCTATT TGTTTCGGTT CTTTGATCGT AGCTTTTATC
CAACTTCTCA GGGCATTTAT CCAAGCTCTT AGACAGAATG CCTTAGCTGG TGGTGACAAC
TGCGCCTTCT GTGCTCTCTG TATTCTTGAT TTGATCGTTG GTTTCATCGA CTGGATGGTC
CGTTACTTCA ACCATTATGC TTACTGCTAC GTTGCTTTAT ACGGAAAGAG TTATCTCAGA
TCAGCAAAGG ACACCTTCGA CTTGCTCCGT TATAAAGGTA TGGATGCTTT GATTAATGAC
TGTTTCATTA ATACTGCATT GAATTTCTAT GCCTTGTTTG TTGCCTTCGT CACTGCTCTC
TTGTCTTTCC TCTACTTGAG ATTCACTGAA CCAGATTACA ATGCTGACGG TAACTTCTAT
GCGCCAGTTA TGGCGTTTGC CTTCTTAATC TCTGGACAGA TCACCCGTGT TGCTACTTCA
GTCATCGAGT CTGGTATTTC TACATTCTTC GTCGCCTTGG CTAAGGACCC AGAAGTGTTC
CAGATGACTA ACAGGAACAG ATTCGACGAG ATCTTCAGAA ACTACCCCCA GGTATTACAG
AAGATCACCA GTGACCATTA G
 
Protein sequence
MQKFDPPSYP PPPQYSQEPE EELYDQTDLE NGQFQNYSEK PVSSENFEES FKIEKPKWND 
WPFTLFFLAV VVGFVAVAVI TINALRARFG FEGTGIYGSS NTFSLNTNTI VLFAFVIVVG
LVLSTLIMVY ARMAPRIFIT TGLILNVVLG IGTAIYYFVV HYYSAAIVFL VFSLFSAYCY
WSCRSRIPFS ATVLEITIDV MKRYPSTLVV SLIGIIASAA FSALFSIVIV ATYVKFDPNP
NNEGCSVGGG NCSQAKLVGV LVFVFFAGFY ISEVFRNVIH VVIAGIYGTW YYLAGSDQGA
PRVPALGALK RALTYCFGSI CFGSLIVAFI QLLRAFIQAL RQNALAGGDN CAFCALCILD
LIVGFIDWMV RYFNHYAYCY VALYGKSYLR SAKDTFDLLR YKGMDALIND CFINTALNFY
ALFVAFVTAL LSFLYLRFTE PDYNADGNFY APVMAFAFLI SGQITRVATS VIESGISTFF
VALAKDPEVF QMTNRNRFDE IFRNYPQVLQ KITSDH
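
Cross-checking the record

The gene length and GC content fields can be cross-checked against the coordinates and the sequence text. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the sequence is pasted as plain text in the 10-base blocks shown above. The function names clean_sequence, span_length and gc_fraction are hypothetical helpers introduced here for illustration.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string.

    Removes the spaces between 10-base blocks and any line breaks.
    """
    return "".join(text.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Coordinates copied from the Start bp and End bp fields of the record.
gene_length = span_length(592605, 594725)

# First line of the gene sequence, used here only as a short demo input.
first_line = "ATGCAGGTAA ATAATATTGA ACTCATCAGG AATTGCGCTT CCTGCAGGAG GCTACATCTG"
demo_length = len(clean_sequence(first_line))
demo_gc = gc_fraction(first_line)
```

To check the whole record, pass the full gene sequence text to clean_sequence and gc_fraction, then compare the results with the Gene Length and GC content fields.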