Gene PICST_81386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_81386 
Symbol 
ID4836899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2442529 
End bp2444190 
Gene Length1662 bp 
Protein Length553 aa 
Translation table12 
GC content46% 
IMG OID640388214 
Productpredicted protein 
Protein accessionXP_001382696 
Protein GI150864020 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02340] T-complex protein 1, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.672282 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATTCCA CGGCTCGTTC AGATACATTA TTCCTCGGAG CCCAGAAAAT CTCGGGCGAC 
GACGTCCGTA ATCAGACAGT ATTGGCTACC CAGGCTGTGG CCAATGTCGT CAAGTCTTCA
TTAGGACCTG TAGGCTTGGA CAAGATGCTC GTCGATGACA TTGGTGACGT GACTGTGACA
AACGATGGTG CAACTATTTT GTCTTTACTA GACGTGCAGC ATCCCGCCGG CAAGATCTTG
GTGGAATTGG CCCAACAACA GGATCGTGAA GTTGGAGACG GTACCACCTC CGTAGTGATT
ATTGCCAGTG AGCTCTTGAA ACGAGCCCAT GAGTTGGTCA AGAACAAAAT ACACCCCACC
ACCATCATCT CGGGCTACCG TGTGGCGTTG AGGGAAGCTA TCCGTTACAT CAATGAGGTA
TTGTCGCAGC TGGTTGACAG CTTGGGCAAA GACACCATAG TAAATATTGC AAAGACTTCG
ATGTCGTCCA AGATCATTGG CTCTGACTCG GATTTCTTCT CCCAAATGGT TGTGGATGCC
ATGTTGGCGG TTAAGACGAC GAATGGCAAG GGTGAAACAA AATACCCAGT CAAGGCCGTT
AACATCTTGA AGGCACACGG CAAGTCATCG ACCGAATCGA TGCTCGTAGA TGGTTATGCC
TTGAACTGTA CTGTAGCTTC TCAGGCCATG GTCAAGTCTG TAAAAAATGC AAGAATCGCT
TGTTTGGACA TAAATTTGCA GAAGGCTAGA ATGGCTATGG GTGTTCAGAT AAATATCGAC
GATCCAGACC AGTTAGAGGA GATCAGAAAG AGAGAGTACG GCATAATCAT TGAGAGAATT
CGTAAAATCT TGGGTGCTGG AGCCAACGTC ATTTTGACCA CTAAGGGTAT CGATGACTTG
TGCTTGAAGG AATTCGTGGA AGCTGGCGCC ATGGCAGTCA GACGTTGTAA GAAGGAAGAC
TTACGTAGAA TAGCCAGAGC CACAGGTGCT ACATTAGTAA GCAGTTTGTC CAACTTAGAA
GGTGAAGAAA CCTTCGATGC TTCGTCATTG GGTTCTGCTG AAGAAGTTGT ACAGACCAGA
ATCAGCGATG ACGAATGTAT TTTGGTCAAG GGAACGAAGC AGCACTCTTC CTCTTCTATC
ATATTGAGAG GTCCTAACGA TTACTCTTTA GACGAAATGG AAAGATCCTT GCATGACTCT
TTGTCGGTTG TCAAAAGAAC CTTGGAAAGT GGCAATATAG TGCCTGGTGG AGGTGCTGTT
GAAACTGCTT TGAACATCTA CTTGGAAAAC TTCGCTACTA CAGTTGGCTC GAGAGAACAG
TTGGCCATTG CTGAGTTCGC TAATGCCTTG TTGGTAATCC CAAAGACTTT AGCTGTCAAT
GCAGCTAAGG ATGCTTCAGA TTTGGTTTCC AAGTTAAGAA CCTACCATGC TGCTTCGCAG
ACTGCTTTGC CAACTGACAA GAAGAGAAAG TACAAGAACT ACGGCTTAGA CTTGATCGAA
GGTAAGATTG TCAACGAAAT CTCCCACGGT GTCTTGGAGC CTACGATTTC TAAGGTCAAG
TCCTTGAAGT CGGCCTTAGA AGCATGTGTT GCCATCTTAA GAATTGATAC CATGATTGAG
GTCAACCCAG AAGCACCAAA AGAAGACCCT CACGACCATT AG
 
Protein sequence
MYSTARSDTL FLGAQKISGD DVRNQTVLAT QAVANVVKSS LGPVGLDKML VDDIGDVTVT 
NDGATILSLL DVQHPAGKIL VELAQQQDRE VGDGTTSVVI IASELLKRAH ELVKNKIHPT
TIISGYRVAL REAIRYINEV LSQSVDSLGK DTIVNIAKTS MSSKIIGSDS DFFSQMVVDA
MLAVKTTNGK GETKYPVKAV NILKAHGKSS TESMLVDGYA LNCTVASQAM VKSVKNARIA
CLDINLQKAR MAMGVQINID DPDQLEEIRK REYGIIIERI RKILGAGANV ILTTKGIDDL
CLKEFVEAGA MAVRRCKKED LRRIARATGA TLVSSLSNLE GEETFDASSL GSAEEVVQTR
ISDDECILVK GTKQHSSSSI ILRGPNDYSL DEMERSLHDS LSVVKRTLES GNIVPGGGAV
ETALNIYLEN FATTVGSREQ LAIAEFANAL LVIPKTLAVN AAKDASDLVS KLRTYHAASQ
TALPTDKKRK YKNYGLDLIE GKIVNEISHG VLEPTISKVK SLKSALEACV AILRIDTMIE
VNPEAPKEDP HDH