Gene PICST_60303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_60303 
Symbol 
ID4839082 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp695996 
End bp697297 
Gene Length1302 bp 
Protein Length433 aa 
Translation table12 
GC content45% 
IMG OID640390397 
Productpredicted protein 
Protein accessionXP_001384798 
Protein GI150865539 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0464] ATPases of the AAA+ class 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGGCT CTGCCGATTT CTTATCGAAG GGAATAGACT TAGTCCAGAA GGCTATCGAT 
GCCGACACTG CCACCCGCTA TGAGGAAGCT TACAAACTCT ACTACAACGG CTTGGAGTAC
TTGATGTTGG CTATCAAATA CGAGAAGAAT CAGAAGTCCA AGGAACTCGT CAAGTCCAAG
TTCACTGAGT ACTTGACTAG AGCTGAACAG TTGAAAGATC ACTTGGAAAA GCAACTGAAC
AAGTCAAACT CTGCCGAAAG CTCATCAACG AACGGATCTA CAAAGGCAAA GAAGAGTGGC
GACGGTGACG ATGACGATGC CGATACTAAG AAGTTGAGAG GAGCTTTAGC TGGTGCCATT
TTGTCAGAGA AGCCCAATGT CAAATGGGAG GATATTGCTG GATTGGACGC AGCCAAGGAG
GCGTTGAAGG AAGCCGTGAT TTTACCGGTC AAGTTCCCCC AATTATTCGT CGGGAACAGA
AAGCCTACGT CCGGTATCTT GTTGTTTGGG CCTCCAGGTA CGGGTAAGTC ATATTTGGCC
AAGGCTGTGG CCACCGAAGC CAACTCTACT TTCTTCTCAG TTTCATCGTC TGATTTGGTA
TCCAAATGGA TGGGTGAATC CGAAAGATTA GTCAAGCAGT TGTTTACAAT GGCCAGAGAA
AACAAGCCGG CCATTATCTT CATCGATGAA GTTGATGCTT TGTGTGGTCC CAGGGGAGAA
GGAGAAAGTG AAGCGCTGAG GAGAATAAAG ACAGAACTAT TGGTTCAGAT GAACGGGGTT
GGAAACGATT CTAGTGGTGT GTTAGTCTTG GGAGCAACCA ATATTCCATG GCAATTGGAC
GCCGCCATCA GAAGAAGATT CGAAAGAAGA ATCTATATTG CTTTGCCAGA AGTAGAGGCC
AGGACTAGGA TGTTTGAAAT CAATATCGGT GGTGTTCCTT GTGAATGTAC TCCTCAGGAC
TACAAGGCCT TGGCCGAGAT GACTGATGGA TACTCTGGAC ACGATGTGGC CGTTGTAGTA
AGAGACGCAT TAATGCAGCC TATTAGAAAA ATCCAGCAAG CAACCCACTT CAAGCTGGTA
TTAGATGACG ACGGGAATGA AAAGTTGACT CCTTGTTCTC CAGGAGATGA TGGCGCAAGA
GAAATGAACT GGATGGATAT TGGAACAGAC GAATTAAAGG AACCTCCATT GACAATTAAA
GACTTCATCA AATCCATAAA GAGTAATAGA CCTACTGTCA ATGAAGCCGA TATTCAAAAC
CACATTAAAT TCACCGAAGA TTTTGGTCAA GAAGGAAACT GA
 
Protein sequence
MSGSADFLSK GIDLVQKAID ADTATRYEEA YKLYYNGLEY LMLAIKYEKN QKSKELVKSK 
FTEYLTRAEQ LKDHLEKQSN KSNSAESSST NGSTKAKKSG DGDDDDADTK KLRGALAGAI
LSEKPNVKWE DIAGLDAAKE ALKEAVILPV KFPQLFVGNR KPTSGILLFG PPGTGKSYLA
KAVATEANST FFSVSSSDLV SKWMGESERL VKQLFTMARE NKPAIIFIDE VDALCGPRGE
GESEASRRIK TELLVQMNGV GNDSSGVLVL GATNIPWQLD AAIRRRFERR IYIALPEVEA
RTRMFEINIG GVPCECTPQD YKALAEMTDG YSGHDVAVVV RDALMQPIRK IQQATHFKSV
LDDDGNEKLT PCSPGDDGAR EMNWMDIGTD ELKEPPLTIK DFIKSIKSNR PTVNEADIQN
HIKFTEDFGQ EGN