Gene PICST_82202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_82202 
Symbol 
ID4837417 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1719235 
End bp1722381 
Gene Length3147 bp 
Protein Length997 aa 
Translation table12 
GC content41% 
IMG OID640388732 
Productpredicted protein 
Protein accessionXP_001382552 
Protein GI126132054 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0532] Translation initiation factor 2 (IF-2; GTPase) 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00491] translation initiation factor aIF-2/yIF-2 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.236958 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAGA AATCTAAAAA AGGAGCTAAA TCTGGAGGAG ATTTCTGGGA CGATGAAGAA 
CTCGACCAGG AAAACACCGG AGCTGAACAC CTTGGTGAAG AACCAACCGA AAATGGCGAT
GCTCCACAAG AGACTTCAGC ACAAGAAGAA CCAGAAGCTG CATCTGCCGA TGACATCGCT
GGTGATTTCT TAAGTTCTAT CCGTAAATCC AAGCAAAAAA AGGACCAAAA GGAAGAGGAA
GACAAGAAGA CCAAAGATGG TCCTAAGATC TTATCCAAGA AGGAAAAGGA AAGATTGAAG
AAGGAAGCTG AAAAACAGTT GAAGAAGGAA CAGGCAGCAA AGAAGAAAGC GCAACAAGCT
ACTAAGAAAG AACAAATCAA GGAAGCTAAC AAGCAAAATG CAGCCGCTGC TAATGCATCC
GCTTCTGCAA CTCCTGAACC AGAAGCTGAA GAAGTCGAAG CCAAGAAGAC CCCAGCTAAG
AAAGGAGGTA AGAAGGCCCC GGCTGGTCTT GCTGCTTTGA GAAAGCGATT AGAGTTGAAA
AAGCAACTTG AAGAGGAGCA GCAGAGATTA GAGGAAGAAG AGGAAGCCAA GAGATTGGAA
GAAGAGAGAT TGGCAGCTGA AGAAGAAGAA AGAAAAGAAG CTGTTAGAGC CGCTAAGAAA
GAAAAGGACA GATTAAAGAA AGAACAATTG AAAGCAGAAG GTAAGTTGTT AACTAAGAAA
CAAAAGGAAG AGAAGAAGTT ACAAGAGCGT CGTCGTCAGC AATTGTTACA AGGTAATGTT
ACTGTTGCTG GTTTACAACA ATCTCCTGAC GGACCTAAAC CAAAGAAGGT TGTCTACACT
AAGAAGAAGT CTACCAAAGC TAAAACTTTC ATTCAAAAGC CTGCTTCTAC TGCCTCTGAA
TCAAAGAAGG TCAATGAGGA GGAAGACGAA GTTCCTGTCG ATGACTGGGA AAAAATGGCT
CTCGATGACG ATGAACCAGT GGCCGATGAT TGGGAAGCTG CTTTGAATGA AGATGTTGAA
GATAAAGCTG ACGTTGAAGA GGATAATGAT GCTGAAGCTG AAGAAGAAGA AGCGGAACGT
AAAGCTAGGG AGGAAGCCGA ACGTAAATCT CAAGAAGAAG CTGCTAGAAA GAAGAAAGAA
GAAGAGGCAA GAGCTGCTGC AGCTGCTGCC GCCGCCGCCG AACAAGCTAA GTTGGCTGCC
CAAAAGACTA TTTCTCCGGA AAAGGATTTG CGTTCTCCTA TTTGTTGTAT TTTGGGTCAC
GTCGATACTG GAAAGACTAA ATTATTGGAC AAAGTTCGTC AAACTAATGT TCAAGGAGGT
GAAGCTGGTG GTATAACCCA ACAAATCGGT GCTACTTACT TTCCAATAGA TGCTCTTAAG
CACAAGACTT CTGTCATGGC TCAATACGAA AAGCAGACTT TCGATGTTCC TGGTTTGTTG
ATTATTGATA CACCGGGACA TGAGTCCTTC ACAAACTTGA GATCTCGTGG TTCTTCCTTA
TGTAATATTG CGATTTTGGT CATTGACATC ATGCATGGTT TGGAACAACA AACTCTTGAA
TCTATAAGAT TATTGAGGGA CAGAAAAGCA CCATTCGTAG TTGCTTTGAA CAAAATTGAT
AGATTGTATG ACTGGAAAGA GATTCCAAAC AACTCGTTTA GAGATTCCTT CGCTAAGCAA
TCGAAGGCGG TTCAAGCAGA ATTTCATAAC AGATACGAAC AGATCAGGCT TGCCTTATCT
GAACAAGGTT TGAACTCAGA ATTGTATTTC CAAAACAAGA ATATTTCAAA GTACGTTTCC
ATTGTTCCAA CTTCAGCTGT CACTGGAGAA GGTGTTCCAG ATTTGTTGTG GTTGTTGCTT
GAGTTGACTC AGAAGAGAAT GTCTAAGCAA TTGATGTACT TAAGTAAGGT TGAAGCTACA
ATTTTGGAAG TCAAGGTTGT TGAAGGTTTC GGTTACACTA TTGATGTTGT GTTGTCAAAC
GGTATTTTGA GGGAGGGTGA TAGAGTTGTA TTGTGTGGTT TGAACGGGCC AATAGCGACA
AATATCAGAG CATTATTAAC TCCTCCACCT GCTCGTGAAT TGCGTGTCAA ATCTGAATAT
GTTCACCACA AGGAGGTCAA GGCTGCTTTG GGTGTCAAGA TTGCTGCTAA TGATTTGGAA
AAGGCTGTTG CTGGTTCCAG ATTGATCGTT GTCGGTGAAG ACGACGATGA AGATGAAATT
ATGGAAGAAG TTATGGATGA CTTAACAGGT TTGTTGGACT CGGTTGATAC ATCTGGTAAG
GGTGTGGTAG TTCAAGCTTC TACATTGGGT TCCTTGGAAG CCTTGTTGGA TTTCCTTAAG
GATATGAAGA TTCCAGTTAT GTCTATTGGT TTGGGTCCAG TCTACAAGAG AGATGTAATG
AAAGCTACAA CCATGCTAGA GAAAGCTCCA GAACTAGCAG TTATGTTGTG TTTCGATGTC
AAGGTTGATA AGGAAGCTGA ACAATATGCT GACGAACAAA ACATTAAGAT TTTCAATGCT
GATATTATCT ACCATTTGTT CGATGCATTT ACTGCTTACC AGGAAAAGCT TCTCGAAATC
CGTCGAAAGG ATTTCATGGA ATATGCTGTC TTGCCATGTG TCTTGAAGAC AATTCAAATT
ATCAACAAGC GTAACCCAAT GATCATTGGT GTTGACGTTG TTGAGGGCGC CGTTCGTATC
GGTACTCCAA TATGTGCTGT TCGTCAAGAT CCTGTTACAA AGCAGCCTAA CATCATGGTT
TTGGGCAAGG TAGTTTCTTT GGAAGTTAAC CACAAATCTC ACGACATTAT TAAGAAGGGC
CAAACTTCTG CTGGTGTTGC CATGAGATTG GACAATCCAT CATCTGCCCA ACCAACCTGG
GGAAGACACG TTGATGAGAC TGATAACTTG TACTCATTAA TCACTCGTAG GTCAATTGAT
ACCTTGAAGG ATCCAGCTTT CCGTGACACT GTCTCCAGAG ATGACTGGCT CTTGATCAAA
AAGTTGAAGC CAGTGTTCGA CATTAAATAA AATCCTAGTT GGTTACTTTA TCTCTTTTCT
CATGTCCACA TTTCATTTTC TACATATAGG CATGCATAGT TACAAAAAAT AATTTATACG
ATAGTAACCT AATTGTATTT TTACTAC
 
Protein sequence
MAKKSKKGAK SGGDFWDDEE LDQENTGAEH LETSAQEEPE AASADDIAGD FLSSIRKSKQ 
KKDQKEEEDK KTKDGPKILS KKEKERLKKE AEKQLKKEQA AKKKAQQATK KEQIKEANKQ
NAAAANASAS ATPEPEAEEV EAKKTPAKKG GKKAPAGLAA LRKRLELKKQ LEEEQQRLEE
EEEAKRLEEE RLAAEEEERK EAVRAAKKEK DRLKKEQLKA EGKLLTKKQK EEKKLQERRR
QQLLQGNVTV AGLQQSPDGP KPKKVVYTKK KSTKAKTFIQ KPASTASESK KVNEEEDEVP
VDDWEKMALD DDEPVADDWE AALNEDVEDK ADVEEDNDAE AEEEEAERKA REEAERKSQE
EAARKKKEEE ARAAAAAAAA AEQAKLAAQK TISPEKDLRS PICCILGHVD TGKTKLLDKV
RQTNVQGGEA GGITQQIGAT YFPIDALKHK TSVMAQYEKQ TFDVPGLLII DTPGHESFTN
LRSRGSSLCN IAILVIDIMH GLEQQTLESI RLLRDRKAPF VVALNKIDRL YDWKEIPNNS
FRDSFAKQSK AVQAEFHNRY EQIRLALSEQ GLNSELYFQN KNISKYVSIV PTSAVTGEGV
PDLLWLLLEL TQKRMSKQLM YLSKVEATIL EVKVVEGFGY TIDVVLSNGI LREGDRVVLC
GLNGPIATNI RALLTPPPAR ELRVKSEYVH HKEVKAALGV KIAANDLEKA VAGSRLIVVG
EDDDEDEIME EVMDDLTGLL DSVDTSGKGV VVQASTLGSL EALLDFLKDM KIPVMSIGLG
PVYKRDVMKA TTMLEKAPEL AVMLCFDVKV DKEAEQYADE QNIKIFNADI IYHLFDAFTA
YQEKLLEIRR KDFMEYAVLP CVLKTIQIIN KRNPMIIGVD VVEGAVRIGT PICAVRQDPV
TKQPNIMVLG KVVSLEVNHK SHDIIKKGQT SAGVAMRLDN PSSAQPTWGR HVDETDNLYS
LITRRSIDTL KDPAFRDTVS RDDWLLIKKL KPVFDIK