Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_82202 |
Symbol | |
ID | 4837417 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 1719235 |
End bp | 1722381 |
Gene Length | 3147 bp |
Protein Length | 997 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640388732 |
Product | predicted protein |
Protein accession | XP_001382552 |
Protein GI | 126132054 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0532] Translation initiation factor 2 (IF-2; GTPase) |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR00491] translation initiation factor aIF-2/yIF-2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.236958 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAAGA AATCTAAAAA AGGAGCTAAA TCTGGAGGAG ATTTCTGGGA CGATGAAGAA CTCGACCAGG AAAACACCGG AGCTGAACAC CTTGGTGAAG AACCAACCGA AAATGGCGAT GCTCCACAAG AGACTTCAGC ACAAGAAGAA CCAGAAGCTG CATCTGCCGA TGACATCGCT GGTGATTTCT TAAGTTCTAT CCGTAAATCC AAGCAAAAAA AGGACCAAAA GGAAGAGGAA GACAAGAAGA CCAAAGATGG TCCTAAGATC TTATCCAAGA AGGAAAAGGA AAGATTGAAG AAGGAAGCTG AAAAACAGTT GAAGAAGGAA CAGGCAGCAA AGAAGAAAGC GCAACAAGCT ACTAAGAAAG AACAAATCAA GGAAGCTAAC AAGCAAAATG CAGCCGCTGC TAATGCATCC GCTTCTGCAA CTCCTGAACC AGAAGCTGAA GAAGTCGAAG CCAAGAAGAC CCCAGCTAAG AAAGGAGGTA AGAAGGCCCC GGCTGGTCTT GCTGCTTTGA GAAAGCGATT AGAGTTGAAA AAGCAACTTG AAGAGGAGCA GCAGAGATTA GAGGAAGAAG AGGAAGCCAA GAGATTGGAA GAAGAGAGAT TGGCAGCTGA AGAAGAAGAA AGAAAAGAAG CTGTTAGAGC CGCTAAGAAA GAAAAGGACA GATTAAAGAA AGAACAATTG AAAGCAGAAG GTAAGTTGTT AACTAAGAAA CAAAAGGAAG AGAAGAAGTT ACAAGAGCGT CGTCGTCAGC AATTGTTACA AGGTAATGTT ACTGTTGCTG GTTTACAACA ATCTCCTGAC GGACCTAAAC CAAAGAAGGT TGTCTACACT AAGAAGAAGT CTACCAAAGC TAAAACTTTC ATTCAAAAGC CTGCTTCTAC TGCCTCTGAA TCAAAGAAGG TCAATGAGGA GGAAGACGAA GTTCCTGTCG ATGACTGGGA AAAAATGGCT CTCGATGACG ATGAACCAGT GGCCGATGAT TGGGAAGCTG CTTTGAATGA AGATGTTGAA GATAAAGCTG ACGTTGAAGA GGATAATGAT GCTGAAGCTG AAGAAGAAGA AGCGGAACGT AAAGCTAGGG AGGAAGCCGA ACGTAAATCT CAAGAAGAAG CTGCTAGAAA GAAGAAAGAA GAAGAGGCAA GAGCTGCTGC AGCTGCTGCC GCCGCCGCCG AACAAGCTAA GTTGGCTGCC CAAAAGACTA TTTCTCCGGA AAAGGATTTG CGTTCTCCTA TTTGTTGTAT TTTGGGTCAC GTCGATACTG GAAAGACTAA ATTATTGGAC AAAGTTCGTC AAACTAATGT TCAAGGAGGT GAAGCTGGTG GTATAACCCA ACAAATCGGT GCTACTTACT TTCCAATAGA TGCTCTTAAG CACAAGACTT CTGTCATGGC TCAATACGAA AAGCAGACTT TCGATGTTCC TGGTTTGTTG ATTATTGATA CACCGGGACA TGAGTCCTTC ACAAACTTGA GATCTCGTGG TTCTTCCTTA TGTAATATTG CGATTTTGGT CATTGACATC ATGCATGGTT TGGAACAACA AACTCTTGAA TCTATAAGAT TATTGAGGGA CAGAAAAGCA CCATTCGTAG TTGCTTTGAA CAAAATTGAT AGATTGTATG ACTGGAAAGA GATTCCAAAC AACTCGTTTA GAGATTCCTT CGCTAAGCAA TCGAAGGCGG TTCAAGCAGA ATTTCATAAC AGATACGAAC AGATCAGGCT TGCCTTATCT GAACAAGGTT TGAACTCAGA ATTGTATTTC CAAAACAAGA ATATTTCAAA GTACGTTTCC ATTGTTCCAA CTTCAGCTGT CACTGGAGAA GGTGTTCCAG ATTTGTTGTG GTTGTTGCTT GAGTTGACTC AGAAGAGAAT GTCTAAGCAA TTGATGTACT TAAGTAAGGT TGAAGCTACA ATTTTGGAAG TCAAGGTTGT TGAAGGTTTC GGTTACACTA TTGATGTTGT GTTGTCAAAC GGTATTTTGA GGGAGGGTGA TAGAGTTGTA TTGTGTGGTT TGAACGGGCC AATAGCGACA AATATCAGAG CATTATTAAC TCCTCCACCT GCTCGTGAAT TGCGTGTCAA ATCTGAATAT GTTCACCACA AGGAGGTCAA GGCTGCTTTG GGTGTCAAGA TTGCTGCTAA TGATTTGGAA AAGGCTGTTG CTGGTTCCAG ATTGATCGTT GTCGGTGAAG ACGACGATGA AGATGAAATT ATGGAAGAAG TTATGGATGA CTTAACAGGT TTGTTGGACT CGGTTGATAC ATCTGGTAAG GGTGTGGTAG TTCAAGCTTC TACATTGGGT TCCTTGGAAG CCTTGTTGGA TTTCCTTAAG GATATGAAGA TTCCAGTTAT GTCTATTGGT TTGGGTCCAG TCTACAAGAG AGATGTAATG AAAGCTACAA CCATGCTAGA GAAAGCTCCA GAACTAGCAG TTATGTTGTG TTTCGATGTC AAGGTTGATA AGGAAGCTGA ACAATATGCT GACGAACAAA ACATTAAGAT TTTCAATGCT GATATTATCT ACCATTTGTT CGATGCATTT ACTGCTTACC AGGAAAAGCT TCTCGAAATC CGTCGAAAGG ATTTCATGGA ATATGCTGTC TTGCCATGTG TCTTGAAGAC AATTCAAATT ATCAACAAGC GTAACCCAAT GATCATTGGT GTTGACGTTG TTGAGGGCGC CGTTCGTATC GGTACTCCAA TATGTGCTGT TCGTCAAGAT CCTGTTACAA AGCAGCCTAA CATCATGGTT TTGGGCAAGG TAGTTTCTTT GGAAGTTAAC CACAAATCTC ACGACATTAT TAAGAAGGGC CAAACTTCTG CTGGTGTTGC CATGAGATTG GACAATCCAT CATCTGCCCA ACCAACCTGG GGAAGACACG TTGATGAGAC TGATAACTTG TACTCATTAA TCACTCGTAG GTCAATTGAT ACCTTGAAGG ATCCAGCTTT CCGTGACACT GTCTCCAGAG ATGACTGGCT CTTGATCAAA AAGTTGAAGC CAGTGTTCGA CATTAAATAA AATCCTAGTT GGTTACTTTA TCTCTTTTCT CATGTCCACA TTTCATTTTC TACATATAGG CATGCATAGT TACAAAAAAT AATTTATACG ATAGTAACCT AATTGTATTT TTACTAC
|
Protein sequence | MAKKSKKGAK SGGDFWDDEE LDQENTGAEH LETSAQEEPE AASADDIAGD FLSSIRKSKQ KKDQKEEEDK KTKDGPKILS KKEKERLKKE AEKQLKKEQA AKKKAQQATK KEQIKEANKQ NAAAANASAS ATPEPEAEEV EAKKTPAKKG GKKAPAGLAA LRKRLELKKQ LEEEQQRLEE EEEAKRLEEE RLAAEEEERK EAVRAAKKEK DRLKKEQLKA EGKLLTKKQK EEKKLQERRR QQLLQGNVTV AGLQQSPDGP KPKKVVYTKK KSTKAKTFIQ KPASTASESK KVNEEEDEVP VDDWEKMALD DDEPVADDWE AALNEDVEDK ADVEEDNDAE AEEEEAERKA REEAERKSQE EAARKKKEEE ARAAAAAAAA AEQAKLAAQK TISPEKDLRS PICCILGHVD TGKTKLLDKV RQTNVQGGEA GGITQQIGAT YFPIDALKHK TSVMAQYEKQ TFDVPGLLII DTPGHESFTN LRSRGSSLCN IAILVIDIMH GLEQQTLESI RLLRDRKAPF VVALNKIDRL YDWKEIPNNS FRDSFAKQSK AVQAEFHNRY EQIRLALSEQ GLNSELYFQN KNISKYVSIV PTSAVTGEGV PDLLWLLLEL TQKRMSKQLM YLSKVEATIL EVKVVEGFGY TIDVVLSNGI LREGDRVVLC GLNGPIATNI RALLTPPPAR ELRVKSEYVH HKEVKAALGV KIAANDLEKA VAGSRLIVVG EDDDEDEIME EVMDDLTGLL DSVDTSGKGV VVQASTLGSL EALLDFLKDM KIPVMSIGLG PVYKRDVMKA TTMLEKAPEL AVMLCFDVKV DKEAEQYADE QNIKIFNADI IYHLFDAFTA YQEKLLEIRR KDFMEYAVLP CVLKTIQIIN KRNPMIIGVD VVEGAVRIGT PICAVRQDPV TKQPNIMVLG KVVSLEVNHK SHDIIKKGQT SAGVAMRLDN PSSAQPTWGR HVDETDNLYS LITRRSIDTL KDPAFRDTVS RDDWLLIKKL KPVFDIK
|
| |