Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_66828 |
Symbol | EFT2 |
ID | 4837377 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 496572 |
End bp | 499625 |
Gene Length | 3054 bp |
Protein Length | 842 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640388692 |
Product | Elongation factor |
Protein accession | XP_001382854 |
Protein GI | 126132658 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0480] Translation elongation factors (GTPases) |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR00490] translation elongation factor aEF-2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTATGT ACTACAAGCT CGAAACTGCT GTTGTGCACA TGTAGAATAC GTATTGTCTC GAATTGAACC AGGACGCTTT TTTCTGCCAC TCCCCAATCC GTTGATCTGC CTCTTGACTG AGAGCTGGGG GATGAACTGA GAAAATTTTT TCTCGCCAGG AATGAGACGA AATGTCAGCA CCAGAATACT AGAGGCAATG CCAGATGTAA TTGTCAGAAC TCGTATTGCA AGTATAACAT ATACAGCGAG TACAATGTCT ACAAAGTATA GCATTTACGT CATAAACAAC TTTTGCAACA TTTACAGATT CACAGCATTA CAGCAGCAAT CCTCTACATA ATGTACTCTT CATACAGAAA TACTAACATC TTACAGTTGC TTTCACTATT GAACAAATCC GTGAATTGAT GGACAAGGTT ACGAACGTTC GTAACATGTC CGTCATTGCT CACGTCGATC ACGGTAAGTC TACCTTGACC GATTCCTTGG TCCAAAGAGC TGGTATTATC TCTGCTGCTA AGGCCGGTGA AGCCAGATTC ACTGACACCA GAAAGGATGA ACAAGAAAGA GGTATCACTA TCAAGTCCAC TGCCATTTCT TTGTACGCTG CCATGACCGA TGATGACGTT AAGGAAATCA AGCAAAAGAC CGAAGGTAAC TCTTTCTTGA TCAACTTGAT CGACTCGCCA GGTCACGTTG ACTTCTCCTC TGAAGTCACT GCTGCTTTGC GTGTTACCGA TGGTGCTTTG GTTGTCGTCG ACTGTGTTGA AGGTGTCTGT GTCCAAACCG AAACCGTCTT GAGACAATCT TTGGGTGAAA GAATCAAGCC AGTTGTCATC ATCAACAAGG TTGACAGAGC TTTGTTGGAA TTGCAAGTCA CCAAGGAAGA CTTGTACCAA TCCTTCGCCA GAACTGTTGA ATCCGTTAAC GTTATCATCT CCACCTACGT TGACCCAGCC ATCGGTGACT GTCAAGTCTA CCCAGACAAG GGTACCGTTG CTTTCGGTTC CGGTTTGCAC GGTTGGGCTT TCACCGTCAG ACAATTCGCT TCCAGATACT CCAAGAAGTT TGGTGTCGAC AGACTCAAGA TGATGGAAAG ATTGTGGGGT GACTCTTACT TCAACCCAAA GACCAAGAAG TGGACCAACA AGGACAAGGA TGCCGATGGA AAGCAATTGG AAAGAGCCTT CAACATGTTC GTCTTGGACC CAATCTTTAG ATTGTTTGCT GCCATCATGA ACTTCAAGAA GGACGAAATC CCAACCTTGT TGGAAAAGTT GGAAATCTCC TTGAAGGGTG ACGAAAAGGA ATTGGAAGGT AAGGCTTTGT TGAAGGTTGT CATGAGAAAG TTCTTGCCAG CTGCTGACGC TTTGTTGGAA ATGATCATCA TCCACTTGCC ATCCCCAGTC ACTGCCCAAG CTTACAGAGC TGAAACTTTG TACGAAGGTC CATCTGACGA TGCTTCTTGT ACCGCCATCA GAAACTGTGA CCCTAAGGCT GACTTGATGT TGTACGTCTC CAAGATGGTC CCAACCTCTG ATAAGGGTAG ATTCTACGCT TTCGGTAGAG TTTTCGCTGG TACCGTCAGA TCTGGTCAAA AGGTCAGAAT CCAAGGTCCA AACTACCAGG TCGGTAAGAA GGAAGACTTG TTCCTTAAGT CTATCCAAAG AACCGTCTTG ATGATGGGAA GATTCGTCGA AGCCATCGAT GACTGTCCAG CTGGTAACAT TGTCGGTTTG GTTGGTATTG ACCAGTTCTT GTTGAAGTCT GGTACCATCA CCACCTCCGA CGCTTCCCAC AACATGAAGG TCATGAAGTT CTCTGTCTCT CCAGTTGTGC AAGTTGCCGT TGAAGTCAAG AACGCTAACG ACTTGCCAAA GTTGGTTGAA GGTTTGAAGA GATTGTCCAA GTCCGACCCT TGTGTCTTGT GTACCATCAA CGAATCCGGT GAACACATTG TTGCCGGTAC CGGTGAATTG CACTTGGAAA TCTGTTTGCA AGATTTGGAA AACGACCACG CTGGTGTTCC ATTGAAGATT TCCCCACCTA TTGTTTCCTA CAGAGAAACC GTTGAAGGTG AATCCTCCAT GGTTGCCTTG TCTAAGTCGC CAAACAAGCA TAACAGAATC TACGTTAAGG CTCAACCAAT CGATGAAGAA GTTTCCCTTG ACATCGAAGC AGGTGTTGTT AACCCAAGAG ATGATTTCAA GGCCAGAGCT AGAGTTTTGG CTGACAAGCA CGGCTGGGAT GTTACTGACG CCAGAAAGAT CTGGTGTTTC GGTCCAGACG GTACTGGTCC TAACGTTGTT GTTGACCAGT CCAAGGCTGT TCAATACTTG AACGAAATCA AGGATTCCGT TGTTGCTGCT TTCCAATGGG CTACCAAGGA AGGTCCTATC TTCGGTGAAA CCGTCAGATC CATCAGAGTC AACATCTTGG ATGTTACCTT GCACGCTGAT GCTATCCACA GAGGTGGTGG TCAAATCATC CCAACCATGA GAAGAGTTAC TTACGCTTCC ATGTTGTTGG CTGAACCAGC CATTCAAGAA CCAGTCTTCT TGGTTGAAAT CCAATGTCCA GAAAACGCCA TTGGTGGTAT CTACTCTGTC TTGAACACAA AGAGAGGTCA AGTTATCTCT GAAGAACAAA GACCAGGTAC CCCATTGTTC ACTGTTAAGG CCTACTTGCC AGTTAACGAA TCTTTCGGTT TCACCGCTGA CTTGAGAAAG TCTACTGGTG GTCAAGCTTT CCCACAATTG ATTTTCGACC ATTGGTCCGT CTTGAATGGT GACGTTACCG ACCCTAACTC CAAGCCAGGT GCCATTGTCA AGGCCAAGAG AATCAGACAA GGTATGAAGC CAGAAGTTCC AGGTTACGAA GAATACTACG ATAAGTTGTA GGTATGATGG TCTTTATCAA TAAATTCCAA AAAGAGAGAG AAGGCAGTTT TTTGCTGTTC ATTTTTGTTT TGTATATCTG TTAATATAGT CATTTCTCTA TAAACTTATT GTTTCTAGTT CACACCAGTA TAAATACAAA TTTATATAAA ATTC
|
Protein sequence | MVAFTIEQIR ELMDKVTNVR NMSVIAHVDH GKSTLTDSLV QRAGIISAAK AGEARFTDTR KDEQERGITI KSTAISLYAA MTDDDVKEIK QKTEGNSFLI NLIDSPGHVD FSSEVTAALR VTDGALVVVD CVEGVCVQTE TVLRQSLGER IKPVVIINKV DRALLELQVT KEDLYQSFAR TVESVNVIIS TYVDPAIGDC QVYPDKGTVA FGSGLHGWAF TVRQFASRYS KKFGVDRLKM MERLWGDSYF NPKTKKWTNK DKDADGKQLE RAFNMFVLDP IFRLFAAIMN FKKDEIPTLL EKLEISLKGD EKELEGKALL KVVMRKFLPA ADALLEMIII HLPSPVTAQA YRAETLYEGP SDDASCTAIR NCDPKADLML YVSKMVPTSD KGRFYAFGRV FAGTVRSGQK VRIQGPNYQV GKKEDLFLKS IQRTVLMMGR FVEAIDDCPA GNIVGLVGID QFLLKSGTIT TSDASHNMKV MKFSVSPVVQ VAVEVKNAND LPKLVEGLKR LSKSDPCVLC TINESGEHIV AGTGELHLEI CLQDLENDHA GVPLKISPPI VSYRETVEGE SSMVALSKSP NKHNRIYVKA QPIDEEVSLD IEAGVVNPRD DFKARARVLA DKHGWDVTDA RKIWCFGPDG TGPNVVVDQS KAVQYLNEIK DSVVAAFQWA TKEGPIFGET VRSIRVNILD VTLHADAIHR GGGQIIPTMR RVTYASMLLA EPAIQEPVFL VEIQCPENAI GGIYSVLNTK RGQVISEEQR PGTPLFTVKA YLPVNESFGF TADLRKSTGG QAFPQLIFDH WSVLNGDVTD PNSKPGAIVK AKRIRQGMKP EVPGYEEYYD KL
|
| |