Gene PICST_66828 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_66828 
SymbolEFT2 
ID4837377 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp496572 
End bp499625 
Gene Length3054 bp 
Protein Length842 aa 
Translation table12 
GC content45% 
IMG OID640388692 
ProductElongation factor 
Protein accessionXP_001382854 
Protein GI126132658 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0480] Translation elongation factors (GTPases) 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00490] translation elongation factor aEF-2 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTATGT ACTACAAGCT CGAAACTGCT GTTGTGCACA TGTAGAATAC GTATTGTCTC 
GAATTGAACC AGGACGCTTT TTTCTGCCAC TCCCCAATCC GTTGATCTGC CTCTTGACTG
AGAGCTGGGG GATGAACTGA GAAAATTTTT TCTCGCCAGG AATGAGACGA AATGTCAGCA
CCAGAATACT AGAGGCAATG CCAGATGTAA TTGTCAGAAC TCGTATTGCA AGTATAACAT
ATACAGCGAG TACAATGTCT ACAAAGTATA GCATTTACGT CATAAACAAC TTTTGCAACA
TTTACAGATT CACAGCATTA CAGCAGCAAT CCTCTACATA ATGTACTCTT CATACAGAAA
TACTAACATC TTACAGTTGC TTTCACTATT GAACAAATCC GTGAATTGAT GGACAAGGTT
ACGAACGTTC GTAACATGTC CGTCATTGCT CACGTCGATC ACGGTAAGTC TACCTTGACC
GATTCCTTGG TCCAAAGAGC TGGTATTATC TCTGCTGCTA AGGCCGGTGA AGCCAGATTC
ACTGACACCA GAAAGGATGA ACAAGAAAGA GGTATCACTA TCAAGTCCAC TGCCATTTCT
TTGTACGCTG CCATGACCGA TGATGACGTT AAGGAAATCA AGCAAAAGAC CGAAGGTAAC
TCTTTCTTGA TCAACTTGAT CGACTCGCCA GGTCACGTTG ACTTCTCCTC TGAAGTCACT
GCTGCTTTGC GTGTTACCGA TGGTGCTTTG GTTGTCGTCG ACTGTGTTGA AGGTGTCTGT
GTCCAAACCG AAACCGTCTT GAGACAATCT TTGGGTGAAA GAATCAAGCC AGTTGTCATC
ATCAACAAGG TTGACAGAGC TTTGTTGGAA TTGCAAGTCA CCAAGGAAGA CTTGTACCAA
TCCTTCGCCA GAACTGTTGA ATCCGTTAAC GTTATCATCT CCACCTACGT TGACCCAGCC
ATCGGTGACT GTCAAGTCTA CCCAGACAAG GGTACCGTTG CTTTCGGTTC CGGTTTGCAC
GGTTGGGCTT TCACCGTCAG ACAATTCGCT TCCAGATACT CCAAGAAGTT TGGTGTCGAC
AGACTCAAGA TGATGGAAAG ATTGTGGGGT GACTCTTACT TCAACCCAAA GACCAAGAAG
TGGACCAACA AGGACAAGGA TGCCGATGGA AAGCAATTGG AAAGAGCCTT CAACATGTTC
GTCTTGGACC CAATCTTTAG ATTGTTTGCT GCCATCATGA ACTTCAAGAA GGACGAAATC
CCAACCTTGT TGGAAAAGTT GGAAATCTCC TTGAAGGGTG ACGAAAAGGA ATTGGAAGGT
AAGGCTTTGT TGAAGGTTGT CATGAGAAAG TTCTTGCCAG CTGCTGACGC TTTGTTGGAA
ATGATCATCA TCCACTTGCC ATCCCCAGTC ACTGCCCAAG CTTACAGAGC TGAAACTTTG
TACGAAGGTC CATCTGACGA TGCTTCTTGT ACCGCCATCA GAAACTGTGA CCCTAAGGCT
GACTTGATGT TGTACGTCTC CAAGATGGTC CCAACCTCTG ATAAGGGTAG ATTCTACGCT
TTCGGTAGAG TTTTCGCTGG TACCGTCAGA TCTGGTCAAA AGGTCAGAAT CCAAGGTCCA
AACTACCAGG TCGGTAAGAA GGAAGACTTG TTCCTTAAGT CTATCCAAAG AACCGTCTTG
ATGATGGGAA GATTCGTCGA AGCCATCGAT GACTGTCCAG CTGGTAACAT TGTCGGTTTG
GTTGGTATTG ACCAGTTCTT GTTGAAGTCT GGTACCATCA CCACCTCCGA CGCTTCCCAC
AACATGAAGG TCATGAAGTT CTCTGTCTCT CCAGTTGTGC AAGTTGCCGT TGAAGTCAAG
AACGCTAACG ACTTGCCAAA GTTGGTTGAA GGTTTGAAGA GATTGTCCAA GTCCGACCCT
TGTGTCTTGT GTACCATCAA CGAATCCGGT GAACACATTG TTGCCGGTAC CGGTGAATTG
CACTTGGAAA TCTGTTTGCA AGATTTGGAA AACGACCACG CTGGTGTTCC ATTGAAGATT
TCCCCACCTA TTGTTTCCTA CAGAGAAACC GTTGAAGGTG AATCCTCCAT GGTTGCCTTG
TCTAAGTCGC CAAACAAGCA TAACAGAATC TACGTTAAGG CTCAACCAAT CGATGAAGAA
GTTTCCCTTG ACATCGAAGC AGGTGTTGTT AACCCAAGAG ATGATTTCAA GGCCAGAGCT
AGAGTTTTGG CTGACAAGCA CGGCTGGGAT GTTACTGACG CCAGAAAGAT CTGGTGTTTC
GGTCCAGACG GTACTGGTCC TAACGTTGTT GTTGACCAGT CCAAGGCTGT TCAATACTTG
AACGAAATCA AGGATTCCGT TGTTGCTGCT TTCCAATGGG CTACCAAGGA AGGTCCTATC
TTCGGTGAAA CCGTCAGATC CATCAGAGTC AACATCTTGG ATGTTACCTT GCACGCTGAT
GCTATCCACA GAGGTGGTGG TCAAATCATC CCAACCATGA GAAGAGTTAC TTACGCTTCC
ATGTTGTTGG CTGAACCAGC CATTCAAGAA CCAGTCTTCT TGGTTGAAAT CCAATGTCCA
GAAAACGCCA TTGGTGGTAT CTACTCTGTC TTGAACACAA AGAGAGGTCA AGTTATCTCT
GAAGAACAAA GACCAGGTAC CCCATTGTTC ACTGTTAAGG CCTACTTGCC AGTTAACGAA
TCTTTCGGTT TCACCGCTGA CTTGAGAAAG TCTACTGGTG GTCAAGCTTT CCCACAATTG
ATTTTCGACC ATTGGTCCGT CTTGAATGGT GACGTTACCG ACCCTAACTC CAAGCCAGGT
GCCATTGTCA AGGCCAAGAG AATCAGACAA GGTATGAAGC CAGAAGTTCC AGGTTACGAA
GAATACTACG ATAAGTTGTA GGTATGATGG TCTTTATCAA TAAATTCCAA AAAGAGAGAG
AAGGCAGTTT TTTGCTGTTC ATTTTTGTTT TGTATATCTG TTAATATAGT CATTTCTCTA
TAAACTTATT GTTTCTAGTT CACACCAGTA TAAATACAAA TTTATATAAA ATTC
 
Protein sequence
MVAFTIEQIR ELMDKVTNVR NMSVIAHVDH GKSTLTDSLV QRAGIISAAK AGEARFTDTR 
KDEQERGITI KSTAISLYAA MTDDDVKEIK QKTEGNSFLI NLIDSPGHVD FSSEVTAALR
VTDGALVVVD CVEGVCVQTE TVLRQSLGER IKPVVIINKV DRALLELQVT KEDLYQSFAR
TVESVNVIIS TYVDPAIGDC QVYPDKGTVA FGSGLHGWAF TVRQFASRYS KKFGVDRLKM
MERLWGDSYF NPKTKKWTNK DKDADGKQLE RAFNMFVLDP IFRLFAAIMN FKKDEIPTLL
EKLEISLKGD EKELEGKALL KVVMRKFLPA ADALLEMIII HLPSPVTAQA YRAETLYEGP
SDDASCTAIR NCDPKADLML YVSKMVPTSD KGRFYAFGRV FAGTVRSGQK VRIQGPNYQV
GKKEDLFLKS IQRTVLMMGR FVEAIDDCPA GNIVGLVGID QFLLKSGTIT TSDASHNMKV
MKFSVSPVVQ VAVEVKNAND LPKLVEGLKR LSKSDPCVLC TINESGEHIV AGTGELHLEI
CLQDLENDHA GVPLKISPPI VSYRETVEGE SSMVALSKSP NKHNRIYVKA QPIDEEVSLD
IEAGVVNPRD DFKARARVLA DKHGWDVTDA RKIWCFGPDG TGPNVVVDQS KAVQYLNEIK
DSVVAAFQWA TKEGPIFGET VRSIRVNILD VTLHADAIHR GGGQIIPTMR RVTYASMLLA
EPAIQEPVFL VEIQCPENAI GGIYSVLNTK RGQVISEEQR PGTPLFTVKA YLPVNESFGF
TADLRKSTGG QAFPQLIFDH WSVLNGDVTD PNSKPGAIVK AKRIRQGMKP EVPGYEEYYD
KL