Gene PICST_59919 details


Gene Information

Locus tag: PICST_59919
Symbol: EKI1
ID: 4839264
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 1613777
End bp: 1615408
Gene Length: 1632 bp
Protein Length: 526 aa
Translation table: 12
GC content: 44%
IMG OID: 640390579
Product: ethanolamine kinase
Protein accession: XP_001385006
Protein GI: 150865683
COG category:
COG ID:
TIGRFAM ID:
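
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the Start bp and End bp values are 1-based and inclusive, which the record itself does not state. The function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span whose start and end positions are both included.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
START_BP = 1613777
END_BP = 1615408
GENE_LENGTH_BP = 1632
PROTEIN_LENGTH_AA = 526

gene_span = span_length(START_BP, END_BP)
coding_bp = PROTEIN_LENGTH_AA * 3        # bases needed to encode the listed protein
extra_bp = GENE_LENGTH_BP - coding_bp    # bases in the gene span not used as codons

print(gene_span == GENE_LENGTH_BP)  # True
print(extra_bp)                     # 54
```

Under that assumption the span matches the listed gene length. The 54 bp not accounted for by the 526 codons would be consistent with the record marking the gene as spliced, although this sketch does not locate any intron.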


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.671475
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.729563
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTCCT CAGACCTCCA GCAGTCTTCC TCGGCCTACT GCTTCATAGC GAACAAAGAC 
GTCCACTGCA TCCCCATCAA CTCGTTCAAC AGCTACTCCG ACTCTTTCCT AGCCTTGGAC
AATACCAAAC ACCAAAGGGA CGATGCAGAC TTCACACCCG ACAAACTCTA CTTGAAGAAC
AGAAACTTGC TCTACAGCCT CAACAACCCT TCACTTTCAG ATGTCGAATT CGAACCAAAC
TCTCCAACTC CACTCCACTC CACACTTAAC ACTGCCAATA ACAATACCCC TATCAATGGT
AACCATAATG GTAACAATAG CCATTCGAAT TCCACTAGCT TGTCCGTGCC GTCGACTTCT
TCTTCTTTCC GTAAATCCAT CAGTACCACC CCCAGTCCTC CGGCTTCCGA CGTTGACCTC
TCGGCTATCA TCAATAATTA CGCGACTCAC GCCGTATACT TTCCCAAACT CATAGTCAAC
TTGTCCGAGA ATTTGAACAA CAACTTCCAG GACTTGAAGA CCTTATTGGT GAAAATCTTT
CCCACTTGGA GTAACAAAGA TGAGATCACT TTGAAACAGC TTACTGGTGG TATCACCAAC
ATGTTGCTTA GATGTTCCTA CAAGCCGTTG CAGGAAACTG TACTCATCAG AGTATATGGC
CACGGAACCA ACTTAATTAT CGACAGACAC CGTGAGTTCA TTCTGCACTT GATTCTCAAC
TCCATCGGCT TGGCTCCTCC CATCCATTCC AGATTCAAGA ACGGATTGAT CTACGGCTAT
CTCTCTGGCC GGTCATTGGA ATCGTCCGAA TTGTATAGCC CCAACTTGTA TCCCTTAATC
GCACAACAGC TCGGAAATTG GCACAACCAG TTAGACTACC GCTTGATCCA GAACGGTGTG
GAAAAGATCA GAACCTTTTC CATGAGCTTG AAGTCGAAAA CAAAAAGAGA CAGCATAAGT
AATGGTTCTA CAAAGAAGAG ATATAAGAAG AAATTCATCT CCAACATATG GGAATTAATA
GAAGACTGGA TCAATATTGT GCCTGTGAAC CCGGACTTGA TATCGTCGTT TAACTCCAAT
TTGAGCCATG AAGTCACCGC TGAAAACCTC AAGAGTATCA TTACCGAGGA GTTTGAATGG
TTGAAGGAAA ACTTGATCAA TTCAAATTCA CCTGTAGTAT CTTCACACTG CGATTTATTA
TCTGGAAATG TGATCATCCC AGACGACCTC GATATCAAGA AACCTTTACA TTCCTTACCA
ACTATTGAAA AGAACCCTAT CAAATTCATA GACTACGAGT ATATGTTACC AGCACCTCGT
GCTTTCGATA TTGCCAACCA TTTGGCAGAA TGGCAGGGAT TTGATTGTGA CAGATCCGTC
ATCCCCACAC CTCACATAAG CAACCCTGTT TTAGTGAAAT GGGTGAAAGG ATATCTTAAC
GACGAAAACG CGGATATGGA TAAAGTCGGC AGCTTGATAG AAGAAATCGC TACCTTCTAT
GGTTTGCCAG GTTTCTACTG GGGTATCTGG GCCATGATCC AAAGCGAGTT GTCAAATATC
GACTTTGATT ACTCTAAGTA CGGAAAGTTG AGACTAGAAG AGTATTGGGT CTGGAAAGGA
CATTTCTTGA AA
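
The gene sequence is shown in space-separated blocks of ten bases, so it needs cleaning before any length or composition check. The following is a minimal sketch of that cleaning step plus a GC fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the record's own GC content value was computed or rounded is not known, so the result may not match it exactly.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGGATTCCT CAGACCTCCA GCAGTCTTCC TCGGCCTACT GCTTCATAGC GAACAAAGAC"
seq = clean_sequence(first_line)

print(len(seq))                       # 60
print(round(gc_fraction(seq) * 100))  # percent GC for this one line only
```

To check the whole gene, the full blocked sequence would be pasted into a single string and passed through the same two functions.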
 
Protein sequence
MDSSDLQQSS SAYCFIANKD VHCIPINSFN SYSDSFLALD NTKHQRDDAD FTPDKLYLKN 
RNLLYSLNNP SLSDVEFEPN SPTPLHSTLN TANNNTPING NHNGNNSHSN STSFPPASDV
DLSAIINNYA THAVYFPKLI VNLSENLNNN FQDLKTLLVK IFPTWSNKDE ITLKQLTGGI
TNMLLRCSYK PLQETVLIRV YGHGTNLIID RHREFISHLI LNSIGLAPPI HSRFKNGLIY
GYLSGRSLES SELYSPNLYP LIAQQLGNWH NQLDYRLIQN GVEKIRTFSM SLKSKTKRDS
ISNGSTKKRY KKKFISNIWE LIEDWINIVP VNPDLISSFN SNLSHEVTAE NLKSIITEEF
EWLKENLINS NSPVVSSHCD LLSGNVIIPD DLDIKKPLHS LPTIEKNPIK FIDYEYMLPA
PRAFDIANHL AEWQGFDCDR SVIPTPHISN PVLVKWVKGY LNDENADMDK VGSLIEEIAT
FYGLPGFYWG IWAMIQSELS NIDFDYSKYG KLRLEEYWVW KGHFLK
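
The record lists translation table 12, the alternative yeast nuclear code in NCBI's numbering, which differs from the standard code in reading CTG as serine rather than leucine. The sketch below builds that codon table from the usual 64-letter amino acid string in TCAG order and translates the first 30 bases of the gene sequence above. The names `CODON_TABLE` and `translate` are hypothetical, and the code is an illustration, not the database's own translation procedure.

```python
BASES = "TCAG"

# Amino acids for the 64 codons in TCAG order under NCBI translation table 12.
# Identical to the standard code except that CTG encodes S instead of L.
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: TABLE_12_AAS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence codon by codon using table 12.

    Whitespace is ignored and trailing bases that do not fill a codon are dropped.
    Codons containing letters other than A, C, G, T raise KeyError.
    """
    seq = "".join(seq.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence above.
print(translate("ATGGATTCCT CAGACCTCCA GCAGTCTTCC"))  # MDSSDLQQSS
print(translate("CTG"))                               # S
```

The ten residues obtained match the start of the protein sequence above. Because the record marks the gene as spliced, translating the full gene sequence directly in one frame is not expected to reproduce the whole protein.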