Gene PICST_70551 details

Gene Information

Locus tag: PICST_70551
Symbol:
ID: 4836963
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 347485
End bp: 350530
Gene Length: 3046 bp
Protein Length: 974 aa
Translation table: 12
GC content: 47%
IMG OID: 640388278
Product: predicted protein
Protein accession: XP_001382298
Protein GI: 150863731
COG category: [K] Transcription
COG ID: [COG5164] Transcription elongation factor
TIGRFAM ID:
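The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is an assumption that happens to agree with the reported gene length. It also estimates how many bases of the reported span are not needed to encode the 974 aa protein plus one stop codon, assuming an unspliced CDS as the record states. The constant and function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 347485
END_BP = 350530
GENE_LENGTH_BP = 3046
PROTEIN_LENGTH_AA = 974


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


# Codons for every residue plus one stop codon, three bases each.
coding_nt = (PROTEIN_LENGTH_AA + 1) * 3
# Bases in the reported span beyond what the protein and stop codon require.
flanking_nt = GENE_LENGTH_BP - coding_nt

print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)  # True
print(coding_nt, flanking_nt)  # 2925 121
```

Under these assumptions the listed gene sequence is 121 bases longer than the coding region alone, so it likely includes some non-coding sequence on either side of the open reading frame.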


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.770055
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CCATCAGCCC GAACTTCACG ATGTCCAGTG AGGAAAATGT CAAACGTGAG CAGGATAATG 
TCCTCAAACA CGAACACAAT GAAGAGGATG TTTACAATGT GGGCGAACGA GATCAGGACG
AGATCGATGA GGAAGTGGAA GAAGATGAGG AACAGAAGCT AGATATCCAG AAAGAGACAG
GCCTCAAACG GACGAGATCC GAAGCAGAAA TTGACGTTGG AGACAACGAC GAAAACATCA
ATGAAAAGAA TAACAATGAA AACGAAGAAG GAGAAGATGA CGATGAAGAA GATGACGAAG
ATGAAGACGA AGATGAAGAT GAGGATGATG ATGGTGTGTC CACGGGAAGA AGAAGAAAGA
GAAGAAGAGC TGGAAACCAG TTCATTGATA TCGAAGCTGA AGTTGACGAC GAAGAAGAAG
ATGAGCTTGA CGAAGATGAC GAAGAAGCCG AACTTTTGCG AGAGCAGTTC ATTGCTGACG
ACAGACATGC TGGCGAGAAC GAAGCTGAGA GCCACGATGA CAGATTGCAC AGACAGTATG
ACCGTAGACG GCAAGAGGCC GAGGACCAGG ATGCTGAGGT ATTGGCCGAG ACCTTGAAGC
AACGTTATAG AAAGACCCAT ACCGTCTACC GTGGGGACAC TGCTGCCAGT GGTACTGTAT
CGCAGAAACT CTTGATGCCC TCTATCAACG ACCCATCCAT CTACGCTATT AGATGTACCC
CTGGCCGTGA GAAAGACTTG GTGCGTAAGC TTTACGAAAA GAAGAGAACT CTTGACAGGC
TGAATGCTCC GTTAGATATT CTCACAGTGT TTCAGAGAGA TGCCTTCAAG GGATACATCT
ATATTGAAGC CAAGAGACCC GATGCCATCG ACAAGGCGCT TGTCGGAATG GTGAACATCT
ATGTAAGAGA CAAACTCTTA GTTCCTGTCA AGGAATACCC GGACTTGCTC AAACAGGTCA
AATCGTCGGA TGTCGAGTTG GTGCCTGGTA TCTACGTCAG AATCACCAGA GGTAAGTACA
AGAACGACTT GGCAATTGTA GACAACTTGT CAGAAAACGG GCTTGATGTG CGTTGTAAGT
TGGTTCCTCG TTTAGACTAC GGTAAGTTCG ACGAGTTCGA TAAGGATGGC CGTAGAATTA
GACTGAAGGC AAGACCCTTG CCTCGTTTGT TCAGTGAACA AGAGGCTAGA CAGTACGATA
GAGAGTTCTT ACAACCTGGA AGAGGTCCTC GTTCGTACGT CTACAGAGGC GACGAATACA
TTGAGGGCTT CTTGTACAAG GACTTCAAGT TGCAGTTTAT CCAGACCAAA GATGTACATC
CAACGTTGGA GGAATTGGAC CGCTTCCAGA CAGGCAACAA CGAGGAAGAA GGATTTGATC
TTGCCGCAAT CGCTGCCTCG TTGAAGAACA AGAAAGGTGA AGGCAAGTCC ACTGCCTTCC
AGCCCGGTGA CAAGGTGGAG ATCCGAAGAG GCGAGCAAGC TAAAACTGTG GGTGTAGTGG
TGGAAGCCTC TTTGAACGAA ATCACTATCT CTGTTACTGA CAGTGGAGAC CCCAAGTTCG
TCAACCAGAA GTTGACCGTC CCTGCCAGCG ATTTGAGAAA GATATTCAAT GAAGGTGACC
ACGTAAGAAT CGTGGAAGGT AAGCATTTCG ACGAGACTGG GTTGGTTATC AAGATAGACG
GCGACTCGGT TGTTCTTGTC AGTGACCAGA CACGTGAAGA TGTCAGAGTG TTTGCCAACT
ACTTGGTGAA AGCTACTGAT GCCTCGCTGA ACATGGACAC CATTAACAGT AAGTACGATA
TCAAGGACTT GGTGGAGTTG AATGCCGCCT CTGTCGGTGT CATTGTCAAG GCCGAAAAGA
ACATCTTTGA AGTGTTGACC TCTGACGGAA GGTTATTGTC GGTGAAACCC AGCGGAATAT
CGTCGAAGTT GAAGATGAGT CGTCGTGAAC AGATCGCTAC TGACAGAAAT GGGTTCACGA
TCAAGATCGG TGACACCGTC AAAGAGGTCT TGGGTGACAA GAAGCGAGAA GGTGCCATTT
TACATATCTA CAAGAACTCG TTGTTCATCA AGTCTAACGA GATTGTCGAG AATTTGGGTA
TCTTTGTCAC TAACCGTATG AATGTTAGCA CTATTTCCAC CAAGGACTCC ATGGTATCCA
AGAATTTGGG CCCTGACTTG ACTTCCATGA ACCCCAACTT GAAGCTTCCT AATCCTTCCG
CTGGTGGTTT CAAACCCCGT GCTGGTGGCC GTGAAAAGTT GTTGTACAAG GATGTTGCTG
TGACAAGTGG ATCCTACAAA GGTTTGAAGG GTAAGGTGAT TGAAACCGAC GATGTCTACG
CCAGAATCGA ATTGCACACG AAGAGTAAAA AGATCAAGGT AAACAAGAAT AACTTGAACG
TATTGATCCG CGGAGAAGCA ATACCATACT TGAGATTCAT TGGTGCAGCA CCTTCGGAGT
CTCGCGAGTT CAACAAGCCC AATGCGCCAG CATTTACTTC TGGCGAGAGA TCGTCGTGGA
ACGGAAATGC AACACCTGGC GTGGGTGCAA ATTCCGCCTG GGGAGGAGCT TCTTCCAGTT
GGGGAGGAGC TTCTTCTGCC TGGAACGGTG GAAAGACCCC AGCTTACAGT GGAGGAAACT
CTACCTGGGG AGGAGCTGCT TCTACTTGGA ACGGCGGAAA AACTCCAAAT GCTGGAGGAA
CTTCTGAATG GGGTGCCAAT GGAAGCAATT CTACCTGGGG CTCTTCCAGA GGTGGGGGTT
CTACTTGGGG CTCTTCCAAC AGAGGAGGAA ATTCTACGTG GGGATCCTCC GGAGGTAACA
CTAGTAACCC CAGCAATAAA AACAACAATA ACAATAGCAG TGGCAGTAAC TCTGCTTGGG
GTGGCAACAA CTCTACGTGG GGTGGCCAAA ACAAAGGGAA TAGCAGCACT TGGGGAAGAC
AATAAACTGC TGAAACATAC TATTTGTCTA CCAGCTAGCT TCCATTCTTT ACAGCTTTAT
GTACTTTAGC CAGCTTATCT TGTATACTAA TAAAAAAGAA GATATG
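The gene sequence is printed in groups of 10 bases, 60 per line, so the whitespace has to be removed before the sequence is measured or analysed. The following is a minimal sketch of that clean-up together with a simple GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the record does not say how its own GC content figure was computed.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in a sequence (0.0 for an empty one)."""
    seq = clean_sequence(text)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "CCATCAGCCC GAACTTCACG ATGTCCAGTG AGGAAAATGT CAAACGTGAG CAGGATAATG "

print(len(clean_sequence(first_line)))  # 60
print(f"GC fraction of first line: {gc_fraction(first_line):.2f}")
```

Applying `clean_sequence` to the full block and taking its length gives a direct check against the Gene Length field.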
 
Protein sequence
MSSEENVKRE QDNVLKHEHN EEDVYNVGER DQDEIDEEVE EDEEQKLDIQ KETGLKRTRS 
EAEIDVGDND ENINEKNNNE NEEGEDDDEE DDEDEDEDED EDDDGVSTGR RRKRRRAGNQ
FIDIEAEVDD EEEDELDEDD EEAELLREQF IADDRHAGEN EAESHDDRLH RQYDRRRQEA
EDQDAEVLAE TLKQRYRKTH TVYRGDTAAS GTVSQKLLMP SINDPSIYAI RCTPGREKDL
VRKLYEKKRT LDRSNAPLDI LTVFQRDAFK GYIYIEAKRP DAIDKALVGM VNIYVRDKLL
VPVKEYPDLL KQVKSSDVEL VPGIYVRITR GKYKNDLAIV DNLSENGLDV RCKLVPRLDY
GKFDEFDKDG RRIRSKARPL PRLFSEQEAR QYDREFLQPG RGPRSYVYRG DEYIEGFLYK
DFKLQFIQTK DVHPTLEELD RFQTGNNEEE GFDLAAIAAS LKNKKGEGKS TAFQPGDKVE
IRRGEQAKTV GVVVEASLNE ITISVTDSGD PKFVNQKLTV PASDLRKIFN EGDHVRIVEG
KHFDETGLVI KIDGDSVVLV SDQTREDVRV FANYLVKATD ASSNMDTINS KYDIKDLVEL
NAASVGVIVK AEKNIFEVLT SDGRLLSVKP SGISSKLKMS RREQIATDRN GFTIKIGDTV
KEVLGDKKRE GAILHIYKNS LFIKSNEIVE NLGIFVTNRM NVSTISTKDS MVSKNLGPDL
TSMNPNLKLP NPSAGGFKPR AGGREKLLYK DVAVTSGSYK GLKGKVIETD DVYARIELHT
KSKKIKVNKN NLNVLIRGEA IPYLRFIGAA PSESREFNKP NAPAFTSGER SSWNGNATPG
VGANSAWGGA SSSWGGASSA WNGGKTPAYS GGNSTWGGAA STWNGGKTPN AGGTSEWGAN
GSNSTWGSSR GGGSTWGSSN RGGNSTWGSS GGNTSNPSNK NNNNNSSGSN SAWGGNNSTW
GGQNKGNSST WGRQ
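The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below builds a codon table from the standard amino-acid ordering, overrides CTG, and translates two short stretches copied from the gene sequence above. The function names and the 20-base offset to the first ATG are assumptions made for illustration, not part of the record.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in NCBI order (codons TTT, TTC, TTA, TTG, TCT, ...).
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg="S"):
    """Standard codon table with CTG reassigned (serine in table 12)."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    table["CTG"] = ctg
    return table


def translate(dna, table):
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


table12 = codon_table("S")   # alternative yeast nuclear code
standard = codon_table("L")  # standard code, for comparison

# First line of the gene sequence; the ATG is assumed to start at offset 20.
first_line = "CCATCAGCCC GAACTTCACG ATGTCCAGTG AGGAAAATGT CAAACGTGAG CAGGATAATG"
cds_start = "".join(first_line.split())[20:50]
print(translate(cds_start, table12))  # MSSEENVKRE

# In-frame stretch from the gene sequence that contains a CTG codon.
ctg_stretch = "GACAGGCTGAATGCT"
print(translate(ctg_stretch, table12))   # DRSNA
print(translate(ctg_stretch, standard))  # DRLNA
```

The first output matches the opening residues of the protein sequence, and the `DRSNA` result matches the `LDRSNAPLDI` stretch of the listed protein, which is what one would expect if CTG is decoded as serine here.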