Gene PICST_62972 details


Gene Information

Locus tag: PICST_62972
Symbol: SEF2
ID: 4840451
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 650572
End bp: 652584
Gene Length: 2013 bp
Protein Length: 649 aa
Translation table: 12
GC content: 38%
IMG OID: 640391766
Product: putative transcription factor
Protein accession: XP_001386136
Protein GI: 150866507
COG category:
COG ID:
TIGRFAM ID:
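
The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the protein length excludes the stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 650572
end_bp = 652584
protein_length_aa = 649

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the protein length excludes the stop codon, so add one codon for it.
coding_length_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that are not accounted for by the coding sequence.
non_coding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, non_coding_bp)  # prints: 2013 1950 63
```

Under these assumptions the computed span matches the listed gene length of 2013 bp, and the 63 bp left over is consistent with the gene being marked as spliced.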


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.150077
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAACAC AACTGAAAAA CCGTCTAAGG TTGATAATAC CGGCGCTGCT GTCTGGCTCC 
AAGGTGCTCA AGTCTTGCAT TCGCTGCCGC AAACATAAAA CCAAATGTAA TGCTCTGGTA
ACCAATCCTT TGCCTTGCAC TCACTGTGCC AAACACAAGA TCAACTGTGT CTTGGAAGTG
ATAACACCGC TGACTAACAG ATCGACTATC GATTTGGCTG AAAAACTTGC TGACGAAGTC
TCCGACTTAA AACAGGTCAT GGCAAAGATA ATCTCCAGAA GAAATGCCTT GTTCGAAAAA
CTAGCCAGCA GCGGACTTGA CATTCAAAGT ATAGCACAAC AGCAAAAGCA AAAGAACCAG
CTCTGTAGAC GAGCCCGCTC GGAGACCCCA CCAGTATCCG AAATCCAATC TTGCATCAAT
ACCCCTCAAG ACTTCGCCAT ACCGTTGACA GCCCCTGTGG ACTATGGCTT ACAGGAAAAC
GACAACGAAC CTGTATTTTC CATTTCTGCA AATAAAAGCT TGCAATCGTT TGCTATTTCG
AACGCAACTG CATCAAGACT CTTTGCTAAC TACGAACGGA ACTTCAACCA GTTTTTGCCC
ATTTTTCCAG ACAACTTCTT CAAATCGATA AATTTGAAGA CGTTTGCAAA CGAGAATGAC
TTGCTTTTCT GGTGCATAAT CTTGACTTCA TATTTAAACA ACCCCGTAGA CCAATCGGCT
CCAAGCTACC GTATCTTGTC TGAACACATC AAATCTTTAG TTGTCGAGAA ATGCTGGCTC
CAAACTCCAA GGTCAGTGTA CGTCATCTCG TCGTTGTTGA TTTTGACCAC TTGGCCATTG
CCCAACACTA GTTCCAAAAT CTCTGACAAC TTGTGTATCA AGTTCATTTC GACAATGAAG
GCATTGTCGT TACAATTTGG ATTGCACAAG TTGGAGTTCA TCAATGAATT CAGTCACAAG
ACCAAAATGA ACATCTCACA AGAGGTCAAT CTTGACAACC TAATCAGAGA AAGAATATAT
AAGTTCATCA ACATCAACTC CAACTACTGG CTCATAAACT TGGGTTTATC AAACAATAAT
TACAACGGAT TTACCCAGGA CTACATCATC AACAAATCCT CTAACATAGA CATCTTGAAC
AAGACGTCTG AAGGTGATCA TTACATAAAC TCACTCTTGA AGATCTCTAT GATCCAATCA
AAATTGAACG AAAACATGAA TATTTTGATA GGAAATAACA GCGAGTCGGT TTCGTTATTG
CCTAACCAGT TGAATACGTC CAAATTGATT AACTTCAATA TGTTTGAAAT CATCATTGAT
GACTTGAACA AGATATTGGT CAGAGATGAC AATTCACTTG TATTGAATAA TTTGATCAAG
ATTTCAATTG AGTTCTCCAA ATTGCAATTG TTTGTGTATT CCTTATCTAA GTCTGATATC
ACAATTTTGG AATACAAACA TTATATTACA AAAGTGTTGA AAAGTTGTTT TATCATCTTC
TCATCGCAAT TCCAGGACAA TTCGTTGAAC TTCAACCAAT TGCCTATTCA CTATAAGTTT
CCTATTGAAT TGGCTATGTT GATCATGCTC AGAGTATTCA AGTCTCCAAT CATGAACTCG
ATCTCAGACT ACAAGCTTGT CAAAGAAAAG TTCAACCAGA TGTACGACAA CATCATCATG
GGTGGTAAGA ACAACGAGGA CTGGGAATTT TTGAATGCTA GATTGAATAA GGTTCTTCAC
AAGTTCAACA AAATAGACAA CAAGTTTATC ATCCTGAAAA TGACGAGACA TAATGACAAT
GACAAAGTTG TACCTTCATT TTTTCTTATA AATAAGATGA AGAGCTATCT CATTGCAAGT
TTGAACTACG AGATGATCTG GCTCATCTAC GAAAACGAAC ACACGCAAAC TACAATAAAT
AACGAAGAAA TTAACTGGGA TGTTTTTGGG ATCAAAGACA ATAGAATGGA TATGATTGAT
TACTTGCAAA GTAACGAATC TATATTTTAC TAG
 
Protein sequence
MQTQSKNRLR LIIPASSSGS KVLKSCIRCR KHKTKCNASV TNPLPCTHCA KHKINCVLEV 
ITPSTNRSTI DLAEKLADEV SDLKQVMAKI ISRRNALFEK LASSGLDIQT RSETPPVSEI
QSCINTPQDF AIPLTAPEND NEPVFSISAN KSLQSFAISN ATASRLFANY ERNFNQFLPI
FPDNFFKSIN LKTFANENDL LFWCIILTSY LNNPVDQSAP SYRILSEHIK SLVVEKCWLQ
TPRSVYVISS LLILTTWPLP NTSSKISDNL CIKFISTMKA LSLQFGLHKL EFINEFSHKT
KMNISQEVNL DNLIRERIYK FININSNYWL INLGLSNNNY NGFTQDYIIN KSSNIDILNK
TSEGDHYINS LLKISMIQSK LNENMNILIG NNSESVSLLP NQLNTSKLIN FNMFEIIIDD
LNKILVRDDN SLVLNNLIKI SIEFSKLQLF VYSLSKSDIT ILEYKHYITK VLKSCFIIFS
SQFQDNSLNF NQLPIHYKFP IELAMLIMLR VFKSPIMNSI SDYKLVKEKF NQMYDNIIMG
GKNNEDWEFL NARLNKVLHK FNKIDNKFII SKMTRHNDND KVVPSFFLIN KMKSYLIASL
NYEMIWLIYE NEHTQTTINN EEINWDVFGI KDNRMDMIDY LQSNESIFY
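
The record lists translation table 12, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates this on the first 60 nucleotides of the gene sequence only. It is not the database's own pipeline. The amino-acid lookup string is assumed to match NCBI translation table 12 in TCAG codon order and should be checked against the NCBI listing before reuse. The function name `translate_table12` is hypothetical. Because the gene is marked as spliced, translating the full gene sequence in one pass is not expected to reproduce the protein sequence.

```python
from itertools import product

BASES = "TCAG"
# Assumed amino-acid row for NCBI translation table 12 (codons in TCAG order).
# It differs from the standard code at CTG, which is read as S instead of L.
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate_table12(dna: str) -> str:
    """Translate DNA codon by codon with the table above, stopping at a stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE_12[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 nucleotides, spaces as in the record).
first_60_nt = "ATGCAAACAC AACTGAAAAA CCGTCTAAGG TTGATAATAC CGGCGCTGCT GTCTGGCTCC"
print(translate_table12(first_60_nt))  # prints: MQTQSKNRLRLIIPASSSGS
```

The output matches the first 20 residues of the protein sequence above. The three CTG codons in this stretch appear as S, where the standard code would give L.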