Gene PICST_62201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_62201 
SymbolUGA3.2 
ID4840309 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp404050 
End bp405972 
Gene Length1923 bp 
Protein Length576 aa 
Translation table12 
GC content39% 
IMG OID640391624 
Productputative transcriptional regulator 
Protein accessionXP_001385430 
Protein GI150865985 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0514301 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCTTG TTTTAGACGT CAATTCGTTG GAGACAGTCG AATCCAGAAG CTCTCCTACT 
AGTGCTCACA TAGGAAGCTA TAGCTCAAAG AGAATCCTCT CGAGAAACAT AATCCTCGGC
CCGACAAAGA GATCTCGCAA TGGATGTTTG AATTGCAGAA AGAGAAAGAA GAAATGTGAC
GAAAGTTTTC CCACATGCGG TTCGTGTAAG TATCGAGGAG CTGAATGTAT CTGGAGAGAC
TCTACTAAGT TCAAGATGAA GAGGTATTCC GATTCCAGAG ACTCTACAAA GAAGTCAGCA
GTTGTGCGGC ATGTCTCAAC GAAATCTCCG TCCGAAGAAC TATCAGAAGC GATTGTTGAA
TCACAGGATA AGATGGAACT TTCAAAAGGC ATTATAGAAC TTGTTAATGA TGAGATAGAT
CAAGTGAGTC ATACTGACAT CTTGAACGCA GCTGAATTGG ACCAATTCGA GAATGTAGAA
GACGAATTGT TTACGAACAA CAATCATGAA TTGACACCTT ACACAACAGA CTTGCGATTG
GATACAAGTG AATTCACGAC CAAGAACATA GATTTCCTAA TGTCTGATGA TTTTAGCGAC
TTCCCCTATT TATCACCTAC TACAACACCA ATATTCAATC CGTTTAGACA CCTCGACGAT
AAAGCAAAGT ATTTCCTTGA CGGATTTATT CATAAGGTGG CACGTAATTT ATGTATAGGG
CCAGACTGGT GCAATTACTT TCTCAAAACG TTCTATCAGA TGGCAGAACA AGACAAATCT
GTTCTGTTCG CATTGGCCTC TTGGGGAGGT TTGTTCCTCG AAGGAAGCAC CGACGCAACT
AAGTCATATA TGATCAAGGC ATATAAATCG ATTACAGAGA GATTTCCTAA CTTCAACGAA
CTCAGCAAAG AGGACATTTA TATCTTGCTC AACTTTTTTT TGATAGGCAT AGGTGTTCAT
GTTTGTGCTG GAGATGTTTC TCAGTGGAAT ATCTTATTTA AGCAGTGTAT TGAAGTGATT
CAAAAGAACG GCGGTTTGTC AGAAATATGT CGCATGTTTG ACTATTCCAA TGATATTAAG
TGGTTAATAT CCGATGTACA GTTTCATGAT ATAATGTCCT CAAGAGCATT TTCTAAAGGA
ACAATCTTAC CAATGGAAGA GTACAATACC ATTTTCCAAC GCAACAAGAT CTTGGAGCTA
GGTAATTATG GTTTGGACCC ACTCCAAGGA TGTATCCAAC CTATTTATTT GTTGTTAGGC
GAGATTCTAC AGGTTTCATC TGATCTAAAG TCTAAGAAGA AACATATCAA TAAACTCTTA
GAGGATGCAC GAAAAGCATA CAACGATAGT GATAAAACTA ATCCAGATTT ACTCAACACT
GCTACAGTTC AAGGAGAAGT AATAAACTTG CGCATATTGC GACAAAACTT CTATAATGAA
ATGGAGGAAG TTATCGATCA GTTGAAGGAA AAATTGAAGC GGTGTCAACC AAATATACAG
CAAATGGAGC CAATTATCGA TGACAAACAC GAAGTCGAGT TGCATCTTAC TTTGTTTGAG
GTTTATCTGT ATACCTGTCA ATTATCGATG AATTATCAGA TCAAAGGAAT GCCGGCTTCG
TCAGCAGAGA TGCAGTCAAT ATTGGTCAAT GCTGTGAGCT GTATCGATAT TCTCGTAGAT
ACGAAATTGG TGTCTTCATT ATCTTTGCTG TTACTCTTGT GTGGTATCAC ATGTTGTACA
GCCACTGATA GACTAGATAT GGAGGTGCGA ATAAAGAAGA TCCAGCTGGC ATACGAGGTT
GCCAATCTTA CCAGAATGGT TGATATCATC AAAGAAGTAT GGAAGAGGAA TAGTAACGGA
AACGTATGTA TAGATTGGGT GGAGGTGTGC AACGAGAAGG ACTGGAATCT TTCTGTATGC
TAA
 
Protein sequence
MYLVLDVNSL ETVESRSSPT SAHIGSYSSK RILSRNIILG PTKRSRNGCL NCRKRKKKCD 
ESFPTCGSCK YRGAECIWRD STKFKMKRYS DSRDSTKKSA VVRHVSTKSP SEELSEAIVE
SQDKMELSKG IIELVNDEID QVNLRLDTSE FTTKNIDFLM SDDFSDFPYL SPTTTPIFNP
FRHLDDKAKY FLDGFIHKVA RNLCIGPDWC NYFLKTFYQM AEQDKSVSFA LASWGGLFLE
GSTDATKSYM IKAYKSITER FPNFNELSKE DIYILLNFFL IGIGVHVCAG DVSQWNILFK
QCIEVIQKNG GLSEICRMFD YSNDIKWLIS DVQFHDIMSS RAFSKGTILP MEEYNTIFQR
NKILELGNYG LDPLQGCIQP IYLLLGEILQ VSSDLKFTQH CYIINLRILR QNFYNEMEEV
IDQLKEKLKR CQPNIQQMEP IIDDKHEVEL HLTLFEVYSY TCQLSMNYQI KGMPASSAEM
QSILVNAVSC IDILVDTKLV SSLSLSLLLC GITCCTATDR LDMEVRIKKI QSAYEVANLT
RMVDIIKEVW KRNSNGNVCI DWVEVCNEKD WNLSVC