Gene PICST_77168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_77168 
SymbolGIN1 
ID4837929 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp139082 
End bp142289 
Gene Length3208 bp 
Protein Length907 aa 
Translation table12 
GC content43% 
IMG OID640389244 
Producttranscriptional regulator 
Protein accessionXP_001383312 
Protein GI150864478 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TCTGTCCCGT GTCTGTCCGC TGCTGTACAC CGTTAGTTTT GCTCTAGAAG CAGCCGTCTT 
TTTCGTGGTA GTTCACTCGT GGAAATTGTC ACATTCGTAA TTTTTGACAT ATTTCACAGA
TTTGTCACAG ATTTCTCACG TAGTCTTTCA AAACAGATTC TACGCTGACG CCAGCGACCT
GAAAGCAGTG CCATTCGAGT CAATTTCATT GTGCCACTGC GCAACTGCCA ACCTATAATT
ACAGCCAAGT TGGTGTTCTG AATCAGTATC TCATCGCATC ATAATCAGAG ACTTAAGTTC
AGAATAAGAA ATACAAGCTT AAAGCCTCTT ACTTTCCCAC AGCCATTCTC TTCTATTTTC
AAAGCGTTCG AGTTTTTCCG TCATGATCAA AAGAGAGGGC GAACCGTTCC CCGACTACTC
GCACCCGTCA GAAGAGTCCG ACAACGAGTT CCAGCTGTCG GGCCTTCTCC ATTCCACCTC
CATCGCTGCG TCCAACTCCT CCAACACATC GAACAACAAC AACGGAAACA GCAACCGAGG
AGACCAGAAA CGTAGACGAG TCACCAGAGC ATGCGACAAT TGCCGTCAGA AAAAGGTCAA
ATGTGATGGT AAACAGCCTT GTATCCATTG CACTGTTTAT TCGTACAAAT GCACTTATGA
TCAACCCAAC GTCAGAAACA AGAAGCATAG CGGAATACCC ATTCCGACCG CGCTGCCATC
TTCCGCAGTT TTAGCCGCCG CCGCTGCCAA CATTGCCAAT TCTAACGGAA GCAACGTTGG
CGACCTTTCT CAGACTTTGT CGGCAACATC TGCATATCCA GAGAGAAACC TATTGCTCGC
CCAAGGCATC ATCAAGTTGT TGCTTCCCAA GTTGAAGATC AACTGCTTCG ACCACAACTT
GCAGTTTGAC TTGGAAAAGT TGCAGAAAGT TGTCAACCAC ATCGACGGAA AGTCGTCTTC
CGCCTTGAAC GATATCAGTG ACGTCTACTT GGACAACGCT CCCTTACCAT CTCCCAGTAA
TCCAGGATCT ATTCGTCACG AACGGCAATC ATCCGTGTCT TCCTCTGACG AAACGACTAT
GGGAACAGAA ATCAAATTAT ATTTGCCCAA GAAAGAAATC GCCTTGGACC TCATCTACAC
CACATGGAAC AAGGCTTGTG TTTTGTTCAG ATTCTACCAT CGTCCATCGC TTCTAGAAGA
AGTAGACCTT CTTTACTCTT TGGACCCGTA CAATTACGGC GACAGACAGC AAAAATTCTT
GCCCTTCTTG TACTCGATTT TGGCCTGTGG ATCTCTTTTC TCCAAGTCAG CGACAACATC
TTCTGATCGT AACGAGAACT TGGAAGATGA TGGGTTCAAA TATTTCTTGG AAGCCCGTAA
GTTGATCGAT ATCAGCAATG TAGGCGATAT TAACTCCATC CAAACCATTG TCATGATGAT
AATGTATTTG CAATGCTCAG CCAGATTATC CACCTGTTAT TCGTATATTG GTATTGCCTT
GAGAAGTGCT CTCAAAGAAG GGTTGCATCG TAACTTGACC ATCTTCCAGA ACTCGCGAAG
GAAATTGGAC CCAATTGAAG TCGATACCAG AAAGAGACTA TTCTACACCA TCTACAAGAT
GGATATCTAC ATCAACTCCT TATTGGGCTT ACCCAGATCC ATCAATGAAG ATGAATTTGA
CCAGGATTTG CCTGAAGACT TGGATGACGA AAACGTCACT CGTACAGAGT ACTTGTACGA
CAAACAAGAA GGACGATTGT CGTCATCTGC ATGTGCTAAC CAACATACGA AGCTAATGCT
TATACTTTCA CATATCGTCA AGAAATTGTA TCCAATCAAA GTGAAGACCA CTGAAAAGGA
ATCGGTGACC CCAGACAGAA TACATTCTAA AGTTACTGAA TTGGAATTGG AATTGAAGAC
TTGGCTAGAT GCATTACCAA AAGAATTGAA ACCTACTGAT CCCAATGATA TCAACTCAGG
TAAAGAGATT CCAGAGAAAT TTGTATTAGC CAACTACTAC CTTCATTTGG CATTTTTGAA
CTGCCAAATC ATGTTGTACC GTCCCTTCAT CCATTTCGTC AGCGATGGAC CCAATAACTT
TCAAAAATCC GATCCTCGTT CGTTGATTAG AGGTAGAAAC TGCATCAAGG TAGCCAGAAT
GGTCGTGAAG TTGGCCAACA AGATGATCGA CAAGAACTTG TTGGTGGGAA CATACTGGTT
TTCTATGTAC ACTATTTTCT TCTCTGTTGC ATGTTTGATT TACTACTTCC ATTTCGCAAA
CTATAATAAT ACTCAAAGTG GTGGTGCCAA CTATGTTGGT GTTCTCTTTG ACGATGATTT
AAACATTGAT ATGATCAAAA AAGACATCGA GATCGGTAAG AAGGTGCTTG ATTGTCTTAA
GAACAACTCC AATGCTTCAT TGCGTATTTT CAACATCTTG AACACATTAT TTGAACAATT
GAATAGAAGA ACAGCCACCA CTTCACGTTT GAAGAATACT AGAGTGGTAG ACCCATATAT
CATTAACAAT AACTTGCAGA ACGAAAATGT TCAATCTACA TTCAAGAACT TTGACATCAT
GAATAACTTT AAGACAGGGG GCGCAAAATT GTCAGAATCC TCCAATGTGG TTACCAGCGA
TAAGAATCCT TATGCTCCGT ACACTCAACA AGATCCACTG AAGAAGTCCG AGGCTCTTGG
TCTGTTGTTT ACCGATACCT CGTTGGAAGC ATTCCACTCT TCTAATGATC CTGAGTTGGC
TGCGGTTGCC CCTACTGTAC CTTCACTTCC AAGTAGTCTT CCAACTATTG AAGAACAACC
TGGAATCACG AATTCAAATA CTACTGACGA AGACAGAAAC GACTATGTTC CTGGTGTATT
TGACAAGTTG GATGCTCAAA TCTTTGGCAA GATCTTACCT CCTTACATGT TAGAGAAGAA
TGCCCAGAGG TCACATGACA AGTCTGAACA AGAGCGTATT CCAGCACTTG CCAAAGCTGC
TCTTTCACAA TACCAGGCTC AGGTTCCTTC CCTGTTGAAT GACCAAGCCT TTGGCCTGGT
GAACTTAGAC GATTTTGATT TTTCTCTGTT GGGGGACCCT AACAATCTTG ACTACTTGGA
CCCGTTTAAT AATGAATCGC AGTTCCGTTA CTGAGAATTA CGAAATTTAG GTTTAACTAT
TTATTTGTAA TGTATATTAG TACCAGTT
 
Protein sequence
MIKREGEPFP DYSHPSEESD NEFQSSGLLH STSIAASNSS NTSNNNNGNS NRGDQKRRRV 
TRACDNCRQK KVKCDGKQPC IHCTVYSYKC TYDQPNVRNK KHSGIPIPTA SPSSAVLAAA
AANIANSNGS NRNLLLAQGI IKLLLPKLKI NCFDHNLQFD LEKLQKVVNH IDGKSSSALN
DISDVYLDNA PLPSPSNPGS IRHERQSSVS SSDETTMGTE IKLYLPKKEI ALDLIYTTWN
KACVLFRFYH RPSLLEEVDL LYSLDPYNYG DRQQKFLPFL YSILACGSLF SKSATTSSDR
NENLEDDGFK YFLEARKLID ISNVGDINSI QTIVMMIMYL QCSARLSTCY SYIGIALRSA
LKEGLHRNLT IFQNSRRKLD PIEVDTRKRL FYTIYKMDIY INSLLGLPRS INEDEFDQDL
PEDLDDENVT RTEYLYDKQE GRLSSSACAN QHTKLMLILS HIVKKLYPIK VKTTEKESVT
PDRIHSKVTE LELELKTWLD ALPKELKPTD PNDINSGKEI PEKFVLANYY LHLAFLNCQI
MLYRPFIHFV SDGPNNFQKS DPRSLIRGRN CIKVARMVVK LANKMIDKNL LVGTYWFSMY
TIFFSVACLI YYFHFANYNN TQSGGANYVG VLFDDDLNID MIKKDIEIGK KVLDCLKNNS
NASLRIFNIL NTLFEQLNRR TATTSRLKNT RVVDPYIINN NLQNENVQST FKNFDIMNNF
KTGGAKLSES SNVVTSDKNP YAPYTQQDPS KKSEALGSLF TDTSLEAFHS SNDPELAAVA
PTVPSLPSSL PTIEEQPGIT NSNTTDEDRN DYVPGVFDKL DAQIFGKILP PYMLEKNAQR
SHDKSEQERI PALAKAALSQ YQAQVPSSLN DQAFGSVNLD DFDFSSLGDP NNLDYLDPFN
NESQFRY