Gene PICST_48819 details

Gene Information

Locus tag: PICST_48819
Symbol: MIG2
ID: 4840200
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 1682023
End bp: 1683192
Gene Length: 1170 bp
Protein Length: 250 aa
Translation table: 12
GC content: 48%
IMG OID: 640391515
Product: DNA-binding protein (Carbon catabolite repressor)
Protein accession: XP_001386018
Protein GI: 150866422
COG category:
COG ID:
TIGRFAM ID:
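
The length fields in this record can be cross-checked against the coordinates. The following is a minimal Python sketch that uses only values listed above. The function name `span_length` is illustrative, the coordinates are assumed to be 1-based and inclusive, and reading the bases not accounted for by the protein as intronic is an assumption based on the record marking the gene as spliced.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record
gene_length = span_length(1682023, 1683192)

# Protein length (250 aa) times three bases per codon
coding_bases = 250 * 3

# Bases in the gene span not accounted for by the protein (assumed intronic)
non_coding_bases = gene_length - coding_bases

print(gene_length, coding_bases, non_coding_bases)  # 1170 750 420
```

The first value agrees with the listed gene length of 1170 bp.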


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
AAGAAGGATC CTTCCTCCAG ACCGTACAAA TGTCCTCTCT GCGAAAAAGC GTTTCACCGC 
TTGGAACACC AGACCAGACA CATCCGGACC CACACTGGCG AAAAACCCCA CGCGTGTACA
TTTCCTGGCT GCTTCAAGCG GTTTTCAAGA AGCGATGAGT TGACTCGTCA CTTACGGATT
CATACTAATC CAAACTCGAG AAGAAACAAA AACTTAAACA AACACAACAT CAACTACACC
AACAATCCGC GCAACATCAA GCTGGAAGAT CCAGAAAGTC CGTCTGTAAC CTCAGATGAA
GCATTTACAT CGTCTCAGTC AGCGGCTACT CCGAAAAAGA AAATGATGCT GTTGCCGTCT
CGTAAAACAG CCCTGGCGAC GTCTCCACAA GCAGTAGATA TAAGAGTGAA GTCCGAGCTT
CTGAAGTCGT CCACCAATTC CAGCGAGGAT GAAGACGTCA GCACAAAGAC TCAGACACCT
CCCTCAGACT CAGACATAGT CATGTCCATA GCGAAACTGG AGCCTTCTAC CACAAACTCG
TCGATGCCAG CTCCTCCTCT AACAAAGCTT CCCAGCACGA TGAACATCGA TATTTTGGCC
AGCGCTGCAT CCGAGGAACT CAGCAAAATC GCCAACCCAT CAAAATCGCT TCCTTCTCTT
ACAGACTACT TTGGAAACAG CATGAACAAA GCGCCAGGCG TCCACTACAA CTTCTCCAGT
GATAGAGCCA CGTTCCATCT CAATGACTCC AAATCCAGCA ACAGTCTCCA GTACTTGTCT
AGTATCGCAA CTTTGACAAA CACACATGAA AATCAGAATC CGCCCTTTCT TGTTCAAAAG
CCTAAGGCTG TGTCAACAAA CAAACTCAGC ACGTTATCCT CGCTCCAGAG AATGACTCCC
ATCACGCAAA ACGGAATCTA CCACCCAGAG CCATCGCACA ACAAGTCTCA CATAATAGAA
GACTCGGACC TTGACTACGT CAAGCTGAGG TTGAAAAAAT CGAGACCTAA CAGCCCCAAC
CCAAAGCCGT TCACCTTACC CAATTCGCCT GTTCTCGGAC TCTCGTCGAA TAACACCCCT
ATCATCTCGG CTAACAACAG TTCGACGAAT TTGTCGTCGT TGTTGATGAC TCCGGCCTTT
AGAACTACCA GCATGGACCA CAACTCCACC
 
Protein sequence
KKDPSSRPYK CPLCEKAFHR LEHQTRHIRT HTGEKPHACT FPGCFKRFSR SDELTRHLRI 
HTNPNSRRNK NLNKHNINYT NNPRNIKSED PETPPLTKLP STMNIDILAS AASEELSKIA
NPSKSLPSLT DYFGNSMNKA PGVHYNFSSD RATFHLNDSK SSNSLQYLSK PSHNKSHIIE
DSDLDYVKSR LKKSRPNSPN PKPFTLPNSP VLGLSSNNTP IISANNSSTN LSSLLMTPAF
RTTSMDHNST
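
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The following is a minimal Python sketch of translating the opening of the gene sequence under that table. The helper `translate` and the compact table construction are illustrative and are not the database's own pipeline. Because the gene is marked as spliced, the full gene sequence cannot be translated in one pass, so the sketch uses only the first 30 bases.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons enumerated in TCAG order
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each codon (TTT, TTC, TTA, ...) to its amino acid
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), STANDARD)}

# Table 12 differs from the standard code in reading CTG as serine
TABLE_12 = dict(CODON_TABLE, CTG="S")


def translate(dna: str, table: dict = TABLE_12) -> str:
    """Translate DNA codon by codon, ignoring whitespace and any trailing partial codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(table[seq[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence above
print(translate("AAGAAGGATC CTTCCTCCAG ACCGTACAAA"))  # KKDPSSRPYK
```

The output matches the first ten residues of the protein sequence listed in the record.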