Gene PICST_67950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_67950 
SymbolMCM1 
ID4839352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp1545473 
End bp1547016 
Gene Length1544 bp 
Protein Length243 aa 
Translation table12 
GC content45% 
IMG OID640390667 
Producttranscription factor of the MADS box family 
Protein accessionXP_001385295 
Protein GI126137543 
COG category[K] Transcription 
COG ID[COG5068] Regulator of arginine metabolism and related MADS box-containing transcription factors 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0903461 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGCTTATATA ACACAGCGAC TCAATTACAA GTAACTAGTA AGTATGCCGT GGCTGATGGC 
GAGCTCGTGG CCACACGGAT GTGGCCGGCA GCATTAACAG CATAGAACCG TCGAATTGAT
ACAATCGGTG TGACCGGTAG CACTGGGTTT ACGGGAGGTT TGTCCATATT ATTATCACGA
AGCCCTGGAA TGTTGCATTC TCAGCGAATT GTCCCTTTGC ACTGGAGCGT AGCGATGGAT
CACATCGTAT TATCGGATGC AAGGTTGAGA CGAAGCTGAT GATGGCGTAG CTCACGTTGA
TATTGGAATG GGATTGATAG CGACATTGAT GCTGTAGTAA TTGCTATTTC AGGGTCCGAT
AACGACGATA AACAAATACA GATCAATTTC TGGAGTCCAT ACCAATATTT CTATAGCAAA
ACATAGTGAC TATCATCATG TCGTTACATC ATGCTTCGTA ACGTTGCTAT CATTTTTCAT
TCGTGTCATC TATCATTTCT GTTATCATTC TAATGCTGTT GTTGCAATCA TATACAAGTA
AATACTAACA TTATTACCAG AATGTCCGAA GTAACCGAAG CCAAGTTGGA GAACTCCGAC
CAACAATTCG AAAACATCGG TGCTAACGGC GCTGCCAACA AGGGCGGTGC CGATGTTGCT
GGCGACGATG ACGACGACGA CGAAGCCGGC GGTGGTGGAA AGGCCCAGAA GGAAAGAAGA
AAGATCGAAA TCAAATTCAT CCAGGACAAG TCTAGAAGAC ACATCACCTT TTCCAAGAGA
AAGGCCGGAA TCATGAAGAA AGCGTACGAG TTGTCGGTGT TGACAGGAAC CCAAGTATTA
TTGTTGGTTG TGTCTGAAAC CGGTTTGGTT TACACCTTTA CCACCCCTAA GTTGCAGCCA
TTGGTGACCA AGTCTGAAGG TAAAAACTTG ATTCAAGCAT GCTTGAATGC TCCTGAAGAA
GGTTTGGGCG ACGACCAAGG TGACCAGTCA GATGGAAACT CCGGCGAGTC GCCAGATCAA
AGCCCGGCTC CTCCTCAGCA ACAACCCCCA CAACACCAGC AAGTACAACA GCAACAGGCC
ATTGCTCACC AGCAACAGGT GCGCCATCAG CAACAGCCAC AACAGCAATT GCCTCCTGGT
GCACACATAC CCCACGGAAT TCCATATCCT AACGCCGGTC ATCCACAACA GCCAGGCATT
CCTATCCCAC CCAATGCCTA CGGAGACAGT TACCAACAGT ACTTCTCCAA CATGCAAAAC
AGCAACATGC CCAACCAGCA ACAATATCAA TGATGACTAT TTATTAGACA AACAGTGAAC
GAACCCGGAT TCCATTGTAC AGTATTATTA TTATTCTCCT ACCTATTATT ATTTTTGCTT
GTTGAGGTGT CAGTTTATGT TACGTTGATT GAGGTGAGTG TGCTGAGTTG TGTCGACGAG
AAGAGTTGAT TGTGATTGTT ATTGTATTCT TCCAGCGTAT TTAACAGTAC GTAAATATTT
AGTTCTTCTA TTCTTTTTAT AAAACAGTGC TTTCCATTCA TGAT
 
Protein sequence
MSEVTEAKLE NSDQQFENIG ANGAANKGGA DVAGDDDDDD EAGGGGKAQK ERRKIEIKFI 
QDKSRRHITF SKRKAGIMKK AYELSVLTGT QVLLLVVSET GLVYTFTTPK LQPLVTKSEG
KNLIQACLNA PEEGLGDDQG DQSDGNSGES PDQSPAPPQQ QPPQHQQVQQ QQAIAHQQQV
RHQQQPQQQL PPGAHIPHGI PYPNAGHPQQ PGIPIPPNAY GDSYQQYFSN MQNSNMPNQQ
QYQ