Gene PICST_66746 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_66746 
SymbolCSM1 
ID4852042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp3480360 
End bp3481714 
Gene Length1355 bp 
Protein Length333 aa 
Translation table 
GC content40% 
IMG OID640393750 
ProductCSM1-like protein 
Protein accessionXP_001387000 
Protein GI126276407 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CATCTCCAGA ACATTGTGCA TACTACATCA AGAAATGGAA TTTCGAATCC AAATAAAGAA 
ATCCAATCCT TCGATAAAGG TTTGAGTTTT ATAGCACTGT ATTCGCAAAA GGGCTCGTTT
AGTAAATCTT TGGTGTCATA CTGTTACACT GCAATAACGA TCTCCTTGTA ATTCCAATCT
CACAATTGTA TTTTGAATTC GTTTTTCAAT CTTCACTTGT TGATCAACGC GTCCGATAAA
TTCAACAACG AACAACATCC GTTTGATTTC ATGTTAGTGT AGTAATTCAG ATATTTCAAA
TATGGCTCCC AAAACTCGTA AAACTGTCCA GAAAGTGAAA CCCGCTCCTG TGGAAGAAGT
TCCCCACCCG AGAGTACGTA AGGTCTCAGC TAAGGTGTTG GAAAGTATGA CAGATGTATC
AGCATCAACT CCCTCAATGA AGAAAAAGAA CCCCAGCTCA CCATCAACTT CAAAAAGATC
ATCTTCATCT TCGTCAGCAG CAGTTTCGGC ATTAAGCAAG AAATTGAAGG CTTCCAACCC
CATAATAACA GAGCAGGACA TTGTTTCTGC TGAAAACGGC CAGGAACTTA TAGAGCTCAT
AAACGGACTA GTGAACACCA AACAGGACGA GACGTTCGCC AAATACAAAT CCAAGGTTCA
GAACCAGCTA AATAATGACC ACCAAGTAAT ACAAGAGTTG AATGCCGATT TGCTCCAGCG
CCAGGAAACC ATCAACGGCT TGCTCCAAGA GATAAAGCAG CTCAAGAACG AGGTTAGAAC
ATATTCCTCA TCTAATTCGG CAAGTATGAC GGTCGACGGC TCTCTTGATG AAAGCAAGGA
GTTTGTCTAT GAATCTCCCA TCAGAAAGAA GATCAACAAG AAAATCAAGA ACAGCGATAC
ATTGATTAGC CAGGACCAAC TATCCAAAGA ACTAGAAAAT ATAGGTTTCA CTTTAGACAT
GTTGGAACTT TTAACTGGTT TGAGAATTGT CAACTTCGAA GAAGATGAGT CCAAGTATTT
CTTTGATGTA AAACAGTCTG GTTCGAATGG CCAAGATGAA ATCTACATCA ACTACCAACT
TGTAATCTCA AAGTCGTTTG CAACTACTGC TGAGATCAAC TACATCCCTA CGTTCTTGGA
AGCATTGGAA AATGATGACG AGGACCAGGA ACAGGTCGAT AACGCTAACC TCTTGAAGGA
AATCCTTCCG GACTACTTGT GTGAGAACTT GTCGTTCCCC TACGATACCT TGTCACAGTT
CTACAGTAAA GTCAACCGAG CGGTCAACAA GAGGACCAAA TGAGAGTATG TACAATAATT
TCTCTATAAA TACACTAATG TACACCAGTT ATAGT
 
Protein sequence
MAPKTRKTVQ KVKPAPVEEV PHPRVRKVSA KVLESMTDVS ASTPSMKKKN PSSPSTSKRS 
SSSSSAAVSA LSKKLKASNP IITEQDIVSA ENGQELIELI NGLVNTKQDE TFAKYKSKVQ
NQLNNDHQVI QELNADLLQR QETINGLLQE IKQLKNEVRT YSSSNSASMT VDGSLDESKE
FVYESPIRKK INKKIKNSDT LISQDQLSKE LENIGFTLDM LELLTGLRIV NFEEDESKYF
FDVKQSGSNG QDEIYINYQL VISKSFATTA EINYIPTFLE ALENDDEDQE QVDNANLLKE
ILPDYLCENL SFPYDTLSQF YSKVNRAVNK RTK