Gene PICST_39840 details

Gene Information

Locus tag: PICST_39840
Symbol:
ID: 4851658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 2471207
End bp: 2472580
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table:
GC content: 44%
IMG OID: 640393366
Product: predicted protein
Protein accession: XP_001386807
Protein GI: 126275182
COG category:
COG ID:
TIGRFAM ID:
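
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the helper name `cds_lengths` is hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive, that the gene is unspliced (as the record states), and that the final codon is a stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated by the record): coordinates are 1-based and
    inclusive, and the last codon is a stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


gene_len, protein_len = cds_lengths(2471207, 2472580)
print(gene_len, protein_len)
```

Under these assumptions the result agrees with the Gene Length (1374 bp) and Protein Length (457 aa) fields listed above.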


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.571106
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTCGC CCGTAACCCT AGACTCGTTT CTTCAATGGG ATCCAGCACA GGTGGCTTCG 
TTCATCAACT CTGTAGTACC TGACGATGGC CGATCCGTCG GTCTTGCATT TCTAGACAAC
AACATCGAAG GATCGTTGCT TCCGTTCTTG ACAACTGAAC ATCTCCGAGA GCTAGGCATT
CTCAAATTAC ATACTAGACT CACGATAAAG CGAGCCATAA ACGATTTGAT TTGCCAGCAT
TACCTGAAAA ACCCCCCGCA ATCACTAAAT GATCCAGAAT ACAAACTCAA CAACATCAAC
ATCAACAATA ACCATATCAG CCTCGAGTCT CTTCAACTTT CCACCGTTTT AATAAAAGAC
ATGATTAAGA AAGTGGGAGT TTTTGCCAAA CAACAGGCAC TTTCTGAAAT GTCATCGCCT
GGCTCTCCGG GCCAGATTGA AATGAAAAAA CTCCACGATA ACTTCAACAA ACTCAAGACT
GACTTGATCC CCGTGATACG ATTGTTGAAA GACTCGAAGC CTCTTCCCAC TCCCGTCTTG
GATTCGCCTA CTACTAGTTA TATGAGTAGC AATTCAGACC ATGACGACAG CACTCTTTCA
AACTCAAATG CCAACACTTT GGCACTCAGA AACGTAGCCG CTTTGAATAC GGTTGCGAAC
AGAAACTCAA ACGCAACAAA CTTAACAAAT TTGCCCAGTC CTACTTACTC CAAGAGATTC
TCTTCTGGTT CCATCTTATC TTTGGGAACA GGCAAAGTAG TACAACAGGC TGTTCCCAAA
CTCGAACCCA GATTAAATAA TGACTTCCAT TTGCAGACCA TCCCTCAGAG TTTGTCTAAT
AGAAGTATCA GTGAGTCACA CGTAGAAACT TTTGCCTCTT CACAAGCAAG ACCTCGTTTA
GTAGAAACCA AATCTTCTGG AGCTACGCCA ACAACAGCCA ATGCATCCAA GGCAATTGGT
AACGGATCAG TGCCTGGAGC AACTGTTCTC AAGCCAACAC TAAAACTGTA TGGAAGCAAC
CAGCAAATCG GTCAACAACC TAAGCCATCT TCTGCTCCTG CTGCCAATGA GCCTCTAAAG
CAGCTCAGAG CTTCTACAGA CGATTCCTGT CTCAAGATCT TGCAACAGGC AATGAAAAGA
CATCATATTC CACGTGACGA CTGGTCCAAG TACGTCTTGG TCATTTGCTA TGGTGATAAG
GAGAGAATCC TCAAACTTGG CGAAAAGCCT GTCGTTGTCT TTAAGGAACT CCAGGAGTTG
GGCAAACATC CAGCCATCAT GTTGAGACAG TTAGCTCCTA CAGTAGAAGA CGATAATAAT
ATAGAATACG TAGACTCTCG GATCGGTGAC GATATTCCGG GAGGGACGCT ATAG
 
Protein sequence
MASPVTLDSF LQWDPAQVAS FINSVVPDDG RSVGLAFLDN NIEGSLLPFL TTEHLRELGI 
LKLHTRLTIK RAINDLICQH YLKNPPQSLN DPEYKLNNIN INNNHISLES LQLSTVLIKD
MIKKVGVFAK QQALSEMSSP GSPGQIEMKK LHDNFNKLKT DLIPVIRLLK DSKPLPTPVL
DSPTTSYMSS NSDHDDSTLS NSNANTLALR NVAALNTVAN RNSNATNLTN LPSPTYSKRF
SSGSILSLGT GKVVQQAVPK LEPRLNNDFH LQTIPQSLSN RSISESHVET FASSQARPRL
VETKSSGATP TTANASKAIG NGSVPGATVL KPTLKLYGSN QQIGQQPKPS SAPAANEPLK
QLRASTDDSC LKILQQAMKR HHIPRDDWSK YVLVICYGDK ERILKLGEKP VVVFKELQEL
GKHPAIMLRQ LAPTVEDDNN IEYVDSRIGD DIPGGTL
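
The GC content field in the Gene Information section can be recomputed from the gene sequence. The sketch below is illustrative only: the function name `gc_percent` is hypothetical, and it assumes the sequence is supplied as plain text in the same layout as above, that is, blocks of ten bases separated by spaces and line breaks. Only the first row of the gene sequence is used here as sample input; pass the full sequence to compare against the reported value.

```python
def gc_percent(grouped_sequence):
    """Return the percentage of G and C bases in a block-formatted sequence.

    Whitespace (spaces between 10-base blocks, line breaks) is ignored.
    """
    seq = "".join(grouped_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGGCTTCGC CCGTAACCCT AGACTCGTTT CTTCAATGGG ATCCAGCACA GGTGGCTTCG"
print(round(gc_percent(first_row), 1))
```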