Gene PICST_50114 details

Gene Information

Locus tag: PICST_50114
Symbol:
ID: 4840560
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 1027420
End bp: 1028436
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 12
GC content: 39%
IMG OID: 640391875
Product: predicted protein
Protein accession: XP_001386205
Protein GI: 150866562
COG category: [L] Replication, recombination and repair
COG ID: [COG0084] Mg-dependent DNase
TIGRFAM ID:
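
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the record: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1027420
END_BP = 1028436
GENE_LENGTH_BP = 1017
PROTEIN_LENGTH_AA = 338


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid codons in an unspliced CDS, excluding the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))  # 1017
print(coding_codons(GENE_LENGTH_BP))  # 338
```

Under those assumptions both derived values agree with the Gene Length and Protein Length fields.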


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.975027
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTAACA TTTCCATAGC TTTGTCTCTA GCAGACAGCC ATTGCCATAT TGGAATAGAT 
TGCACTGATT CCGATATCGA TGCTTTGGCA GACCAATTCA ATAATGGACT TGTAGACAAA
GACGATTTCT TCCATATCAT GACAACCTAC CATCTTGATG TAGGTTTTGT TGATCGACTT
CTTTCACAAC TAAAAAGTTC AGTAGTGGTT GCCTACTTTG GAGTCCATCC TTGGTACAGT
CACTTGTTTT CCACAGAAGA TCATGGAGAT GTTGACTTGT TACAATTGAA AAAATTACAT
TACAATAAAG TCCTTGTACC AGCACCTAGC GAAGACTTGT TGCTGGTATT ACCAGTGCCG
ATACTGCTAG AAGAACATAT GACTAAGCTA GAGAGATTGA TAGAGATACA CGGCCATAAG
TTCAAATGTG GAATTGGGGA GATTGGCTTA GATAAGCTAT TCAGAGTGCC GTCTAACGGC
TACTTTGGCA GCCAGTTGGC ACAAAACAAC GGAGCTACCA AATTGTCATC TTGTAAGGTA
TCTATGGAAC ACCAGACAGC AGTTTTTGAC AGACAATTGC AATTGGCAAA CAAGTTAAAA
AAACATATCT CAGTACATTG CGTAAAAGCT CACGGACTAT TGTATGATAT TATACCAAGA
TATACAAGCA TCTCATCTGT AATTCTTCAC TCATACAGTG GGTCTTCTGA TCAGGCCAAG
AGGTGGATAA CTACTTATAA GGGTAAGAAA TCAAAGTTAT TCTTTTCATT CTCTAATTGG
ATCAATGGAA CAGACAATAA AAGATGCCTA TTAGAAGACA TAATTGGTTA TGCGGAAGAC
AACCAGATTC TCGTTGAGAC AGATGTTTCT GTAGATGATT ATCTTGTGAG AGGAAAGCAT
GAAGATTACT TTCTCCATTT AGAAGGAATA TTTGAAAAGG TTGGAACCAT TTTGGGCCGA
GACCAAGATG AGATGGTGGA GTTGTTGAGA AGAAATATGT GCCGATCTAT AGAGTAG
 
Protein sequence
MSNISIALSL ADSHCHIGID CTDSDIDALA DQFNNGLVDK DDFFHIMTTY HLDVGFVDRL 
LSQLKSSVVV AYFGVHPWYS HLFSTEDHGD VDLLQLKKLH YNKVLVPAPS EDLLSVLPVP
ISLEEHMTKL ERLIEIHGHK FKCGIGEIGL DKLFRVPSNG YFGSQLAQNN GATKLSSCKV
SMEHQTAVFD RQLQLANKLK KHISVHCVKA HGLLYDIIPR YTSISSVILH SYSGSSDQAK
RWITTYKGKK SKLFFSFSNW INGTDNKRCL LEDIIGYAED NQILVETDVS VDDYLVRGKH
EDYFLHLEGI FEKVGTILGR DQDEMVELLR RNMCRSIE
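
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The following is a minimal, self-contained sketch of how fragments of the gene sequence above map to the protein sequence under that table. The helper names `build_table` and `translate` are hypothetical, and the code is an illustration rather than the tool that produced this record.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(aas: str) -> dict:
    """Map each of the 64 codons to its one-letter amino acid ('*' = stop)."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, aas))


STANDARD = build_table(STANDARD_AAS)
# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
TABLE_12 = dict(STANDARD, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate in frame 1, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the gene sequence.
print(translate("ATGTCTAACA TTTCCATAGC TTTGTCTCTA", TABLE_12))  # MSNISIALSL
# An in-frame fragment containing a CTG codon (nt 331-360 of the gene sequence).
fragment = "GAAGACTTGT TGCTGGTATT ACCAGTGCCG"
print(translate(fragment, TABLE_12))  # EDLLSVLPVP
print(translate(fragment, STANDARD))  # EDLLLVLPVP
```

The second fragment yields EDLLSVLPVP only under table 12, which matches the protein sequence listed above. The standard code would place a leucine at that position.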