Gene PICST_81508 details


Gene Information

Locus tag: PICST_81508
Symbol: SMP2
ID: 4836687
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 362840
End bp: 365678
Gene Length: 2839 bp
Protein Length: 768 aa
Translation table: 12
GC content: 45%
IMG OID: 640388002
Product: protein involved in plasmid maintenance, respiration and cell proliferation (by homology)
Protein accession: XP_001382302
Protein GI: 150863734
COG category: [R] General function prediction only
COG ID: [COG5083] Uncharacterized protein involved in plasmid maintenance
TIGRFAM ID:
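
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported gene length) and that the coding region ends in a single stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the record above.
start_bp = 362840
end_bp = 365678
protein_length_aa = 768


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with one stop codon."""
    return protein_aa * 3 + (3 if include_stop else 0)


gene_length = span_length(start_bp, end_bp)       # 2839, as reported
cds_length = coding_length(protein_length_aa)     # 2307
non_coding = gene_length - cds_length             # 532

print(gene_length, cds_length, non_coding)
```

The 532 bp that do not encode protein are consistent with the record marking the gene as spliced, assuming the remainder is intronic or otherwise untranslated sequence.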


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAATACG TCGGCAAAGT TGGAGGATAC GTCTACAACC AGTGGAATTC GCTCAATCCT 
CTGACGCTTT CCGGTGCGAT TGACATCATT GTAGTAGAAC AGCCGGACGG GACGCTCCAC
TGTTCGCCGT GGCATGTTAG GTTTGGGAAG TTCCAAATCA TCAAGCCGCT GCAGAAAAAG
ATCGATTTGT ATGTCAATGA CGTGAAGACG GATTTGCCCA TGAAGTTGGG TGATGGTGGA
GAAGGATTTT TTGTGTTTGA AACCGACAGC CGTGATGGGT TATCCCAGAG CGTGTTGACA
TCTCCGGTTA TATCGCCAGT TTCTTCGGCA TCCACGTCTC CAATTGGTTC TCCACAACTG
TCTGGTATCG GAGAAGATGA TATTGGGCGG GCAGAGCCGG AATTACTCGA TTTAAACGCG
AACTCCGTCG ATATTGAAGA CAAGATCAAA CTTTCCGATG AGCCTTCGCC AGTACTTGCT
GGGTCTTCAC CGAAAGCTGC AACTCCGACA CCATCGTTGC AGTCCAAGAC GTTTGAAAAA
GTGAGGAAAA TCACCCAAAA GTTGAACATA CCCTCAAAGA TCGACATCAA CGGCGATATC
GTTTTGGACA TGGACGGTTA TAAGCCCAAC GCCCAGAAGA ACATCGACAA CTCCGACGAG
TTGTTCAAGA AGATATTTTT ACAGGAGATA AAGGATTCGT CGGCTGTACA CGGGAGCAAC
ACCAATAGCG CCAGTAACAG CCATAGTAAC AGTAGTGCTA ATGTCCTCGG AGAGGGAGAG
GAGGAGCTTT CTTCCCTCTG GGAACAGTTC GTCAGTAAAG ACAAAAATGG CAATATTCGT
ATTTTGAATA GAGATCACTT ATCACCGATG GAAGACGATT TGATGTCGGT CGAAGGCTTG
GTTAGTGAGA ACGAAGATGA CGCCCAGTCG TTGAGAACCT CTGCCTCTGT TATAACCTCC
AATACTGGTA GCAACACAGG AGATGGATCT CTTCCCAGTT CCAGTAGTGA GTCAAGTAAA
ACATACTTCA AAACTCTTCG GTTGACTTCT GAACAGATGC AGAAAATGAA GTTGCATTAT
GGTGAAAACA AGTTGACGTT CAAACTCAGC GAAGGTACGG CTCAGATTGA GTCTTATTTG
TATCTCTGGA GGGCCACTAC ACCAATAGTA ATCTCGGATA TCGATGGAAC CATCACCAAG
TCCGATGCCT TGGGCCATGT GCTTAACTTG TTCGGTAAGG ATTGGACCCA TCCTGGTGTA
GCTACTCTTT TCACCGACAT CAAAGCCAAC GGGTACAACA TTATATATCT TACCGCAAGG
TCGGTAGGCC AAGCCGATAC TACCAGACAA TATTTACGTG GCATTGTACA AGACAATGGC
GTGAAATTGC CTCAGGGACC GGTCATTCTT TCTCCCGATC GAACCATGGC CGCCTTGCGT
CGAGAAGTCA TCTTGAAGAA GCCAGAAGTG TTTAAGATGG CGTGTTTGAA CGATATCAAA
AGTCTCTATT TCCACAGTGA TCAGTTCGCA GAGCCAGAAG ATGACGAGAG AACGCCTTTC
TACGCGGGTT TCGGCAACAG AATCACCGAC GCTATAAGCT ACAGATCGGT GAAGATTCCC
AGTCACAGAA TATTCACTAT CAATCCCAAT GGAGAAGTCC ATATGGAATT GTTGGAGTTG
GCCGGTTACA AGTCGTCGTA TTTACATATC GGAGAGTTGG TCGACCAATT TTTCCCTCCA
ATCAAACAGG TTTCGTCCCT GGACTCGTAT TGGAATGACA ACCAGTTGCA CGAATACATG
AGCAACTTGA ACCATTCCGA TACGAACGAT GTTTCTTGCG GTGCTGGTTC TCCCAGACTG
CCTGGTTCAC CTCGAAGCCT CAACGAGGAA GGCTTCAGAG ACTTCCAAAC TGAAGAGAAG
TTCAACGACG TCAACTATTG GCGAGAGCCA CTTAGTTTCA GCGATTTGAG CGATCTTGAT
GATGAAGAAA CAGCAACACC AGAACCTCCT AAATCGCCAG GTCTATTGCT GTTGCGCTCA
TTCACTTCTG ACTCTGAAAA TATATACGAT GAGAAGAAAG CTGCTACAAA TCCAGAACTT
AATGAAACCC GTACTCGTTC CAGGGCTTCC TCAAGTCCAC CAGAACGCCC TACTTCAGTC
TCGTCATTTA CTTCCCCGTT GAAGAATTTC ATGATGTTCG GAAGCAAAGG CAACGACATC
GACGACGATG ACGACTACAC TGAAGTCAAA GACAATAATA GCCCCAGAAA GTCACAACTC
ATCTCCAAAC TAGCCAATGC ACGTGACGAC TCTTTGCTAG ACGATGGAGT TCATGATTCA
GACTACACTG CTGAAGAGGA TCTCGATGAC GATGACGATT ACACTGACGA CGATTACGAT
GAAGATGAAG ATGATGACTA TGAAGATGAC GACGATTACG ATGAAGAAGA AGAAGAGGAA
GACTACGATG AGGAAGAAGA CTTCGATGAG GAGGACATAG ATGATGATGA CATCGATGAA
GATGACGGGC TACCAGAAGA AGTTCAACAG AAGTTGTCTA TCACAGAAAA CAGCAACACC
AAATCCAAAG TCACCATTAA CAATAATAGC AGTATCGAAA AGAGAAAAGC ATCATGTGAC
CCTCCCCAAT TAGGCTTGGA TCACGCTAGT GGTTTTGTCA AAGCCAGCAA TATTCGCTAG
ATAGAAGAGT ATTATAAGGA AAGCAACAAG TACATAGAAG TCGAGTGACA CTTAGCAAAA
TAGCTGCGAA ACTACAAAAC CATTCGCTCG CACATCCATA TAGAAAGTAG ATCTCTACAA
ATATCTAATA ACGCTTTGT
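
The gene sequence is printed in space-separated blocks of ten bases, so the blocks need to be joined before the sequence can be compared with the Gene Length and GC content fields. A minimal sketch follows; the helper names are hypothetical, and only the final line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Sample input: the last line of the gene sequence above.
last_line = clean_sequence("ATATCTAATA ACGCTTTGT")
print(len(last_line), round(gc_percent(last_line), 1))
```

Applying the same two helpers to the full sequence text gives a length and GC percentage that can be compared with the values listed under Gene Information.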
 
Protein sequence
MQYVGKVGGY VYNQWNSLNP STLSGAIDII VVEQPDGTLH CSPWHVRFGK FQIIKPSQKK 
IDLYVNDVKT DLPMKLGDGG EGFFVFETDS RDGLSQSVLT SPVISPVSSA STSPIGSPQS
SGIGEDDIGR AEPELLDLNA NSVDIEDKIK LSDEPSPVLA GSSPKAATPT PSLQSKTFEK
VRKITQKLNI PSKIDINGDI VLDMDGYKPN AQKNIDNSDE LFKKIFLQEI KDSSAFVSKD
KNGNIRILNR DHLSPMEDDL MSVEGLVSEN EDDAQSLRTS ASVITSNTGS NTGDGSLPSS
SSESSKTYFK TLRLTSEQMQ KMKLHYGENK LTFKLSEGTA QIESYLYLWR ATTPIVISDI
DGTITKSDAL GHVLNLFGKD WTHPGVATLF TDIKANGYNI IYLTARSVGQ ADTTRQYLRG
IVQDNGVKLP QGPVILSPDR TMAALRREVI LKKPEVFKMA CLNDIKSLYF HSDQFAEPED
DERTPFYAGF GNRITDAISY RSVKIPSHRI FTINPNGEVH MELLELAGYK SSYLHIGELV
DQFFPPIKQV SSSDSSPGSP RSLNEEGFRD FQTEEKFNDV NYWREPLSFS DLSDLDDEET
ATPEPPKSPG LLSLRSFTSD SENIYDEKKA ATNPELNETR TRSRASSSPP ERPTSDLDDD
DDYTDDDYDE DEDDDYEDDD DYDEEEEEED YDEEEDFDEE DIDDDDIDED DGLPEEVQQK
LSITENSNTK SKVTINNNSS IEKRKASCDP PQLGLDHASG FVKASNIR
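
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG (CUG) is read as serine rather than leucine. The sketch below translates the first 120 bases of the gene sequence under both the standard code and table 12 to show the difference at residue 21. It is a minimal illustration, not the database's own translation pipeline; the table is built by hand from the standard code with a single override, and the helper names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(aas: str) -> dict:
    """Map each codon to its amino acid, following the TCAG ordering."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, aas))


TABLE_1 = build_table(STANDARD_AAS)
# Table 12 differs from the standard code only at CTG (Leu -> Ser).
TABLE_12 = dict(TABLE_1, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First two lines of the gene sequence above.
first_120 = """
ATGCAATACG TCGGCAAAGT TGGAGGATAC GTCTACAACC AGTGGAATTC GCTCAATCCT
CTGACGCTTT CCGGTGCGAT TGACATCATT GTAGTAGAAC AGCCGGACGG GACGCTCCAC
"""

print(translate(first_120, TABLE_12))  # MQYVGKVGGYVYNQWNSLNPSTLSGAIDIIVVEQPDGTLH
print(translate(first_120, TABLE_1))   # MQYVGKVGGYVYNQWNSLNPLTLSGAIDIIVVEQPDGTLH
```

The table 12 output matches the first 40 residues of the protein sequence above, while the standard code places a leucine at position 21. Because the gene is spliced, translating the full genomic sequence in a single frame will not reproduce the whole protein.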