Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_81508 |
Symbol | SMP2 |
ID | 4836687 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 362840 |
End bp | 365678 |
Gene Length | 2839 bp |
Protein Length | 768 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640388002 |
Product | protein involved in plasmid maintenance, respiration and cell proliferation (by homology) |
Protein accession | XP_001382302 |
Protein GI | 150863734 |
COG category | [R] General function prediction only |
COG ID | [COG5083] Uncharacterized protein involved in plasmid maintenance |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATACG TCGGCAAAGT TGGAGGATAC GTCTACAACC AGTGGAATTC GCTCAATCCT CTGACGCTTT CCGGTGCGAT TGACATCATT GTAGTAGAAC AGCCGGACGG GACGCTCCAC TGTTCGCCGT GGCATGTTAG GTTTGGGAAG TTCCAAATCA TCAAGCCGCT GCAGAAAAAG ATCGATTTGT ATGTCAATGA CGTGAAGACG GATTTGCCCA TGAAGTTGGG TGATGGTGGA GAAGGATTTT TTGTGTTTGA AACCGACAGC CGTGATGGGT TATCCCAGAG CGTGTTGACA TCTCCGGTTA TATCGCCAGT TTCTTCGGCA TCCACGTCTC CAATTGGTTC TCCACAACTG TCTGGTATCG GAGAAGATGA TATTGGGCGG GCAGAGCCGG AATTACTCGA TTTAAACGCG AACTCCGTCG ATATTGAAGA CAAGATCAAA CTTTCCGATG AGCCTTCGCC AGTACTTGCT GGGTCTTCAC CGAAAGCTGC AACTCCGACA CCATCGTTGC AGTCCAAGAC GTTTGAAAAA GTGAGGAAAA TCACCCAAAA GTTGAACATA CCCTCAAAGA TCGACATCAA CGGCGATATC GTTTTGGACA TGGACGGTTA TAAGCCCAAC GCCCAGAAGA ACATCGACAA CTCCGACGAG TTGTTCAAGA AGATATTTTT ACAGGAGATA AAGGATTCGT CGGCTGTACA CGGGAGCAAC ACCAATAGCG CCAGTAACAG CCATAGTAAC AGTAGTGCTA ATGTCCTCGG AGAGGGAGAG GAGGAGCTTT CTTCCCTCTG GGAACAGTTC GTCAGTAAAG ACAAAAATGG CAATATTCGT ATTTTGAATA GAGATCACTT ATCACCGATG GAAGACGATT TGATGTCGGT CGAAGGCTTG GTTAGTGAGA ACGAAGATGA CGCCCAGTCG TTGAGAACCT CTGCCTCTGT TATAACCTCC AATACTGGTA GCAACACAGG AGATGGATCT CTTCCCAGTT CCAGTAGTGA GTCAAGTAAA ACATACTTCA AAACTCTTCG GTTGACTTCT GAACAGATGC AGAAAATGAA GTTGCATTAT GGTGAAAACA AGTTGACGTT CAAACTCAGC GAAGGTACGG CTCAGATTGA GTCTTATTTG TATCTCTGGA GGGCCACTAC ACCAATAGTA ATCTCGGATA TCGATGGAAC CATCACCAAG TCCGATGCCT TGGGCCATGT GCTTAACTTG TTCGGTAAGG ATTGGACCCA TCCTGGTGTA GCTACTCTTT TCACCGACAT CAAAGCCAAC GGGTACAACA TTATATATCT TACCGCAAGG TCGGTAGGCC AAGCCGATAC TACCAGACAA TATTTACGTG GCATTGTACA AGACAATGGC GTGAAATTGC CTCAGGGACC GGTCATTCTT TCTCCCGATC GAACCATGGC CGCCTTGCGT CGAGAAGTCA TCTTGAAGAA GCCAGAAGTG TTTAAGATGG CGTGTTTGAA CGATATCAAA AGTCTCTATT TCCACAGTGA TCAGTTCGCA GAGCCAGAAG ATGACGAGAG AACGCCTTTC TACGCGGGTT TCGGCAACAG AATCACCGAC GCTATAAGCT ACAGATCGGT GAAGATTCCC AGTCACAGAA TATTCACTAT CAATCCCAAT GGAGAAGTCC ATATGGAATT GTTGGAGTTG GCCGGTTACA AGTCGTCGTA TTTACATATC GGAGAGTTGG TCGACCAATT TTTCCCTCCA ATCAAACAGG TTTCGTCCCT GGACTCGTAT TGGAATGACA ACCAGTTGCA CGAATACATG AGCAACTTGA ACCATTCCGA TACGAACGAT GTTTCTTGCG GTGCTGGTTC TCCCAGACTG CCTGGTTCAC CTCGAAGCCT CAACGAGGAA GGCTTCAGAG ACTTCCAAAC TGAAGAGAAG TTCAACGACG TCAACTATTG GCGAGAGCCA CTTAGTTTCA GCGATTTGAG CGATCTTGAT GATGAAGAAA CAGCAACACC AGAACCTCCT AAATCGCCAG GTCTATTGCT GTTGCGCTCA TTCACTTCTG ACTCTGAAAA TATATACGAT GAGAAGAAAG CTGCTACAAA TCCAGAACTT AATGAAACCC GTACTCGTTC CAGGGCTTCC TCAAGTCCAC CAGAACGCCC TACTTCAGTC TCGTCATTTA CTTCCCCGTT GAAGAATTTC ATGATGTTCG GAAGCAAAGG CAACGACATC GACGACGATG ACGACTACAC TGAAGTCAAA GACAATAATA GCCCCAGAAA GTCACAACTC ATCTCCAAAC TAGCCAATGC ACGTGACGAC TCTTTGCTAG ACGATGGAGT TCATGATTCA GACTACACTG CTGAAGAGGA TCTCGATGAC GATGACGATT ACACTGACGA CGATTACGAT GAAGATGAAG ATGATGACTA TGAAGATGAC GACGATTACG ATGAAGAAGA AGAAGAGGAA GACTACGATG AGGAAGAAGA CTTCGATGAG GAGGACATAG ATGATGATGA CATCGATGAA GATGACGGGC TACCAGAAGA AGTTCAACAG AAGTTGTCTA TCACAGAAAA CAGCAACACC AAATCCAAAG TCACCATTAA CAATAATAGC AGTATCGAAA AGAGAAAAGC ATCATGTGAC CCTCCCCAAT TAGGCTTGGA TCACGCTAGT GGTTTTGTCA AAGCCAGCAA TATTCGCTAG ATAGAAGAGT ATTATAAGGA AAGCAACAAG TACATAGAAG TCGAGTGACA CTTAGCAAAA TAGCTGCGAA ACTACAAAAC CATTCGCTCG CACATCCATA TAGAAAGTAG ATCTCTACAA ATATCTAATA ACGCTTTGT
|
Protein sequence | MQYVGKVGGY VYNQWNSLNP STLSGAIDII VVEQPDGTLH CSPWHVRFGK FQIIKPSQKK IDLYVNDVKT DLPMKLGDGG EGFFVFETDS RDGLSQSVLT SPVISPVSSA STSPIGSPQS SGIGEDDIGR AEPELLDLNA NSVDIEDKIK LSDEPSPVLA GSSPKAATPT PSLQSKTFEK VRKITQKLNI PSKIDINGDI VLDMDGYKPN AQKNIDNSDE LFKKIFLQEI KDSSAFVSKD KNGNIRILNR DHLSPMEDDL MSVEGLVSEN EDDAQSLRTS ASVITSNTGS NTGDGSLPSS SSESSKTYFK TLRLTSEQMQ KMKLHYGENK LTFKLSEGTA QIESYLYLWR ATTPIVISDI DGTITKSDAL GHVLNLFGKD WTHPGVATLF TDIKANGYNI IYLTARSVGQ ADTTRQYLRG IVQDNGVKLP QGPVILSPDR TMAALRREVI LKKPEVFKMA CLNDIKSLYF HSDQFAEPED DERTPFYAGF GNRITDAISY RSVKIPSHRI FTINPNGEVH MELLELAGYK SSYLHIGELV DQFFPPIKQV SSSDSSPGSP RSLNEEGFRD FQTEEKFNDV NYWREPLSFS DLSDLDDEET ATPEPPKSPG LLSLRSFTSD SENIYDEKKA ATNPELNETR TRSRASSSPP ERPTSDLDDD DDYTDDDYDE DEDDDYEDDD DYDEEEEEED YDEEEDFDEE DIDDDDIDED DGLPEEVQQK LSITENSNTK SKVTINNNSS IEKRKASCDP PQLGLDHASG FVKASNIR
|
| |