Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_66365 |
Symbol | MXP1 |
ID | 4851129 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 1002308 |
End bp | 1006002 |
Gene Length | 3695 bp |
Protein Length | 977 aa |
Translation table | |
GC content | 42% |
IMG OID | 640392837 |
Product | Metalloexopeptidase |
Protein accession | XP_001387424 |
Protein GI | 126274115 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.203014 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.591866 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTAACCCCAA AACAGTATTA TTCACCACTG GAACTCCAGC TCGCTAACAC CACCTCATAC TGCTTCTTGT ACATAGCATT CTTGTTTGTT GGCTTTCTCG GCTGGTTTCA GTTTCATCAT TTCTATCGTT GCATCAAGCC CAGTAATCAA TATTTATACA AGAGATCCAT CACAAAACAC AAACAAATAA ACTTATATGA ATTTCCACGA TTCAATCATC TCGAGTCATA TCAAAATTAT CCCATATCCA ACTTCAATTA TTGATATTTA ACTATCTTCA AGAATTGAGT CATAGCATTA AATACTTCAA TATTCATCAC GTAAACTATC GCGATTCATT TCTTTGCATA CCATGTCACA CCCTTATAAC CGAGCCATTA TCCATGAAGG CGAATACGTT CTGATTCCAG CACGGTTGCC TCTCCCGATA AGTTCTACTG ACTCATTTCT TGTGCGGTCA CCTAGTTCTA ACACTCCTGT AAGACTTAAC TCACCCAACT TGACTGTGGC AGATTTATCC ATAAATGACT CTGGTATCAG ATCGGGAAGA AGTAGCCCCA TAGCAAATGG ACGGAATTCT CCGTCACAAA CAAACCATTC ACGAGGTTCC GGCCCTACAG CAGCTGCAAA TGCTTCACCA ATCGGAAGTC CATTGGCTAA TAACATCAAC ACCGCACATA GCAGTACCAA TGGTAGTAAC ATTGTAGAAA GACATAGTAA CGCATACACT TCCAGCAATG ACTTGGACGA GATCTCGGAT TCCAATTCGC CTCCATTGCG ATCGCAAACT CTGGCAGTTT CTGTAACTTC AACATTGCCT GATTCCACCG AATACACCCC AGCACTTTTG CACAAGTGGA CTCATACCCA TTCCATTCTT TGTGTGGTCC CAGCTCCACA AAAGAAGTTG ATTTTCTGCG GAACCCAAGA CTCAAAGATA TTGGTGTACG ACATGATTAA CTATTCGTTA AAATACGAAG TCAACTGTGG CCAACAGAAT CACGCTTCTT CCGTTCTTAC TTTGACGATT TCTGCTGATG AGAACCATCT CTTCAGTGCT GGTTCTGATT CCCTCGTTAA AGTGTATGAC TTGTCAGAAA TCAAACCAGT TTCTGAAAGA GATTCTGACG ATACTGTGGA AGGTGAGCTG CCTATCCGTT GTACGCACAT CATCTTCTCT CTGGTGGATA TCGGTGATAT TTTCTCCATT GCCTGGTGCG ATCTGTTGTC CACAATATTC ATTGGAGCCC AGAACGCTTC CATATTGTGG TGCCACCTTT CTCTCACTAG TACGGGCCAT GGTTCAAATA CTTCCAACGT GGAACGCTTA CCTCACTTGC GTTACGACAA GTTCTTTGAC TCAAAGGGTC CTGGTGGATC CATGAATACG CTTCAATCGA AACATCAACT ATTCAGAAAG TATTCCAGTA CCTCTCATTC TTCACATAGT TCACCAAAAT TAGTAGAAGT GAAAAACGAG GATATCATTC GTTTTGCCCA CAATGGTTAC GTCTACTGTA TGGACGTATT CCGCTGTCGT TTAAGTGACG GGCGAATGCT GGATAAAGAC TTTAGCTTCC ATTATGCTGA CGATTTCGAG AATATCTTAG TTTCTTGTGG TGGGGACGGA CTCGTCAAGA TATGGGGGAT TAACAGCACC GAGTCTGGAC TCAAGATTAC CAGCGTAGAA TCTTTAGAAA ATGAAGAATC GGTTCTCTCT ATGTCAATTC AGGATTTCTA CCTTTATGTT GGGTTGAGTG ACTCTACCAT CAATGTGTGG GATTTGATGA CTTCGCAGTT GATTCGTTCA TTCCATTTCA CATCGGAAAA CGACGGTAAC TCCTCGTACG ATGAAGTGTT AAGTCTTGGA ATATACAACG ACTGCATTTT CAAAGCATCT AACTTGGGTG GTCTTGTAAA ATTCACTTTG AAGAGCTACC CGACGAAATC GCTAAGCCTA GATGAAGCAG CAAGATACGC AAACGTTAAT CAAACTACTT TGGATAAACA TTCCACAGTG ATAATCTCAG ATGGGGCTGT TCCTTACCAG CATGAAAGCA AATTGGGTGC TGTTTTGTCA GTCAAGATCT TCAAGGACAT CTCAGGTTGT ACATATTTGC TTTCAGGAGG TAACAAAGCT CTTTGTCTCT GGGATATTAA CAATGTAGGT TTGAAACACA ACGATCCACT GGGGTTGGTA ACTGATGATT CCGTACCCGA TTCAACTGAA CAGTGTAGAT TGTCTAATGA CGAATTGCTC AAATCGTTAA ACAAATTCAT TTCGTTCAAA ACCATCTCCA AGTTCCCGAC GCTCTATCTT GAAGATTCCC GTCATTGTGC CCAATTCTTG TGTAACTTAT TGATCGACTT GGGCTCTAAG CAAACCAAGT TGCTACCAGT AGCTGATGGT AACCCTATCG TGTATTCCAC TTTTACACGT AACAGTAAGA CAGCAACCGG CAAACCCACA AGAGTCCTCT GGTATGCCCA TTATGACGTC GTTGATGCCA CTAATCATGA AGCTGCTGAT TGGGAAACCG ATCCGTTTTT GTTAACTGCC CGTGATGGGA ACTTGTATGC TCGTGGTGTA TCCGATAACA AGGGCCCTAT ATTGGCTAGT ATATATGCCG TAGCGGACTT GTTTCTGAGA GAAGAATTGT CTTGCGATGT TGTATTCATC ATTGAAGGTG AAGAGGAGTG CGGATCTATT GGATTCCAGA AAGTCATCAA CGAGAGCAAG TCTCTCATTG GGGATATCGA CTGGGTAATG TTATCCAATT CATACTGGCT CGACGATGAA ACTCCATGTT TGAATTATGG CTTAAGAGGT GTCATCAATG CAGCGGTAAC AATCAAGTCC GATAAGCCAG ACAGGCACTC GGGTGTAGAT GGAGGTGTTC TGAAAGAACC AACTATGGAT TTGGTCCAGA TTGTGGGTCA ATTGGTAGAT CCTATTACCA ACGAAATCAA GCTCGACGGC TTCTACGACG ATGTGTTGCC ATTGACAGAA AGAGAAGTTC GTTTGTATCA GGACATCGAG CAAGCAGCAA CGATCAAGAA CATGAACAAT CAAGATTTGA AGACATTGAT GGCCAAGTGG CGTAACCCAT CGTTGACTAT ACACAAGATC CAGGTATCTG GTCCAAACAA CAACACTGTG ATTCCGCAAG TCGCCAAAGC GACAATCTCT ATTAGAATCG TACCTAATCA GGATTTGGAA AAAGTCAAAC AGTCATTAAT AGATCGCTTG ACAAAGGCTT TCGGTGCGCT TCAATCAGAA AACCGTATCC TGATCAATGT GTTCCATGAA GCAGAGCCGT GGTTAGGAGA CCCATCAAAC TTGGTCTACT CCATCTTGTT TAACAAAATC AAATCCAACT GGGGCCACGA GCCACTTTTC ATTCGTGAAG GGGGTTCTAT TCCATCCATC AGATTTCTTG AAAAGTGCTT CAATGCTCCA GCAGCACAGA TTCCATGTGG ACAGGCTTCA GACAATGCCC ACTTGAAGGA CGAAAAGTTG AGAATCCTCA ACTTGTACAA GATGAGATCT ATTTTGACAG ATACCTTCTT GGAATTAGGT CAAGACAGAC AATAGACTAT AGATACCTAG GTGGAATCTT GTGATAATTT AGATAGACTA CGCTTATACC ATTCACGCCA AAATAGTTAT TTATTTAATA ATACCGACAT ATACG
|
Protein sequence | MSHPYNRAII HEGEYVLIPA RLPLPISSTD SFLVRSPSSN TPVRLNSPNL TVADLSINDS ALLHKWTHTH SILCVVPAPQ KKLIFCGTQD SKILVYDMIN YSLKYEVNCG QQNHASSVLT LTISADENHL FSAGSDSLVK VYDLSEIKPV SERDSDDTVE GELPIRCTHI IFSLVDIGDI FSIAWCDLLS TIFIGAQNAS ILWCHLSLTS TGHGSNTSNV ERLPHLRYDK FFDSKGPGGS MNTLQSKHQL FRKYSSTSHS SHSSPKLVEV KNEDIIRFAH NGYVYCMDVF RCRLSDGRML DKDFSFHYAD DFENILVSCG GDGLVKIWGI NSTESGLKIT SVESLENEES VLSMSIQDFY LYVGLSDSTI NVWDLMTSQL IRSFHFTSEN DGNSSYDEVL SLGIYNDCIF KASNLGGLVK FTLKSYPTKS LSLDEAARYA NVNQTTLDKH STVIISDGAV PYQHESKLGA VLSVKIFKDI SGCTYLLSGG NKALCLWDIN NVGLKHNDPL GLVTDDSVPD STEQCRLSND ELLKSLNKFI SFKTISKFPT LYLEDSRHCA QFLCNLLIDL GSKQTKLLPV ADGNPIVYST FTRNSKTATG KPTRVLWYAH YDVVDATNHE AADWETDPFL LTARDGNLYA RGVSDNKGPI LASIYAVADL FLREELSCDV VFIIEGEEEC GSIGFQKVIN ESKSLIGDID WVMLSNSYWL DDETPCLNYG LRGVINAAVT IKSDKPDRHS GVDGGVLKEP TMDLVQIVGQ LVDPITNEIK LDGFYDDVLP LTEREVRLYQ DIEQAATIKN MNNQDLKTLM AKWRNPSLTI HKIQVSGPNN NTVIPQVAKA TISIRIVPNQ DLEKVKQSLI DRLTKAFGAL QSENRILINV FHEAEPWLGD PSNLVYSILF NKIKSNWGHE PLFIREGGSI PSIRFLEKCF NAPAAQIPCG QASDNAHLKD EKLRILNLYK MRSILTDTFL ELGQDRQ
|
| |