Gene PICST_66365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_66365 
SymbolMXP1 
ID4851129 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1002308 
End bp1006002 
Gene Length3695 bp 
Protein Length977 aa 
Translation table 
GC content42% 
IMG OID640392837 
ProductMetalloexopeptidase 
Protein accessionXP_001387424 
Protein GI126274115 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.203014 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.591866 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CTAACCCCAA AACAGTATTA TTCACCACTG GAACTCCAGC TCGCTAACAC CACCTCATAC 
TGCTTCTTGT ACATAGCATT CTTGTTTGTT GGCTTTCTCG GCTGGTTTCA GTTTCATCAT
TTCTATCGTT GCATCAAGCC CAGTAATCAA TATTTATACA AGAGATCCAT CACAAAACAC
AAACAAATAA ACTTATATGA ATTTCCACGA TTCAATCATC TCGAGTCATA TCAAAATTAT
CCCATATCCA ACTTCAATTA TTGATATTTA ACTATCTTCA AGAATTGAGT CATAGCATTA
AATACTTCAA TATTCATCAC GTAAACTATC GCGATTCATT TCTTTGCATA CCATGTCACA
CCCTTATAAC CGAGCCATTA TCCATGAAGG CGAATACGTT CTGATTCCAG CACGGTTGCC
TCTCCCGATA AGTTCTACTG ACTCATTTCT TGTGCGGTCA CCTAGTTCTA ACACTCCTGT
AAGACTTAAC TCACCCAACT TGACTGTGGC AGATTTATCC ATAAATGACT CTGGTATCAG
ATCGGGAAGA AGTAGCCCCA TAGCAAATGG ACGGAATTCT CCGTCACAAA CAAACCATTC
ACGAGGTTCC GGCCCTACAG CAGCTGCAAA TGCTTCACCA ATCGGAAGTC CATTGGCTAA
TAACATCAAC ACCGCACATA GCAGTACCAA TGGTAGTAAC ATTGTAGAAA GACATAGTAA
CGCATACACT TCCAGCAATG ACTTGGACGA GATCTCGGAT TCCAATTCGC CTCCATTGCG
ATCGCAAACT CTGGCAGTTT CTGTAACTTC AACATTGCCT GATTCCACCG AATACACCCC
AGCACTTTTG CACAAGTGGA CTCATACCCA TTCCATTCTT TGTGTGGTCC CAGCTCCACA
AAAGAAGTTG ATTTTCTGCG GAACCCAAGA CTCAAAGATA TTGGTGTACG ACATGATTAA
CTATTCGTTA AAATACGAAG TCAACTGTGG CCAACAGAAT CACGCTTCTT CCGTTCTTAC
TTTGACGATT TCTGCTGATG AGAACCATCT CTTCAGTGCT GGTTCTGATT CCCTCGTTAA
AGTGTATGAC TTGTCAGAAA TCAAACCAGT TTCTGAAAGA GATTCTGACG ATACTGTGGA
AGGTGAGCTG CCTATCCGTT GTACGCACAT CATCTTCTCT CTGGTGGATA TCGGTGATAT
TTTCTCCATT GCCTGGTGCG ATCTGTTGTC CACAATATTC ATTGGAGCCC AGAACGCTTC
CATATTGTGG TGCCACCTTT CTCTCACTAG TACGGGCCAT GGTTCAAATA CTTCCAACGT
GGAACGCTTA CCTCACTTGC GTTACGACAA GTTCTTTGAC TCAAAGGGTC CTGGTGGATC
CATGAATACG CTTCAATCGA AACATCAACT ATTCAGAAAG TATTCCAGTA CCTCTCATTC
TTCACATAGT TCACCAAAAT TAGTAGAAGT GAAAAACGAG GATATCATTC GTTTTGCCCA
CAATGGTTAC GTCTACTGTA TGGACGTATT CCGCTGTCGT TTAAGTGACG GGCGAATGCT
GGATAAAGAC TTTAGCTTCC ATTATGCTGA CGATTTCGAG AATATCTTAG TTTCTTGTGG
TGGGGACGGA CTCGTCAAGA TATGGGGGAT TAACAGCACC GAGTCTGGAC TCAAGATTAC
CAGCGTAGAA TCTTTAGAAA ATGAAGAATC GGTTCTCTCT ATGTCAATTC AGGATTTCTA
CCTTTATGTT GGGTTGAGTG ACTCTACCAT CAATGTGTGG GATTTGATGA CTTCGCAGTT
GATTCGTTCA TTCCATTTCA CATCGGAAAA CGACGGTAAC TCCTCGTACG ATGAAGTGTT
AAGTCTTGGA ATATACAACG ACTGCATTTT CAAAGCATCT AACTTGGGTG GTCTTGTAAA
ATTCACTTTG AAGAGCTACC CGACGAAATC GCTAAGCCTA GATGAAGCAG CAAGATACGC
AAACGTTAAT CAAACTACTT TGGATAAACA TTCCACAGTG ATAATCTCAG ATGGGGCTGT
TCCTTACCAG CATGAAAGCA AATTGGGTGC TGTTTTGTCA GTCAAGATCT TCAAGGACAT
CTCAGGTTGT ACATATTTGC TTTCAGGAGG TAACAAAGCT CTTTGTCTCT GGGATATTAA
CAATGTAGGT TTGAAACACA ACGATCCACT GGGGTTGGTA ACTGATGATT CCGTACCCGA
TTCAACTGAA CAGTGTAGAT TGTCTAATGA CGAATTGCTC AAATCGTTAA ACAAATTCAT
TTCGTTCAAA ACCATCTCCA AGTTCCCGAC GCTCTATCTT GAAGATTCCC GTCATTGTGC
CCAATTCTTG TGTAACTTAT TGATCGACTT GGGCTCTAAG CAAACCAAGT TGCTACCAGT
AGCTGATGGT AACCCTATCG TGTATTCCAC TTTTACACGT AACAGTAAGA CAGCAACCGG
CAAACCCACA AGAGTCCTCT GGTATGCCCA TTATGACGTC GTTGATGCCA CTAATCATGA
AGCTGCTGAT TGGGAAACCG ATCCGTTTTT GTTAACTGCC CGTGATGGGA ACTTGTATGC
TCGTGGTGTA TCCGATAACA AGGGCCCTAT ATTGGCTAGT ATATATGCCG TAGCGGACTT
GTTTCTGAGA GAAGAATTGT CTTGCGATGT TGTATTCATC ATTGAAGGTG AAGAGGAGTG
CGGATCTATT GGATTCCAGA AAGTCATCAA CGAGAGCAAG TCTCTCATTG GGGATATCGA
CTGGGTAATG TTATCCAATT CATACTGGCT CGACGATGAA ACTCCATGTT TGAATTATGG
CTTAAGAGGT GTCATCAATG CAGCGGTAAC AATCAAGTCC GATAAGCCAG ACAGGCACTC
GGGTGTAGAT GGAGGTGTTC TGAAAGAACC AACTATGGAT TTGGTCCAGA TTGTGGGTCA
ATTGGTAGAT CCTATTACCA ACGAAATCAA GCTCGACGGC TTCTACGACG ATGTGTTGCC
ATTGACAGAA AGAGAAGTTC GTTTGTATCA GGACATCGAG CAAGCAGCAA CGATCAAGAA
CATGAACAAT CAAGATTTGA AGACATTGAT GGCCAAGTGG CGTAACCCAT CGTTGACTAT
ACACAAGATC CAGGTATCTG GTCCAAACAA CAACACTGTG ATTCCGCAAG TCGCCAAAGC
GACAATCTCT ATTAGAATCG TACCTAATCA GGATTTGGAA AAAGTCAAAC AGTCATTAAT
AGATCGCTTG ACAAAGGCTT TCGGTGCGCT TCAATCAGAA AACCGTATCC TGATCAATGT
GTTCCATGAA GCAGAGCCGT GGTTAGGAGA CCCATCAAAC TTGGTCTACT CCATCTTGTT
TAACAAAATC AAATCCAACT GGGGCCACGA GCCACTTTTC ATTCGTGAAG GGGGTTCTAT
TCCATCCATC AGATTTCTTG AAAAGTGCTT CAATGCTCCA GCAGCACAGA TTCCATGTGG
ACAGGCTTCA GACAATGCCC ACTTGAAGGA CGAAAAGTTG AGAATCCTCA ACTTGTACAA
GATGAGATCT ATTTTGACAG ATACCTTCTT GGAATTAGGT CAAGACAGAC AATAGACTAT
AGATACCTAG GTGGAATCTT GTGATAATTT AGATAGACTA CGCTTATACC ATTCACGCCA
AAATAGTTAT TTATTTAATA ATACCGACAT ATACG
 
Protein sequence
MSHPYNRAII HEGEYVLIPA RLPLPISSTD SFLVRSPSSN TPVRLNSPNL TVADLSINDS 
ALLHKWTHTH SILCVVPAPQ KKLIFCGTQD SKILVYDMIN YSLKYEVNCG QQNHASSVLT
LTISADENHL FSAGSDSLVK VYDLSEIKPV SERDSDDTVE GELPIRCTHI IFSLVDIGDI
FSIAWCDLLS TIFIGAQNAS ILWCHLSLTS TGHGSNTSNV ERLPHLRYDK FFDSKGPGGS
MNTLQSKHQL FRKYSSTSHS SHSSPKLVEV KNEDIIRFAH NGYVYCMDVF RCRLSDGRML
DKDFSFHYAD DFENILVSCG GDGLVKIWGI NSTESGLKIT SVESLENEES VLSMSIQDFY
LYVGLSDSTI NVWDLMTSQL IRSFHFTSEN DGNSSYDEVL SLGIYNDCIF KASNLGGLVK
FTLKSYPTKS LSLDEAARYA NVNQTTLDKH STVIISDGAV PYQHESKLGA VLSVKIFKDI
SGCTYLLSGG NKALCLWDIN NVGLKHNDPL GLVTDDSVPD STEQCRLSND ELLKSLNKFI
SFKTISKFPT LYLEDSRHCA QFLCNLLIDL GSKQTKLLPV ADGNPIVYST FTRNSKTATG
KPTRVLWYAH YDVVDATNHE AADWETDPFL LTARDGNLYA RGVSDNKGPI LASIYAVADL
FLREELSCDV VFIIEGEEEC GSIGFQKVIN ESKSLIGDID WVMLSNSYWL DDETPCLNYG
LRGVINAAVT IKSDKPDRHS GVDGGVLKEP TMDLVQIVGQ LVDPITNEIK LDGFYDDVLP
LTEREVRLYQ DIEQAATIKN MNNQDLKTLM AKWRNPSLTI HKIQVSGPNN NTVIPQVAKA
TISIRIVPNQ DLEKVKQSLI DRLTKAFGAL QSENRILINV FHEAEPWLGD PSNLVYSILF
NKIKSNWGHE PLFIREGGSI PSIRFLEKCF NAPAAQIPCG QASDNAHLKD EKLRILNLYK
MRSILTDTFL ELGQDRQ