Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_75738 |
Symbol | XPA1 |
ID | 4837509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 139241 |
End bp | 141784 |
Gene Length | 2544 bp |
Protein Length | 710 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640388824 |
Product | X-Pro aminopeptidase |
Protein accession | XP_001382258 |
Protein GI | 150863698 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.511861 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGTTGCTAAC AGTTGCTAGT TGTTCAGAGT CCAATCAGAA CTCCGAACGT ATCAGAAAAG TTTCATTTGA GCACAGTGTT GTAATAAGAA TTGTCTAGCC AAGAGAAGGT ATTAGAAGAG TATTATTCAG CTCAAAGTTG AAGAGTGAGA TATTGTAAGA GAACTACGTC ACTATACAAC AATATCTATA TAAAAATCAC TATTCAATAT CCACACAACA ACATCTATAC GCTTATAGCT ATTCTCGCAA CAATATCGCC AATATCATTA CTATCATAAT TGGTTGACTG TCTTGCTGAA TTATCTTGTT AACGTCTACT ATACTGTCTT CTATTCGCGC TATGTCTACC AGAATCTCTA GGGGTCAGTG CAACAACTGT ACCTGTTCGC CAGGATTGTT GTCGCGTACC AATAGAAGAT CGTCCCTCTT TGCCCAGAAG ATCAACATCA ACAGAACCAA CTCCGGGATC GGTGGTTCTA GAGCTGGTTC CAGAAAAGGA TCTGTCTTCA CTATTGATCC TGCTACCTTG TGTTTGCCTG AAATTAAGGA AACAAATACC TCGCAGCGTT TGGAAGCGTT GAGAAACAAA ATGAAGGACC ACAACTTGGC AGTGTACATT GTTCCATCGG AAGACCAACA CCAGTCTGAG TACGTCTCGG CGTTTGACCA GAAGAGATCA TTCATCTCTG GATTTGGAGG CTCTGCTGGT GTAGCCGTAG TTACTCGCGA CTTATTGTGT ATGAACGATG TGCCTGAAGG ATCAGCCGCT CTTTCCACGG ACGGGAGATA CTTCAACCAA GCTACCAACG AGTTAGACTT CAACTGGATC TTGTTGAAAC AGGGTGCTAA GGATCAACCA ACCTGGGAAG AATGGGCTGT AGAACAGGCC ATTCAGTTAT CTTTGGACAG TGGCTCCAAA GCCAATGTCG GTGTCGATCC TAGACTCATC TCTTACAAGT TGTACCAGAA GATCTCGGGA ATTGTTGAAA AAGCTTTGGA AAAACACCTG AACAAGAAGA TTCAGATCGA ACTTGTGGCT GTTACTGAAA ACTTGATTGG TTCCATTTGG GAAAAGTTTG AGCCTTTACC TCCAAGGGCA TCTCTGTCGC GTATCAAAAT CTTGGATACC AAATTTACTG GTGAGCAAGT TGCTGACAAG TTGAATAGAG TCAAGCAGCA GACTTTCAAA GAAAATGTAG GAGGTTTGGT TGTTACTGCT TTGGACGAAA TCGCCTGGTT GTTAAACTTG AGGGGGCAGG ATATCGAATA CAACCCTGTT TTCTTTAGTT TCTTGGTGAT AACCAAGAGT AATGGCACCA CTTTATTCAT TCAAAAGTCT AGATTGACTG CAGATATATT GGCCCTTCTC GAAGCTAATA ATATCCAGGT TGAACCTTAT GAATCTTTCT ACTCCAGATT GTCGTCAATT TCTAAAGACT TCAGTATTGC TAATCAACTG TTCTTGATTC CATCCAATGC CAACTGGGAA GTACTCAGAA ACTTGAAATG TTCTTTCACT CAAGGTTTGT CTCCAATTGA AGATTTGAAG TCTGTCAAGA ATGCAACAGA ACTTTTGGGT GCAAAGATTG CCCATTTGAA AGATGGTAGA GCATTGGTAA GATTCTTTGC ATGGTTGGAA GAGCAGGTAG TGGATAGACA AGAGTTGATC GACGAATGTG CTGCTGATGA CAAGTTAACT GAGTTTAGGA GCCAGGAAGA AAACTTTGTT GGCCTTTCTT TCGCTACCAT CAGTGCCACT GGTGCCAATG GTGCGGTTAT TCACTATAAG CCTACTAAGG GCCAATGTGC TACTATAAAT CCATTGAAGA TCTACTTGAA CGACTCAGGA TCGCAATTTC TTGAGGGTAC TACTGATACT ACTCGTACAA TCCACTTCGG CAAGCCTACC TATGAAGAAA TCAAGCGTTA CACTTTGGTA TTAAAAGGTA ATATTGCGCT ATCAACTTTA AAATTCCCAG AAAACACCAC CGGTAACTTA ATTGACTCCA TTGCAAGACA ATATCTCTGG AAGTTTGGTT TGGACTACGG CCATGGAACT TCTCACGGAG TTGGCGCATA TTTAAATGTC CACGAAGGAC CAATTGGTAT TGGTCCTAGA CCCAATGCTG CTGCACATGC CTTGAAGCCA GGTCAGTTGA TTTCTAATGA ACCTGGCTAT TATGAAGATG GTGAGTACGG TATTCGTTTG GAAAACATGA TGTATATCAA GGATAGCGGC TTGTCGTACA ATGGTAGACA ATTCTGGGAT TTTGAGACTG TCACCAGAGT TCCTTTCTGT AGAAAGTTAA TTAATGTGGA CATGTTGGAC GAAGAAGAAT TGGCATGGCT TAACGCTTAT CACAATACAA TCTGGAACGA ATTGCATGAA ACATTTGACA AAAACAGCTA CGTCTACAAG TGGTTGAGAC GAGAAACTGA CCAGATCGTC AGACACTCTC ATAAGTTATT GTGAAGACTA ATAGATCCAA AATGTGGTCT CTTTGTGTAT AGATTGATAT GACTAATTGA AGGTGACTTT ATAA
|
Protein sequence | MSTRISRGQC NNCTCSPGLL SRTNRRSSLF AQKININRTN SGIGGSRAGS RKGSVFTIDP ATLCLPEIKE TNTSQRLEAL RNKMKDHNLA VYIVPSEDQH QSEYVSAFDQ KRSFISGFGG SAGVAVVTRD LLCMNDVPEG SAALSTDGRY FNQATNELDF NWILLKQGAK DQPTWEEWAV EQAIQLSLDS GSKANVGVDP RLISYKLYQK ISGIVEKALE KHSNKKIQIE LVAVTENLIG SIWEKFEPLP PRASSSRIKI LDTKFTGEQV ADKLNRVKQQ TFKENVGGLV VTALDEIAWL LNLRGQDIEY NPVFFSFLVI TKSNGTTLFI QKSRLTADIL ALLEANNIQV EPYESFYSRL SSISKDFSIA NQSFLIPSNA NWEVLRNLKC SFTQGLSPIE DLKSVKNATE LLGAKIAHLK DGRALVRFFA WLEEQVVDRQ ELIDECAADD KLTEFRSQEE NFVGLSFATI SATGANGAVI HYKPTKGQCA TINPLKIYLN DSGSQFLEGT TDTTRTIHFG KPTYEEIKRY TLVLKGNIAL STLKFPENTT GNLIDSIARQ YLWKFGLDYG HGTSHGVGAY LNVHEGPIGI GPRPNAAAHA LKPGQLISNE PGYYEDGEYG IRLENMMYIK DSGLSYNGRQ FWDFETVTRV PFCRKLINVD MLDEEELAWL NAYHNTIWNE LHETFDKNSY VYKWLRRETD QIVRHSHKLL
|
| |