Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_28317 |
Symbol | APN2 |
ID | 4851093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 910483 |
End bp | 911931 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | |
GC content | 40% |
IMG OID | 640392801 |
Product | AP endonuclease |
Protein accession | XP_001387814 |
Protein GI | 126274079 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0708] Exonuclease III |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.563607 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTTGG CCCAGCTCCA GAAGCTAGAC CCACTTGACA AGATTGCACT GAAGCTGCTG AATTTGACTA TTAGGTATGT CACGTTCAAT GTAAACGGAG TCAAAACACT TTTCAACTAC TATCCTTGGA CACAATTCAA CCAAGATTAT GATCTTCTAT TCAGTTCACT CCAGGCTGAC ATAATTACTC TTCAGGAGCT AAAGTTATCT TCTTCCAATA TTTCCAGTGT GAAAAACATC GGCCACTTGC CTCATTATAA ATCGTTTATA TCTATACCAA AGGTGAAGCG AGGGTACAGT GGAGTTGGAC TTTTCGTACG AATTCCAAGA GAAGAAGAAT CGTCAGCAGT GCGACGTAAC TTACAAGTTA TTAAGGCTGA AGAGGGCATA ACAGGTTATT TATCCTCAGG ACTACTGGGC GACGTCTGCT ACAGGGATCT ACCCGAGCTG GAAAGTATTG GAGGATATCC TGATGATTTG GATTCTGTCA TGGGTCTAGA GCTAGACAGT GAAGGAAGAT GTGTTTGCAT AGAACTAGCT TGCAACTTGG TGGTTTTCGC CTTGTATTGT CCTGCCAATT CTATGGGGGA AGATGAAGGA GAAGCTTTCC GACTTAATTT CTTGAAGAAT CTCTTGAAGC GGTGCTACAA CTTGAAGTAC AAACATGGCA AGGAAGTTAT TGTCATGGGA GATATAAACG TAAGTTTGGA CTTGATTGAC AGTGCTGAGG GTATCGATGA TAGACAAAAA CAGAGGCTTG TTCTACCCAA GACAGATGGA ATCGATTTTG AAACCATAAA CTACGATGAA TGTTTTAATT TCAAGAGATC AACACGAGCT CGTGCCTTAT TGAATCAGTA CACCATTCAA TCTTTACAAC ATAACCTCTC CTTAGACCGA CATCCAGACT ACGAGCAACA GTTTCTATAC GACACTACGC GATATCTACA AGGAAGACGA ATGCAGATGT ATACAGTGTG GAATACCTTG AACAGTTCAC GTGCCATTAA TTTTGGTTCG AGAATCGACC TTATCCTAGC CAGTAGCTAC AGAATGATTA AGAACATTTC CAATGCCGAT ATCTGGCCAT TTATTCTTGG TTCTGATCAC TGTCCCGTTT TCACTGACTT CGAAGCATTA GAAATTGATA AACCTGAAGA ATCGAAGTCA GCAAAACTTC ATTTTGAAGC CAAGTACCAT CATAAACTCT CCCAAGTCAG AGATATATCT CTACTATTTT CCCGAAAGAG AACTTCTACT CTGGAAAACA ATAGTCAAAA TTCTAGTCAA AATTCAGCTT CAGATGAAAC AGAAGTGAAA CGAACAAAAC CAGAAGCTTC CAGTACAAAG CTGGCCTTCA AGTATGTTAG TCGTAAACCT AAGAAGGTTG ATGGAACGAA GCCTATCAGT ACATTCTTCA CTCTCAACTC TACGAAGAAA AGTTTGTAA
|
Protein sequence | MDLAQLQKLD PLDKIALKLL NLTIRYVTFN VNGVKTLFNY YPWTQFNQDY DLLFSSLQAD IITLQELKLS SSNISSVKNI GHLPHYKSFI SIPKVKRGYS GVGLFVRIPR EEESSAVRRN LQVIKAEEGI TGYLSSGLLG DVCYRDLPEL ESIGGYPDDL DSVMGLELDS EGRCVCIELA CNLVVFALYC PANSMGEDEG EAFRLNFLKN LLKRCYNLKY KHGKEVIVMG DINVSLDLID SAEGIDDRQK QRLVLPKTDG IDFETINYDE CFNFKRSTRA RALLNQYTIQ SLQHNLSLDR HPDYEQQFLY DTTRYLQGRR MQMYTVWNTL NSSRAINFGS RIDLILASSY RMIKNISNAD IWPFILGSDH CPVFTDFEAL EIDKPEESKS AKLHFEAKYH HKLSQVRDIS LLFSRKRTST LENNSQNSSQ NSASDETEVK RTKPEASSTK LAFKYVSRKP KKVDGTKPIS TFFTLNSTKK SL
|
| |