Gene PICST_28317 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_28317 
SymbolAPN2 
ID4851093 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp910483 
End bp911931 
Gene Length1449 bp 
Protein Length482 aa 
Translation table 
GC content40% 
IMG OID640392801 
ProductAP endonuclease 
Protein accessionXP_001387814 
Protein GI126274079 
COG category[L] Replication, recombination and repair 
COG ID[COG0708] Exonuclease III 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.563607 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTTGG CCCAGCTCCA GAAGCTAGAC CCACTTGACA AGATTGCACT GAAGCTGCTG 
AATTTGACTA TTAGGTATGT CACGTTCAAT GTAAACGGAG TCAAAACACT TTTCAACTAC
TATCCTTGGA CACAATTCAA CCAAGATTAT GATCTTCTAT TCAGTTCACT CCAGGCTGAC
ATAATTACTC TTCAGGAGCT AAAGTTATCT TCTTCCAATA TTTCCAGTGT GAAAAACATC
GGCCACTTGC CTCATTATAA ATCGTTTATA TCTATACCAA AGGTGAAGCG AGGGTACAGT
GGAGTTGGAC TTTTCGTACG AATTCCAAGA GAAGAAGAAT CGTCAGCAGT GCGACGTAAC
TTACAAGTTA TTAAGGCTGA AGAGGGCATA ACAGGTTATT TATCCTCAGG ACTACTGGGC
GACGTCTGCT ACAGGGATCT ACCCGAGCTG GAAAGTATTG GAGGATATCC TGATGATTTG
GATTCTGTCA TGGGTCTAGA GCTAGACAGT GAAGGAAGAT GTGTTTGCAT AGAACTAGCT
TGCAACTTGG TGGTTTTCGC CTTGTATTGT CCTGCCAATT CTATGGGGGA AGATGAAGGA
GAAGCTTTCC GACTTAATTT CTTGAAGAAT CTCTTGAAGC GGTGCTACAA CTTGAAGTAC
AAACATGGCA AGGAAGTTAT TGTCATGGGA GATATAAACG TAAGTTTGGA CTTGATTGAC
AGTGCTGAGG GTATCGATGA TAGACAAAAA CAGAGGCTTG TTCTACCCAA GACAGATGGA
ATCGATTTTG AAACCATAAA CTACGATGAA TGTTTTAATT TCAAGAGATC AACACGAGCT
CGTGCCTTAT TGAATCAGTA CACCATTCAA TCTTTACAAC ATAACCTCTC CTTAGACCGA
CATCCAGACT ACGAGCAACA GTTTCTATAC GACACTACGC GATATCTACA AGGAAGACGA
ATGCAGATGT ATACAGTGTG GAATACCTTG AACAGTTCAC GTGCCATTAA TTTTGGTTCG
AGAATCGACC TTATCCTAGC CAGTAGCTAC AGAATGATTA AGAACATTTC CAATGCCGAT
ATCTGGCCAT TTATTCTTGG TTCTGATCAC TGTCCCGTTT TCACTGACTT CGAAGCATTA
GAAATTGATA AACCTGAAGA ATCGAAGTCA GCAAAACTTC ATTTTGAAGC CAAGTACCAT
CATAAACTCT CCCAAGTCAG AGATATATCT CTACTATTTT CCCGAAAGAG AACTTCTACT
CTGGAAAACA ATAGTCAAAA TTCTAGTCAA AATTCAGCTT CAGATGAAAC AGAAGTGAAA
CGAACAAAAC CAGAAGCTTC CAGTACAAAG CTGGCCTTCA AGTATGTTAG TCGTAAACCT
AAGAAGGTTG ATGGAACGAA GCCTATCAGT ACATTCTTCA CTCTCAACTC TACGAAGAAA
AGTTTGTAA
 
Protein sequence
MDLAQLQKLD PLDKIALKLL NLTIRYVTFN VNGVKTLFNY YPWTQFNQDY DLLFSSLQAD 
IITLQELKLS SSNISSVKNI GHLPHYKSFI SIPKVKRGYS GVGLFVRIPR EEESSAVRRN
LQVIKAEEGI TGYLSSGLLG DVCYRDLPEL ESIGGYPDDL DSVMGLELDS EGRCVCIELA
CNLVVFALYC PANSMGEDEG EAFRLNFLKN LLKRCYNLKY KHGKEVIVMG DINVSLDLID
SAEGIDDRQK QRLVLPKTDG IDFETINYDE CFNFKRSTRA RALLNQYTIQ SLQHNLSLDR
HPDYEQQFLY DTTRYLQGRR MQMYTVWNTL NSSRAINFGS RIDLILASSY RMIKNISNAD
IWPFILGSDH CPVFTDFEAL EIDKPEESKS AKLHFEAKYH HKLSQVRDIS LLFSRKRTST
LENNSQNSSQ NSASDETEVK RTKPEASSTK LAFKYVSRKP KKVDGTKPIS TFFTLNSTKK
SL