Gene PICST_30161 details

Gene Information

Locus tag: PICST_30161
Symbol:
ID: 4837264
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 2104136
End bp: 2105467
Gene Length: 1332 bp
Protein Length: 379 aa
Translation table: 12
GC content: 43%
IMG OID: 640388579
Product: predicted protein
Protein accession: XP_001382630
Protein GI: 150863969
COG category: [L] Replication, recombination and repair
COG ID: [COG0648] Endonuclease IV
TIGRFAM ID: [TIGR00587] apurinic endonuclease (APN1)
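
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the coordinates are 1-based and inclusive, and that the reported gene length runs from start codon to stop codon with any introns included. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information entries above.
start_bp = 2104136
end_bp = 2105467
gene_length_bp = 1332
protein_length_aa = 379

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# The remainder would be non-coding (intronic) sequence, which fits
# the "Is gene spliced" entry being Yes.
noncoding_bp = gene_length_bp - coding_bp

print(span_bp, coding_bp, noncoding_bp)  # 1332 1140 192
```

Under these assumptions the coordinate span matches the reported gene length, and about 192 bp of the gene would be intronic.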


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTCCTA AAGTGTAAGT TCGTGTTTAA TTTAAGTACT GGGCAGTAGA AATGCCGCAA 
TTGTGTAACT GATACGAGAA GATGGTCTCC ATATTCTATA TTTCACGCCT CTGCAAGTTT
TCTGAAGTGA AAGCTTCTCT ATGAAAATGC ATTGTATTCT TGCGTTCAGT TTTCCTCTTC
TCTTGTACAT ACTAACAGGT TTCAAGTAAA ACTGAGGTAG CCAAAGCTGT CAAGGAAACT
GTCAAAAAGA CGGTCAGAGC CAAAAAGGTT AAGAAGGTCG AGTTCAAGTA TGAAAGACTG
ACTACGAGCA AATTCAAATT TGGAGCTCAC GTAGGTGCAT CTGGTGGAGT TTCAAATTCC
GTCATCAATG CCAGAAACCT TGGTGCTAAC AGTTTTGCCT TGTTTTTGAA GTCTCCTAGG
AAATGGGTGA GTCCCCCCAT CTCTGCTGAA GAAATTGACA AATTCAAGCT GTTGTGCGAA
GAACACGGAT ATGATCCCAG AACCGACGTT TTACCTCATG GATCGTATTT CATTAACTTA
GCCAATCCAG ACCCTGAAAA GGAGGAAAAA GCGTTTGACG GATTTCTAGA TGATTTGCAC
AGATGTGAAC AATTGAACAT TGGATTATAT AACTTTCATC CAGGCTCGAG TTTAGACGGG
GACCATAGGG AAGCTCTTGA GAGGTTGGCC AAAAACATCA ACAGGGCTAT TAAAGAAACA
AGCTTTGTCA AAATCGTGAT TGAAAATATG GCTGGCCACG GTAACTTGAT CGGATCCAAT
CTACAAGATA TCAGAGACGT CATAGACATC GTAGAGGACA AGCTGAGAGT CGGAGTTTGC
GTCGACACTT GCCACACATT TGCTGCTGGG TACGATATTT CGACTGAGGA GAAGTTTGAA
GCGTTCTGGA AGGAGTTTGA CAACATTGTG GGTGCCGAAT TCTTGAGTGC CATCCATTTG
AATGACTCCA AAGCTCCTTT GGGTGCCAAC AGAGATTTAC ACCAATTCTT GGGACAAGGA
TTTTTGGGCT TGGAAGCATT CAGAGTCGTG GCTAACTCGC CCAGATTGCA CAATATTCCC
ATCATCTTGG AAACCCCCGT AGGTAACGAC GATAGTTACT ATGGAGAAGA GATTAAACTC
TTGGAACTAT TAGAGGATAA AACCATTGAC GACACGGAGT TTGTAGAGAA GAAAGAAAAG
CTCTCGAAGT TGGGAGCAAA AGAGAGATCC GAGCATGAGA AGAAGTTCGA GACCAAGAAG
GCTAAGACGG CTAAGAAGAC TGCTGGTGAT GATATCGCTT CGTTGGTTAC AAAGAGACCC
AAAAGGAAGT AG
 
Protein sequence
MPPKVKTEVA KAVKETVKKT VRAKKVKKVE FKYERSTTSK FKFGAHVGAS GGVSNSVINA 
RNLGANSFAL FLKSPRKWVS PPISAEEIDK FKSLCEEHGY DPRTDVLPHG SYFINLANPD
PEKEEKAFDG FLDDLHRCEQ LNIGLYNFHP GSSLDGDHRE ALERLAKNIN RAIKETSFVK
IVIENMAGHG NLIGSNLQDI RDVIDIVEDK SRVGVCVDTC HTFAAGYDIS TEEKFEAFWK
EFDNIVGAEF LSAIHLNDSK APLGANRDLH QFLGQGFLGL EAFRVVANSP RLHNIPIILE
TPVGNDDSYY GEEIKLLELL EDKTIDDTEF VEKKEKLSKL GAKERSEHEK KFETKKAKTA
KKTAGDDIAS LVTKRPKRK
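
The record lists translation table 12 (NCBI's alternative yeast nuclear code), which reads the codon CTG as serine rather than leucine. A minimal sketch of that difference follows, using six codons from the gene sequence above that correspond to residues ERSTTS in the protein sequence. The reading frame is an assumption inferred by matching the two sequences, and the helper `translate` is hypothetical, not part of any database tooling.

```python
from itertools import product

BASES = "TCAG"
# All 64 codons in NCBI's conventional TCAG order (TTT, TTC, TTA, ...).
CODONS = ["".join(c) for c in product(BASES, repeat=3)]

# Amino acids of the standard code (table 1) in the same order.
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
TABLE_1 = dict(zip(CODONS, STANDARD))

# Table 12 differs from table 1 in one amino-acid assignment: CTG -> Ser.
TABLE_12 = dict(TABLE_1, CTG="S")


def translate(dna, table):
    """Translate an in-frame DNA string codon by codon."""
    return "".join(table[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# Assumed in-frame fragment: GAA AGA CTG ACT ACG AGC
fragment = "GAAAGACTGACTACGAGC"

print(translate(fragment, TABLE_12))  # ERSTTS
print(translate(fragment, TABLE_1))   # ERLTTS
```

Only the table 12 result agrees with the listed protein sequence, so translating this gene with the standard code would substitute leucine at every CTG codon.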