Gene PICST_14247 details


Gene Information

Locus tag: PICST_14247
Symbol: MIP1
ID: 4839632
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 341936
End bp: 345118
Gene Length: 3183 bp
Protein Length: 1061 aa
Translation table: 12
GC content: 42%
IMG OID: 640390947
Product: mitochondrial DNA polymerase catalytic subunit
Protein accession: XP_001385083
Protein GI: 150865746
COG category: [L] Replication, recombination and repair
COG ID: [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID:
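The coordinates and lengths in the record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the reported gene length; the function name `gene_length` is hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


# Values taken from the record above.
length = gene_length(341936, 345118)

# Split the length into whole codons and leftover bases.
codons, remainder = divmod(length, 3)

print(length, codons, remainder)  # 3183 1061 0
```

Under that assumption, the 3183 bp span divides into exactly 1061 codons, matching the reported protein length. This suggests the listed coordinates do not include a stop codon.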


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GAATACCACG AAGCTCCCCG TGTTAACCAG GTGGGGATCC AGTATCTTTC AGAAGGGCTT 
CACAAGAAAA TCTTTCCCTC TGAACCTGTG GATGCATACT TGAAGCCAGA AGAGCCTGAG
CTTTTGAAGA TAGCCAAACT GCATTTGAAA CACAACGATT TGCTCGACAA AAAGACCAAT
GTAGTGGAAC CTATAGATAT CAACAACTTC CCATCATTAG TGGGAAAAAC CTTGGATGAG
CATTTCTACA AGATAGGCAA GCGTAGCGCC GAGCCATATT TAACTATGGC GAATAAGCTT
TTTTCACAAG AACTTCCTCC AAGACCTGAA AACTGGTTAT TCGAACCAGG CTGGTACCGA
TATACTCCTG ATAAGGAGCC TGAACTGGTG CCGTATCCTT TGGAAGATGA GCTTGTGTTT
GATGTAGAGG TACTATACAA GAGGTCCAGT TATTCCGTGC TAGCCACTTG TGTTTCAACG
GAAGCTTGGT ACGGTTGGGT TTCGCCATTC TTAACAAACT ATGCCCAGGA TCCCTCATAC
AGCGACTGGG AACATCTAAT TCCTTTCAAC AGCTTGAAAA AGCCCAAATT GCTCGTTGGA
TTCAACGTTA GCTATGATCG ATCGAGAATC CTAGAAGAGT ACAACATCAG ACAGTCACAA
GCGTTTTTCC TTGACGGAAT GGCTCTCCAC GTTGCTATCA GCGGAATTTG CTCACAACAA
AGACCAAAAT GGCAACAGCA CAGGAAGAGC AAGAACCAGT TAGAATTCCT GGAAGAGTAC
GAAGAAGAAA TTGCGAAGAC AGAATTGGCA GGGTTCGGTG AAGAAATATC CAAGGAATTG
GCCAGTGACC TTGTAGATGA TCCATGGCTA GATCATGGTG CTCCCAACTC GTTGGCTAAT
GTTTTAGATT TTCATTGCAA TGTTAAATTA GACAAATCGC AACGAGACTT CTTTTCCTCT
GAAAATCCCA TGGATGTCAT AAGTAACTTC AACGATTTGA TGAACTACTG TGCTTCCGAT
GTAGATGCAA CTTTTGCAGT CACAAAGAAG CTCTTTCCTG AGTTTCTAGA AAAGGTTCCA
CATCCTGTTT CATTCGCTGC TTTAAGACAC TTGGGAACGT TGATCCTTCC GACTACCACG
AAATGGGACA ACTACATTTC TACAGCAGAA GAAATCTACC AGGATAATAG AGGTCAAGTT
TCAGCAAACT TGAGAACTCG TGCTATTGAA CTCATTCAGT ATATCGAGAA AAATGATCCG
AAGTTGAAAC CAAACTTGGA TGACGATCCC TGGTTAAGAC AGTTGAATTG GACTATCAAA
GAGCCTAGAT TAAAGAAAGA CGGAACACCG ACAGCCAAAC AGGCATTCAT GACAGGCTAT
CCAGAATGGT ACCGTGAACT CTTCAAAACG ACCGATGGCG CCACCAAACA GAGGGAGATG
AACTTGACTG TTCGTACTCG TATTACGCCT TTGCTTTTGC GACTCAAATG GGAAGGTTAT
CCATTGATAT GGACAGATTC TCAGGGCTGG TGTTTCAAGG TACCTTTTGA CGAAGAAATT
TTCACAAAAC TTGAAACTCA GAACTACATT AGAGCCCAAT TAAATCAGAA GGATCTAGAT
TTACTACTCC CAGAACTACG CGATAATGGC AACAGCTTTG AGTTGTTCAA AGTTCCACAC
CCAGATGGAG CTAAGAAGAG ATGTACTCAG ATTATGTCAA AGAGCTATTT GCGTTATTTT
GAAGATGGTA CACTTACTTC TGAGTACAAC TATGCTCAGG AGATTTTATC ATTGAATTCT
GCTGCTTCGT ATTGGATGGG TAACAGAACG AGGATTTCTG AACAGTTCGT TGTTTACAAT
GACAAGAGCG GCGAGAAGAA TAAGTTCTTT GATTCCAAGA AAGAAGCTAA GGATGCTACA
AACATGGGAA TTATCTTGCC TAGACTTTGT ACTATGGGTA CAATTACTAG AAGAGCTACA
GAAAACACAT GGCTCACTGC TTCGAACGCC AAAAAGAACC GTATAGGTTC GGAGTTGAAG
TCGTTGATAG AAGCTCCACC AGGGTACTGT TTCGTAGGCG CCGATGTAGA TTCGGAGGAA
TTATGGATTG CTTCCCTTGT AGGGGATTCC ATGTTTAAGA TCCATGGAGG CACAGCTCTT
GGATGGATGA ATTTGGAAGG CGACAAGCAC GAGAAGACAG ACTTGCATTC AAAAACGGCA
GAAATCCTAG GGATATCTAG AGGTGATGCT AAGGTTTTCA ACTATGGGAG AATATATGGA
GCAGGTGTTA AGTTTGCAAC TCGACTCTTG AAGCAGTTTA ATTCCAAGTT GTCTGAAGCG
GAGGCTGAAG AAACAGCAAA ACTTTTGTAT GCTCGTACTA AGGGAGAAAT GTCTTCATCG
AAATACTTGA AACGGAGACT TTACCATGGT GGAACCGAAT CTGTCATGTT CAATGCCTTG
GAAAGCATCG CCTACCAAAA GAATCCCAGA ACACCGGTAT TAGGAGCTGC TATCACTTCC
GCATTGACTA TTGACAACTT AAACAAGAAC AGCTACTTGA CGTCGAGAAT CAATTGGGCA
ATTCAAAGCT CTGGTGTGGA TTATTTACAT TTACTTGTGG TTTCAATGGA GTATTTGATC
GACAAGTACC AGATCGATGC CAGACTATCC ATTACTGTTC ACGATGAAAT CAGATATCTT
GTCAAAGAAA GTGATAAGTA CAAAGCTGCC TTGTTATTGC AGATCAGTAA TGTCTGGACT
AGAGCCATGT TCTGCGAACA ACTCGGCATC AAGGAAGTTC CACAATCGTG TGCTTTTTTC
TCTGAAGTTG ACATTGATCA TGTGCTCAGA AAAGAGGTGT CTATGGATTG TGTTACCCCT
TCAAATCCCA AAGCTATCCC AGTTGGCGAG TCGCTTGGGA TAAACCAGCT ACTCGAAGTT
TGTGGCAATG GAGACATATT GACCGATGGA AGCACACCTA AGAAACTAAA GCTTTCGAAC
TACAAGTATG AGAAGCGAGT TCCAGTCATG AAGAAGCTCG ACGAAGAATC GAATGCAATT
GCCAATATAG CTAAGATCAG ATTACAGAAC TCTATTGACA AGGAAGAATG GAGAAAGAAC
ATTGGCACAT ACATCAAACT GAAGAAGAAC TCAGACTTTG AAGAAGCACA ACGACAATTC
GAT
 
Protein sequence
EYHEAPRVNQ VGIQYLSEGL HKKIFPSEPV DAYLKPEEPE LLKIAKSHLK HNDLLDKKTN 
VVEPIDINNF PSLVGKTLDE HFYKIGKRSA EPYLTMANKL FSQELPPRPE NWLFEPGWYR
YTPDKEPESV PYPLEDELVF DVEVLYKRSS YSVLATCVST EAWYGWVSPF LTNYAQDPSY
SDWEHLIPFN SLKKPKLLVG FNVSYDRSRI LEEYNIRQSQ AFFLDGMALH VAISGICSQQ
RPKWQQHRKS KNQLEFSEEY EEEIAKTELA GFGEEISKEL ASDLVDDPWL DHGAPNSLAN
VLDFHCNVKL DKSQRDFFSS ENPMDVISNF NDLMNYCASD VDATFAVTKK LFPEFLEKVP
HPVSFAALRH LGTLILPTTT KWDNYISTAE EIYQDNRGQV SANLRTRAIE LIQYIEKNDP
KLKPNLDDDP WLRQLNWTIK EPRLKKDGTP TAKQAFMTGY PEWYRELFKT TDGATKQREM
NLTVRTRITP LLLRLKWEGY PLIWTDSQGW CFKVPFDEEI FTKLETQNYI RAQLNQKDLD
LLLPELRDNG NSFELFKVPH PDGAKKRCTQ IMSKSYLRYF EDGTLTSEYN YAQEILSLNS
AASYWMGNRT RISEQFVVYN DKSGEKNKFF DSKKEAKDAT NMGIILPRLC TMGTITRRAT
ENTWLTASNA KKNRIGSELK SLIEAPPGYC FVGADVDSEE LWIASLVGDS MFKIHGGTAL
GWMNLEGDKH EKTDLHSKTA EILGISRGDA KVFNYGRIYG AGVKFATRLL KQFNSKLSEA
EAEETAKLLY ARTKGEMSSS KYLKRRLYHG GTESVMFNAL ESIAYQKNPR TPVLGAAITS
ALTIDNLNKN SYLTSRINWA IQSSGVDYLH LLVVSMEYLI DKYQIDARLS ITVHDEIRYL
VKESDKYKAA LLLQISNVWT RAMFCEQLGI KEVPQSCAFF SEVDIDHVLR KEVSMDCVTP
SNPKAIPVGE SLGINQLLEV CGNGDILTDG STPKKLKLSN YKYEKRVPVM KKLDEESNAI
ANIAKIRLQN SIDKEEWRKN IGTYIKSKKN SDFEEAQRQF D
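The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the difference on a 30-nucleotide in-frame fragment copied from the gene sequence above (nucleotides 121-150). The helper `translate` and the table names are hypothetical, built by hand here rather than taken from any library.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third position varying fastest.
BASES = "TCAG"
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODONS = ["".join(c) for c in product(BASES, repeat=3)]
TABLE_1 = dict(zip(CODONS, STANDARD_AA))   # standard genetic code
TABLE_12 = dict(TABLE_1, CTG="S")          # table 12: CTG encodes Ser instead of Leu


def translate(dna: str, table: dict) -> str:
    """Translate an in-frame DNA string, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# Nucleotides 121-150 of the gene sequence, spaced as in the record.
fragment = "CTTTTGAAGA TAGCCAAACT GCATTTGAAA"

print(translate(fragment, TABLE_12))  # LLKIAKSHLK
print(translate(fragment, TABLE_1))   # LLKIAKLHLK
```

The table 12 result matches residues 41-50 of the protein sequence listed above, whereas the standard code would place a leucine at residue 47.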