Gene Information |
Locus tag | PICST_14247 |
Symbol | MIP1 |
ID | 4839632 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 341936 |
End bp | 345118 |
Gene Length | 3183 bp |
Protein Length | 1061 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640390947 |
Product | mitochondrial DNA polymerase catalytic subunit |
Protein accession | XP_001385083 |
Protein GI | 150865746 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | |
|
|
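The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption that is consistent with the reported Gene Length); the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 341936
end_bp = 345118

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

print(f"gene length: {gene_length} bp")
print(f"codons: {codons} (remainder {remainder})")
```

Under that assumption the span is 3183 bp, which divides evenly into 1061 codons, matching the Gene Length and Protein Length fields.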
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GAATACCACG AAGCTCCCCG TGTTAACCAG GTGGGGATCC AGTATCTTTC AGAAGGGCTT CACAAGAAAA TCTTTCCCTC TGAACCTGTG GATGCATACT TGAAGCCAGA AGAGCCTGAG CTTTTGAAGA TAGCCAAACT GCATTTGAAA CACAACGATT TGCTCGACAA AAAGACCAAT GTAGTGGAAC CTATAGATAT CAACAACTTC CCATCATTAG TGGGAAAAAC CTTGGATGAG CATTTCTACA AGATAGGCAA GCGTAGCGCC GAGCCATATT TAACTATGGC GAATAAGCTT TTTTCACAAG AACTTCCTCC AAGACCTGAA AACTGGTTAT TCGAACCAGG CTGGTACCGA TATACTCCTG ATAAGGAGCC TGAACTGGTG CCGTATCCTT TGGAAGATGA GCTTGTGTTT GATGTAGAGG TACTATACAA GAGGTCCAGT TATTCCGTGC TAGCCACTTG TGTTTCAACG GAAGCTTGGT ACGGTTGGGT TTCGCCATTC TTAACAAACT ATGCCCAGGA TCCCTCATAC AGCGACTGGG AACATCTAAT TCCTTTCAAC AGCTTGAAAA AGCCCAAATT GCTCGTTGGA TTCAACGTTA GCTATGATCG ATCGAGAATC CTAGAAGAGT ACAACATCAG ACAGTCACAA GCGTTTTTCC TTGACGGAAT GGCTCTCCAC GTTGCTATCA GCGGAATTTG CTCACAACAA AGACCAAAAT GGCAACAGCA CAGGAAGAGC AAGAACCAGT TAGAATTCCT GGAAGAGTAC GAAGAAGAAA TTGCGAAGAC AGAATTGGCA GGGTTCGGTG AAGAAATATC CAAGGAATTG GCCAGTGACC TTGTAGATGA TCCATGGCTA GATCATGGTG CTCCCAACTC GTTGGCTAAT GTTTTAGATT TTCATTGCAA TGTTAAATTA GACAAATCGC AACGAGACTT CTTTTCCTCT GAAAATCCCA TGGATGTCAT AAGTAACTTC AACGATTTGA TGAACTACTG TGCTTCCGAT GTAGATGCAA CTTTTGCAGT CACAAAGAAG CTCTTTCCTG AGTTTCTAGA AAAGGTTCCA CATCCTGTTT CATTCGCTGC TTTAAGACAC TTGGGAACGT TGATCCTTCC GACTACCACG AAATGGGACA ACTACATTTC TACAGCAGAA GAAATCTACC AGGATAATAG AGGTCAAGTT TCAGCAAACT TGAGAACTCG TGCTATTGAA CTCATTCAGT ATATCGAGAA AAATGATCCG AAGTTGAAAC CAAACTTGGA TGACGATCCC TGGTTAAGAC AGTTGAATTG GACTATCAAA GAGCCTAGAT TAAAGAAAGA CGGAACACCG ACAGCCAAAC AGGCATTCAT GACAGGCTAT CCAGAATGGT ACCGTGAACT CTTCAAAACG ACCGATGGCG CCACCAAACA GAGGGAGATG AACTTGACTG TTCGTACTCG TATTACGCCT TTGCTTTTGC GACTCAAATG GGAAGGTTAT CCATTGATAT GGACAGATTC TCAGGGCTGG TGTTTCAAGG TACCTTTTGA CGAAGAAATT TTCACAAAAC TTGAAACTCA GAACTACATT AGAGCCCAAT TAAATCAGAA GGATCTAGAT TTACTACTCC CAGAACTACG CGATAATGGC AACAGCTTTG AGTTGTTCAA AGTTCCACAC CCAGATGGAG CTAAGAAGAG ATGTACTCAG ATTATGTCAA AGAGCTATTT GCGTTATTTT GAAGATGGTA CACTTACTTC TGAGTACAAC TATGCTCAGG AGATTTTATC ATTGAATTCT 
GCTGCTTCGT ATTGGATGGG TAACAGAACG AGGATTTCTG AACAGTTCGT TGTTTACAAT GACAAGAGCG GCGAGAAGAA TAAGTTCTTT GATTCCAAGA AAGAAGCTAA GGATGCTACA AACATGGGAA TTATCTTGCC TAGACTTTGT ACTATGGGTA CAATTACTAG AAGAGCTACA GAAAACACAT GGCTCACTGC TTCGAACGCC AAAAAGAACC GTATAGGTTC GGAGTTGAAG TCGTTGATAG AAGCTCCACC AGGGTACTGT TTCGTAGGCG CCGATGTAGA TTCGGAGGAA TTATGGATTG CTTCCCTTGT AGGGGATTCC ATGTTTAAGA TCCATGGAGG CACAGCTCTT GGATGGATGA ATTTGGAAGG CGACAAGCAC GAGAAGACAG ACTTGCATTC AAAAACGGCA GAAATCCTAG GGATATCTAG AGGTGATGCT AAGGTTTTCA ACTATGGGAG AATATATGGA GCAGGTGTTA AGTTTGCAAC TCGACTCTTG AAGCAGTTTA ATTCCAAGTT GTCTGAAGCG GAGGCTGAAG AAACAGCAAA ACTTTTGTAT GCTCGTACTA AGGGAGAAAT GTCTTCATCG AAATACTTGA AACGGAGACT TTACCATGGT GGAACCGAAT CTGTCATGTT CAATGCCTTG GAAAGCATCG CCTACCAAAA GAATCCCAGA ACACCGGTAT TAGGAGCTGC TATCACTTCC GCATTGACTA TTGACAACTT AAACAAGAAC AGCTACTTGA CGTCGAGAAT CAATTGGGCA ATTCAAAGCT CTGGTGTGGA TTATTTACAT TTACTTGTGG TTTCAATGGA GTATTTGATC GACAAGTACC AGATCGATGC CAGACTATCC ATTACTGTTC ACGATGAAAT CAGATATCTT GTCAAAGAAA GTGATAAGTA CAAAGCTGCC TTGTTATTGC AGATCAGTAA TGTCTGGACT AGAGCCATGT TCTGCGAACA ACTCGGCATC AAGGAAGTTC CACAATCGTG TGCTTTTTTC TCTGAAGTTG ACATTGATCA TGTGCTCAGA AAAGAGGTGT CTATGGATTG TGTTACCCCT TCAAATCCCA AAGCTATCCC AGTTGGCGAG TCGCTTGGGA TAAACCAGCT ACTCGAAGTT TGTGGCAATG GAGACATATT GACCGATGGA AGCACACCTA AGAAACTAAA GCTTTCGAAC TACAAGTATG AGAAGCGAGT TCCAGTCATG AAGAAGCTCG ACGAAGAATC GAATGCAATT GCCAATATAG CTAAGATCAG ATTACAGAAC TCTATTGACA AGGAAGAATG GAGAAAGAAC ATTGGCACAT ACATCAAACT GAAGAAGAAC TCAGACTTTG AAGAAGCACA ACGACAATTC GAT
|
Protein sequence | EYHEAPRVNQ VGIQYLSEGL HKKIFPSEPV DAYLKPEEPE LLKIAKSHLK HNDLLDKKTN VVEPIDINNF PSLVGKTLDE HFYKIGKRSA EPYLTMANKL FSQELPPRPE NWLFEPGWYR YTPDKEPESV PYPLEDELVF DVEVLYKRSS YSVLATCVST EAWYGWVSPF LTNYAQDPSY SDWEHLIPFN SLKKPKLLVG FNVSYDRSRI LEEYNIRQSQ AFFLDGMALH VAISGICSQQ RPKWQQHRKS KNQLEFSEEY EEEIAKTELA GFGEEISKEL ASDLVDDPWL DHGAPNSLAN VLDFHCNVKL DKSQRDFFSS ENPMDVISNF NDLMNYCASD VDATFAVTKK LFPEFLEKVP HPVSFAALRH LGTLILPTTT KWDNYISTAE EIYQDNRGQV SANLRTRAIE LIQYIEKNDP KLKPNLDDDP WLRQLNWTIK EPRLKKDGTP TAKQAFMTGY PEWYRELFKT TDGATKQREM NLTVRTRITP LLLRLKWEGY PLIWTDSQGW CFKVPFDEEI FTKLETQNYI RAQLNQKDLD LLLPELRDNG NSFELFKVPH PDGAKKRCTQ IMSKSYLRYF EDGTLTSEYN YAQEILSLNS AASYWMGNRT RISEQFVVYN DKSGEKNKFF DSKKEAKDAT NMGIILPRLC TMGTITRRAT ENTWLTASNA KKNRIGSELK SLIEAPPGYC FVGADVDSEE LWIASLVGDS MFKIHGGTAL GWMNLEGDKH EKTDLHSKTA EILGISRGDA KVFNYGRIYG AGVKFATRLL KQFNSKLSEA EAEETAKLLY ARTKGEMSSS KYLKRRLYHG GTESVMFNAL ESIAYQKNPR TPVLGAAITS ALTIDNLNKN SYLTSRINWA IQSSGVDYLH LLVVSMEYLI DKYQIDARLS ITVHDEIRYL VKESDKYKAA LLLQISNVWT RAMFCEQLGI KEVPQSCAFF SEVDIDHVLR KEVSMDCVTP SNPKAIPVGE SLGINQLLEV CGNGDILTDG STPKKLKLSN YKYEKRVPVM KKLDEESNAI ANIAKIRLQN SIDKEEWRKN IGTYIKSKKN SDFEEAQRQF D
|
| |
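The record lists translation table 12 (the alternative yeast nuclear code), which differs from the standard code in reading the codon CTG as serine rather than leucine. The sketch below is illustrative only: it builds a codon table in plain Python (the helper names `codon_table` and `translate` are hypothetical, not part of any database tool) and translates the first 150 bases of the gene sequence as displayed, which is already in coding orientation even though the gene lies on the minus strand.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(cug_as_serine):
    """Return a codon -> amino acid map; optionally apply the table 12 change."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    if cug_as_serine:
        table["CTG"] = "S"  # table 12: CUG encodes Ser instead of Leu
    return table


def translate(dna, table):
    """Translate a DNA string in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 150 bases of the gene sequence, copied from the record.
FIRST_150 = """
GAATACCACG AAGCTCCCCG TGTTAACCAG GTGGGGATCC AGTATCTTTC
AGAAGGGCTT CACAAGAAAA TCTTTCCCTC TGAACCTGTG GATGCATACT
TGAAGCCAGA AGAGCCTGAG CTTTTGAAGA TAGCCAAACT GCATTTGAAA
"""

table12_peptide = translate(FIRST_150, codon_table(cug_as_serine=True))
standard_peptide = translate(FIRST_150, codon_table(cug_as_serine=False))

print(table12_peptide)
print(standard_peptide)
```

With the table 12 setting, the result agrees with the first 50 residues of the protein sequence in the record, including the serine in `LLKIAKSHLK`; under the standard code that position would read as leucine.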