Gene Information |
Locus tag | PICST_81859 |
Symbol | DNL1 |
ID | 4837413 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 514266 |
End bp | 517359 |
Gene Length | 3094 bp |
Protein Length | 939 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640388728 |
Product | DNA ligase IV |
Protein accession | XP_001382860 |
Protein GI | 150864147 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1793] ATP-dependent DNA ligase |
TIGRFAM ID | [TIGR00574] DNA ligase I, ATP-dependent (dnl1) |
|
|
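The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it reproduces the listed Gene Length), and that the coding region consists of the 939 codons plus one stop codon. The function and variable names are illustrative only and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


# Values copied from the record above.
START_BP, END_BP = 514266, 517359
PROTEIN_LENGTH_AA = 939

span_bp = gene_length(START_BP, END_BP)
coding_bp = PROTEIN_LENGTH_AA * 3 + 3   # one codon per residue plus a stop codon
noncoding_bp = span_bp - coding_bp      # bases in the span that are not translated

print(span_bp, coding_bp, noncoding_bp)  # 3094 2820 274
```

The 274 bp difference between the gene span and the coding length is what one would expect if part of the annotated span is not translated, which fits the "Is gene spliced | Yes" field.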
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AACTAACGGA ATTATGTCAG CCACTCAGCC CACCAACTCT GGCTCTCACC TTGTGCACTG CCGCTGTTAG CTCCACTTGC TCGCACTCTG AACCTCCCTA GTAGCCAAAC GCTGTTTCCT GCACTGTCTG TCTTATCGTC CCAATAATAT TCTTATCTAA ATCGACTTCT TCCTTCCTTT GCAGATTGTG CATTGCAATT GCGCAGTTTC AATTTCTCTA ATTCCCTTCT TTCTGGCCAT GTCTTTCTTA GATAATGTCG AGCCTCCTAC CAATCTCCAG GAGCCGCAGT ACAAGTTTGT GGTGGATGAA CTTTTCACCC AACTAGACGG TGCCAACAGG GCCAACTTGG GCCATTTCAG AACCGTTTCC GATAAAAAGG TGTCAGTGAT CCAGACGTTT ATCAAGACCT TTCGGATTCA CATCGGTGAC AACATATTCC CCACGGCCCG CTTGGTGTTC CCCGATAAGG ACCGACGTCT CTACTACATC AGAGATGTAA CTCTAGCTAG ACTCATAGTC AAGATGTATT CCATACCTCC GGAATCGGAG GACTACAAAG TGTTGTACCA TTGGAAGCAT GGTTTCCAGA AGGAGAAAAG ATTCACAGTA GATTCAAATA ACTTGCGAGA TTTGCCGTTG CGGGCCTCGC GTATTATTGC TAATCGTAGA GAATCTACAG TTGCTACCAA CGATCAGAAC CAGCCAATCA CAACACGCCA TTCGTATACT GTAGCCGAAA TGAATTCCAA ATTGGACTCA TTGAATGATG CAAAGAAGTC TCAAGAACAG ATAGCCATCT TAAAGCCTTT GCTAGACTCT CTCTCTATTC CCGAAATCAG ATGGTTGCTT CATATAATTT TGAAAAAATC GATTCTAGTA CGATTTGAGA ACTACTTTCT CAGTGTATGG CACCCAGATG CTCCAGCCTT GTTCAAAGTG TGTAACAATC TCCAGAAGAC ATTCAACTAT CTCGTAAACC AAGAAACTAG GTTGAATAAG AATGATCTCA CTGTACACCC GACACTCCCA TTCCGACCTC AGTTGTCATT TAAGTTGACA AAAAACTATG ACAAGTTGAT CAAGGATATG TCGATGACAG TTCCCATGGA TGTGAACTTC CAGAAGATGT TCGCGGCTAA AGATCTTGCT GGAAAGTTCT ACATGGAAGA AAAGATGGAT GGCGACAGAA TGGTTTTACA CAAGCAAGGG AAGCACTTCA AATTTTATTC GAGAAGGTTG AAAGACTATT CCTTTCTTTA TGGCGAGAAC CTTGAAATCG GCTCTCTTAC CAAGTATTTA TCCAATGCAT TCCCCAGTAA AGTGGACTCG ATCATTTTGG ATGGAGAGAT GGTGGCATGG GATTTCAAGA GAAATGTAGT GCTACCATTT GGGACTTTGA AATCTTCGGC TATTCAGGAG TCCGTCAGAC AGTATACGAC TATCGATCAA TATGAACAGC AGTCTTCTTA TCCATTTTTC TTAGTGTTTG ATATTCTCCA TATCAATGGT ACCAATTTGA CGAATCATCC GTTATTCTTC AGAAAGGATA TCCTCCAGAA AATTATAAAC CCAATACCAC ATAGACTAGA GCTTCTTCCT TCGATAATTG GTTCAACTGC AGAAGATGTA CAAAGAGCAA TGAGAGAAGT AGTTAGTTCC AGAAGTGAAG GCATTGTAGT AAAGCATTTG CAGCTGAAGT ACTTCATTGG GGAAAGAAAC CCCCACTGGG TAAAGGTCAA ACCAGAGTAT CTTGAGAAGT TTGGTGAGAA TCTTGATTTG ACAGTAATAG GCAAAGTTCC 
TGGTGTGAAA ATCTCGTATA TGTGTGGCTT GAAAAATGAT GATGACCAGG TCTTCTACAG CTTCTGCACT GTTGCCAATG GGTTTACAGA AGACGAGTAT GATAAAATAG AAAGAATAAC CCATAACAAA TGGATAGCTT ACAAGGATCA ATTGCCTCCC TCAAAGGTGC TTAACTTTGG AGTAAAGAAA CCAATGCACT GGATCCACCC TCGGGATTCA GTTGTATTAG AGATAAAGGC TAGGTCTATT GATTGTACCG TTGAGAAAAC TTATGCGGTG GGAACTACAT TGCACAATCT CTACTGTCGT AGCATACGAG AAGACAAATC AATTCACGAT TGTACCACTA TTAGTGAGTA TAAGCAAATC AAAGCTAAGT ATTCTAAGGA TTTGGAGAAA TCTCACTCTG CCAACAAGAA GCGAAGAGTG TTACAGGATT CTTTTGTGCA AGAGCAACAA CCAAAACGGG TCAAAGTAGA GTCTGATCTA TTCAGCGGCT TTGATTTCGT TATTCTCAGT GACAACTTAA GTCGCAATGG AGATAGAATC ACAAGAGAAG AATTGATTAT ATTGGTAAAG AAATACGGCG GGAATATTAC CAATACGGTT CACAAATTAG GGACCAGGCA GACTATTGTA GTTACCGAGA GAGAATTGCC TACATGTAAA AGCTACTTTG ATAAGGGTAT CGACTTAGTC AGACCCAGTT GGCTCTTTGA GTGTATCAAC AGAGTGGCCA TTGTACCCCT TGAGCCTTAC TTTATCTTTG GATCCAAGAA TATGGCTGTG TTCAAGAACA GGCAAGATGA ATTTGGAGAC AGCTATGTTA TTCACCAAAC GGTAGAGTCG TGGGATCGAA ATAGATTCCA ACGTCTTCCC GAGACGGAAG TTGACATCTA TCGTAGAGAA TTTCTCGAGG ATGTTCTGGC TACAGAAGAG TCGTTTCCTC GTCGCTTTCT ATTTCACGAT ATCAAATTTC ATATAGTGTC AGTGCAGAGT ACGGAAAACT ATATGGTAGA ATTATTGCAG GATAGAATAG AGAGGTTTGG AGGGGAAGTA ATCGCTGCAC ATGCGGGGTG TTCGTTTATT GTTGTGTGTG GTGATTCAGA GAACAGGGAT AGAACTGAGG TTTTGAACAA GGTTAAGGCC ATCACTAAGG ATATCAGTGA AAAATTACAA TTTGAAGATG CTATTTTAAC CACAAAGATC CCTAGTGTTG TCACTGAAGC TTTTGTAAAG CATTGTATCA AGAGAAATGT ACTTCTTGAT TCAGACGACT ACAAGTACAT TTGA
|
Protein sequence | MSFLDNVEPP TNLQEPQYKF VVDELFTQLD GANRANLGHF RTVSDKKVSV IQTFIKTFRI HIGDNIFPTA RLVFPDKDRR LYYIRDVTLA RLIVKMYSIP PESEDYKVLY HWKHGFQKEK RFTVDSNNLR DLPLRASRII ANRRESTPIT TRHSYTVAEM NSKLDSLNDA KKSQEQIAIL KPLLDSLSIP EIRWLLHIIL KKSILVRFEN YFLSVWHPDA PALFKVCNNL QKTFNYLVNQ ETRLNKNDLT VHPTLPFRPQ LSFKLTKNYD KLIKDMSMTV PMDVNFQKMF AAKDLAGKFY MEEKMDGDRM VLHKQGKHFK FYSRRLKDYS FLYGENLEIG SLTKYLSNAF PSKVDSIILD GEMVAWDFKR NVVLPFGTLK SSAIQESVRQ YTTIDQYEQQ SSYPFFLVFD ILHINGTNLT NHPLFFRKDI LQKIINPIPH RLELLPSIIG STAEDVQRAM REVVSSRSEG IVVKHLQSKY FIGERNPHWV KVKPEYLEKF GENLDLTVIG KVPGVKISYM CGLKNDDDQV FYSFCTVANG FTEDEYDKIE RITHNKWIAY KDQLPPSKVL NFGVKKPMHW IHPRDSVVLE IKARSIDCTV EKTYAVGTTL HNLYCRSIRE DKSIHDCTTI SEYKQIKAKY SKDLEKSHSA NKKRRVLQDS FVQEQQPKRV KVESDLFSGF DFVILSDNLS RNGDRITREE LIILVKKYGG NITNTVHKLG TRQTIVVTER ELPTCKSYFD KGIDLVRPSW LFECINRVAI VPLEPYFIFG SKNMAVFKNR QDEFGDSYVI HQTVESWDRN RFQRLPETEV DIYRREFLED VSATEESFPR RFLFHDIKFH IVTENYMVEL LQDRIERFGG EVIAAHAGCS FIVVCGDSEN RDRTEVLNKV KAITKDISEK LQFEDAILTT KIPSVVTEAF VKHCIKRNVL LDSDDYKYI
|
| |
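The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below shows the effect on a 27-base fragment taken from the gene sequence above (GTAAAGCATTTGCAGCTGAAGTACTTC), which corresponds to the residues VKHLQSKYF in the protein sequence. The codon table is built by hand from the standard code with the single CTG change; the helper names are illustrative and are not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(aas: str) -> dict:
    """Map each of the 64 codons to its one-letter amino acid."""
    return {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), aas)}


TABLE_1 = codon_table(STANDARD_AAS)
TABLE_12 = dict(TABLE_1, CTG="S")  # table 12: CTG encodes Ser instead of Leu


def translate(dna: str, table: dict) -> str:
    """Translate complete codons in frame 0; spaces in the input are ignored."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# In-frame fragment from the gene sequence in this record.
fragment = "GTAAAGCATTTGCAGCTGAAGTACTTC"

print(translate(fragment, TABLE_12))  # VKHLQSKYF (matches the protein sequence)
print(translate(fragment, TABLE_1))   # VKHLQLKYF (standard code gives Leu at CTG)
```

Translating the full gene this way would also require removing the untranslated parts of the span first, since the gene is annotated as spliced.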