Gene PICST_81859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_81859 
SymbolDNL1 
ID4837413 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp514266 
End bp517359 
Gene Length3094 bp 
Protein Length939 aa 
Translation table12 
GC content41% 
IMG OID640388728 
ProductDNA ligase IV 
Protein accessionXP_001382860 
Protein GI150864147 
COG category[L] Replication, recombination and repair 
COG ID[COG1793] ATP-dependent DNA ligase 
TIGRFAM ID[TIGR00574] DNA ligase I, ATP-dependent (dnl1) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AACTAACGGA ATTATGTCAG CCACTCAGCC CACCAACTCT GGCTCTCACC TTGTGCACTG 
CCGCTGTTAG CTCCACTTGC TCGCACTCTG AACCTCCCTA GTAGCCAAAC GCTGTTTCCT
GCACTGTCTG TCTTATCGTC CCAATAATAT TCTTATCTAA ATCGACTTCT TCCTTCCTTT
GCAGATTGTG CATTGCAATT GCGCAGTTTC AATTTCTCTA ATTCCCTTCT TTCTGGCCAT
GTCTTTCTTA GATAATGTCG AGCCTCCTAC CAATCTCCAG GAGCCGCAGT ACAAGTTTGT
GGTGGATGAA CTTTTCACCC AACTAGACGG TGCCAACAGG GCCAACTTGG GCCATTTCAG
AACCGTTTCC GATAAAAAGG TGTCAGTGAT CCAGACGTTT ATCAAGACCT TTCGGATTCA
CATCGGTGAC AACATATTCC CCACGGCCCG CTTGGTGTTC CCCGATAAGG ACCGACGTCT
CTACTACATC AGAGATGTAA CTCTAGCTAG ACTCATAGTC AAGATGTATT CCATACCTCC
GGAATCGGAG GACTACAAAG TGTTGTACCA TTGGAAGCAT GGTTTCCAGA AGGAGAAAAG
ATTCACAGTA GATTCAAATA ACTTGCGAGA TTTGCCGTTG CGGGCCTCGC GTATTATTGC
TAATCGTAGA GAATCTACAG TTGCTACCAA CGATCAGAAC CAGCCAATCA CAACACGCCA
TTCGTATACT GTAGCCGAAA TGAATTCCAA ATTGGACTCA TTGAATGATG CAAAGAAGTC
TCAAGAACAG ATAGCCATCT TAAAGCCTTT GCTAGACTCT CTCTCTATTC CCGAAATCAG
ATGGTTGCTT CATATAATTT TGAAAAAATC GATTCTAGTA CGATTTGAGA ACTACTTTCT
CAGTGTATGG CACCCAGATG CTCCAGCCTT GTTCAAAGTG TGTAACAATC TCCAGAAGAC
ATTCAACTAT CTCGTAAACC AAGAAACTAG GTTGAATAAG AATGATCTCA CTGTACACCC
GACACTCCCA TTCCGACCTC AGTTGTCATT TAAGTTGACA AAAAACTATG ACAAGTTGAT
CAAGGATATG TCGATGACAG TTCCCATGGA TGTGAACTTC CAGAAGATGT TCGCGGCTAA
AGATCTTGCT GGAAAGTTCT ACATGGAAGA AAAGATGGAT GGCGACAGAA TGGTTTTACA
CAAGCAAGGG AAGCACTTCA AATTTTATTC GAGAAGGTTG AAAGACTATT CCTTTCTTTA
TGGCGAGAAC CTTGAAATCG GCTCTCTTAC CAAGTATTTA TCCAATGCAT TCCCCAGTAA
AGTGGACTCG ATCATTTTGG ATGGAGAGAT GGTGGCATGG GATTTCAAGA GAAATGTAGT
GCTACCATTT GGGACTTTGA AATCTTCGGC TATTCAGGAG TCCGTCAGAC AGTATACGAC
TATCGATCAA TATGAACAGC AGTCTTCTTA TCCATTTTTC TTAGTGTTTG ATATTCTCCA
TATCAATGGT ACCAATTTGA CGAATCATCC GTTATTCTTC AGAAAGGATA TCCTCCAGAA
AATTATAAAC CCAATACCAC ATAGACTAGA GCTTCTTCCT TCGATAATTG GTTCAACTGC
AGAAGATGTA CAAAGAGCAA TGAGAGAAGT AGTTAGTTCC AGAAGTGAAG GCATTGTAGT
AAAGCATTTG CAGCTGAAGT ACTTCATTGG GGAAAGAAAC CCCCACTGGG TAAAGGTCAA
ACCAGAGTAT CTTGAGAAGT TTGGTGAGAA TCTTGATTTG ACAGTAATAG GCAAAGTTCC
TGGTGTGAAA ATCTCGTATA TGTGTGGCTT GAAAAATGAT GATGACCAGG TCTTCTACAG
CTTCTGCACT GTTGCCAATG GGTTTACAGA AGACGAGTAT GATAAAATAG AAAGAATAAC
CCATAACAAA TGGATAGCTT ACAAGGATCA ATTGCCTCCC TCAAAGGTGC TTAACTTTGG
AGTAAAGAAA CCAATGCACT GGATCCACCC TCGGGATTCA GTTGTATTAG AGATAAAGGC
TAGGTCTATT GATTGTACCG TTGAGAAAAC TTATGCGGTG GGAACTACAT TGCACAATCT
CTACTGTCGT AGCATACGAG AAGACAAATC AATTCACGAT TGTACCACTA TTAGTGAGTA
TAAGCAAATC AAAGCTAAGT ATTCTAAGGA TTTGGAGAAA TCTCACTCTG CCAACAAGAA
GCGAAGAGTG TTACAGGATT CTTTTGTGCA AGAGCAACAA CCAAAACGGG TCAAAGTAGA
GTCTGATCTA TTCAGCGGCT TTGATTTCGT TATTCTCAGT GACAACTTAA GTCGCAATGG
AGATAGAATC ACAAGAGAAG AATTGATTAT ATTGGTAAAG AAATACGGCG GGAATATTAC
CAATACGGTT CACAAATTAG GGACCAGGCA GACTATTGTA GTTACCGAGA GAGAATTGCC
TACATGTAAA AGCTACTTTG ATAAGGGTAT CGACTTAGTC AGACCCAGTT GGCTCTTTGA
GTGTATCAAC AGAGTGGCCA TTGTACCCCT TGAGCCTTAC TTTATCTTTG GATCCAAGAA
TATGGCTGTG TTCAAGAACA GGCAAGATGA ATTTGGAGAC AGCTATGTTA TTCACCAAAC
GGTAGAGTCG TGGGATCGAA ATAGATTCCA ACGTCTTCCC GAGACGGAAG TTGACATCTA
TCGTAGAGAA TTTCTCGAGG ATGTTCTGGC TACAGAAGAG TCGTTTCCTC GTCGCTTTCT
ATTTCACGAT ATCAAATTTC ATATAGTGTC AGTGCAGAGT ACGGAAAACT ATATGGTAGA
ATTATTGCAG GATAGAATAG AGAGGTTTGG AGGGGAAGTA ATCGCTGCAC ATGCGGGGTG
TTCGTTTATT GTTGTGTGTG GTGATTCAGA GAACAGGGAT AGAACTGAGG TTTTGAACAA
GGTTAAGGCC ATCACTAAGG ATATCAGTGA AAAATTACAA TTTGAAGATG CTATTTTAAC
CACAAAGATC CCTAGTGTTG TCACTGAAGC TTTTGTAAAG CATTGTATCA AGAGAAATGT
ACTTCTTGAT TCAGACGACT ACAAGTACAT TTGA
 
Protein sequence
MSFLDNVEPP TNLQEPQYKF VVDELFTQLD GANRANLGHF RTVSDKKVSV IQTFIKTFRI 
HIGDNIFPTA RLVFPDKDRR LYYIRDVTLA RLIVKMYSIP PESEDYKVLY HWKHGFQKEK
RFTVDSNNLR DLPLRASRII ANRRESTPIT TRHSYTVAEM NSKLDSLNDA KKSQEQIAIL
KPLLDSLSIP EIRWLLHIIL KKSILVRFEN YFLSVWHPDA PALFKVCNNL QKTFNYLVNQ
ETRLNKNDLT VHPTLPFRPQ LSFKLTKNYD KLIKDMSMTV PMDVNFQKMF AAKDLAGKFY
MEEKMDGDRM VLHKQGKHFK FYSRRLKDYS FLYGENLEIG SLTKYLSNAF PSKVDSIILD
GEMVAWDFKR NVVLPFGTLK SSAIQESVRQ YTTIDQYEQQ SSYPFFLVFD ILHINGTNLT
NHPLFFRKDI LQKIINPIPH RLELLPSIIG STAEDVQRAM REVVSSRSEG IVVKHLQSKY
FIGERNPHWV KVKPEYLEKF GENLDLTVIG KVPGVKISYM CGLKNDDDQV FYSFCTVANG
FTEDEYDKIE RITHNKWIAY KDQLPPSKVL NFGVKKPMHW IHPRDSVVLE IKARSIDCTV
EKTYAVGTTL HNLYCRSIRE DKSIHDCTTI SEYKQIKAKY SKDLEKSHSA NKKRRVLQDS
FVQEQQPKRV KVESDLFSGF DFVILSDNLS RNGDRITREE LIILVKKYGG NITNTVHKLG
TRQTIVVTER ELPTCKSYFD KGIDLVRPSW LFECINRVAI VPLEPYFIFG SKNMAVFKNR
QDEFGDSYVI HQTVESWDRN RFQRLPETEV DIYRREFLED VSATEESFPR RFLFHDIKFH
IVTENYMVEL LQDRIERFGG EVIAAHAGCS FIVVCGDSEN RDRTEVLNKV KAITKDISEK
LQFEDAILTT KIPSVVTEAF VKHCIKRNVL LDSDDYKYI