Gene Information |
Locus tag | Gdia_2239 |
Symbol | |
ID | 6975668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 2482664 |
End bp | 2485234 |
Gene Length | 2571 bp |
Protein Length | 856 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643391766 |
Product | DNA ligase D |
Protein accession | YP_002276609 |
Protein GI | 209544380 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1793] ATP-dependent DNA ligase; [COG3285] Predicted eukaryotic-type DNA primase |
TIGRFAM ID | [TIGR02776] DNA ligase D; [TIGR02777] DNA ligase D, 3'-phosphoesterase domain; [TIGR02778] DNA polymerase LigD, polymerase domain; [TIGR02779] DNA polymerase LigD, ligase domain |
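
The start and end coordinates, gene length, and protein length listed in this table are mutually consistent. A minimal sketch of that check follows. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2482664
end_bp = 2485234

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints: 2571 856
```

Both results match the "Gene Length" and "Protein Length" fields above.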
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.839838 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCTGG ATACCTATCG CAGCAAGCGC GATTTCAAGG CGACGCCCGA GCCCGAGGGT GCCGAGGCGG CGGAAAAGCC GGATACCGGC GCGGGCCATC TGTTCGTCAT CCAGAAACAT GCGGCACACC GCCTGCATTA CGACCTGCGG CTGGAAATGG ATGGCGTGCT GAAGAGCTGG GCGGTGACGC GCGGCCCCAG CCTGAACCCG GGCGACAAGC GCCTGGCCGT GCAGGTCGAG GACCACCCGC TGGATTACGG CACGTTCGAG GGCACGATCC CGAAGGGGCA GTATGGCGGC GGCACGGTCA TGCTCTGGGA CAAGGGCACG TGGACGCCGC TGCACGATCC GGTCCGGGGC CTCGCCAAGG GGCATCTCGA TTTCGAACTG CGGGGCGAAA AGCTTCACGG ACGCTGGAAC CTGGTGCGGA TGGACGGACA CCGCGAGGGA ACGCATGAGA ACTGGCTGCT GATCAAGGCC GCCGATTCCG ATGCACGGTC GAAGAACGAT GCCGATATCC TCGAAGAGCG GCCGGAATCC GTAAAGACCG GCCGTGCGAT GGACGACATC GCGGCCGGTG CGCCAGGGAA GGACAGGGTC GGGAAGGCGG CGGCCCAAAA AGCCGTGACG AAGAAAGCCG CGATACCGAA GGACGCGCCG CAGGCGGACG CGGCCGCGAC AGGCGCCGCA CCCTGCGCGG GGGCGAAGCC CGGCCCGTTG CCATCCTTCG TCGAGCCCGA ACTGGCCACC CTGGTGCGCG CGGCCCCCAC CGGCCCGCAA TGGCTGCACG AGGTCAAGTT CGACGGCTAC CGCCTGCTGG CGCGGATCGA AGCCGGACGG GTCGTGCTGC TGACCCGCAC GGGCCTGGAC TGGACCGCCC GGTTCGGCGA CCAGATCAGT GCCGCGCTGG CGTCCCTGCC GGTGCGGACG GCGCTGATCG ACGGCGAACT GGTGGTGGAA ACCGCAGGCG GGACCTCGGA TTTTTCCGCT CTCCAGGCCG ATCTCAGCGC CGGCCGCACC GACCGCTTCA TCTTCTACGT CTTCGATCTG TTGCATCTGG ACGGCTACGA CCTGCGCGAC GCGACGCTGG AGGCGCGCAA GGGCGCGCTG CACGACCTGG TGCCGGACGA TGCGGCGCGG CTGCGCTTCA GCGGCCATTT CGACGAGGCC GGGGGGCAGG TGCTGCGCCA TGCCTGCCGC CTGGGCCTGG AGGGAATCGT CTCCAAGCAG CGGGACGTGC CCTACCGGTC GGGACGGGGC CGGGACTGGG TGAAATCGAA ATGCGTGGCG CGGCAGGAAT TCGTGATCGG CGGCTACGTG CCCTCGACCG CCAGCCGGGG GGCGGTCGGC TCGCTGGTCC TTGGGGTCCA GGAAAACGGC AGGCTGGTCC ATGTCGGCCG GGTCGGGACC GGCTTCACCG CCGCGACGGC GGCGGACCTG CTCCGGCGCC TGCAACCGCT TGATGTTCCC GACAGTCCCT TCGCCGCTGC CCTGACGGCC CGCGAGCGAA AGGGGGTGCG CTATGTCCGC CCCGATTGCG TGGCGGAAGT GGAATTCCGC GCCTGGACCG CCGACGGCCA TCTGCGCCAC GCGGCGTTCC TGGGCCTGCG CGAGGACAAG CCGGCGGCCG ACATCGTCCG CGAGGTCGAA GCGCCCGACC CCGTATCCTC CGGCCCTTCA TCCAAACCGG TCAAACCGGT TTCCCCGGAA CCCGCGCCTG CCCGCCCGGC CCGCACCCTG ACCCATCCCG ACCGGTCCTA CTGGCCGGAT GCCGGCGTGA CCAAGCAGGA CCTGGCCGAT 
TATTACGCCG CGATCTGGCC GCGAATGGCG CCCTTCATCA CCGACCGCGC GCTGGCGCTG TTGCGCTGCC CCAACGGCAT CGCGGGGCCG CGCTTCTTCC AGAAGAACTT ATGGAAGGGC GCCGGCGGTC ACCTGGTGCC GTTGCGGGAC CCGGAAGGCG ATCCCGGCAC GCCGCTGATC GGCCTGCGCG ACCGCGACGG GCTGATCGAC CTGGCGCAGG CCGCCGCGCT GGAAATCCAT CCCTGGGGCG CGTCCGCCCA GGCGTGGGAC CAGCCCGACA TGATCGTGAT GGACCTCGAT CCCGGCGACG GCGTGCCCTG GTCCATGGTG ATCGAGGCGG CGCGGGAAAT CCGCGCGCGG CTGGAACGGT CCGGCCTCGC GTCCTTCGTC AAGACGACCG GCGGCAAGGG GCTGCATGTC GTGGCGCCGC TGAAACCCGG GGCCGACTGG ACGGCGGTCA AGGCCTTCAC CCGGTCCATG GCCCAGGCGA TGGTGTCGGA CAGCCCGCAG CGTTACGTGG CCACCATCAC CAAATCCAGG CGCCAGGGCC GTATCCTGGT CGATTACCTG CGCAACCAGC GCGGCGCGAC CGCCGTCGCG CCCTATTCCC CCCGTGCCCG CCCCGGCGCG CCGGTCGCGA TGCCGCTGGC CTGGAACGAA CTGGGGCCGG ATATCGGACC CGCGCATTTC ACCATCGGGA CGATCGCCGC CCGACTGGCG GTCGCGGACC CGTGGGCGGA GATCCGTGAC GCGGCCCGGC CGCTCCGTTA A
|
Protein sequence | MALDTYRSKR DFKATPEPEG AEAAEKPDTG AGHLFVIQKH AAHRLHYDLR LEMDGVLKSW AVTRGPSLNP GDKRLAVQVE DHPLDYGTFE GTIPKGQYGG GTVMLWDKGT WTPLHDPVRG LAKGHLDFEL RGEKLHGRWN LVRMDGHREG THENWLLIKA ADSDARSKND ADILEERPES VKTGRAMDDI AAGAPGKDRV GKAAAQKAVT KKAAIPKDAP QADAAATGAA PCAGAKPGPL PSFVEPELAT LVRAAPTGPQ WLHEVKFDGY RLLARIEAGR VVLLTRTGLD WTARFGDQIS AALASLPVRT ALIDGELVVE TAGGTSDFSA LQADLSAGRT DRFIFYVFDL LHLDGYDLRD ATLEARKGAL HDLVPDDAAR LRFSGHFDEA GGQVLRHACR LGLEGIVSKQ RDVPYRSGRG RDWVKSKCVA RQEFVIGGYV PSTASRGAVG SLVLGVQENG RLVHVGRVGT GFTAATAADL LRRLQPLDVP DSPFAAALTA RERKGVRYVR PDCVAEVEFR AWTADGHLRH AAFLGLREDK PAADIVREVE APDPVSSGPS SKPVKPVSPE PAPARPARTL THPDRSYWPD AGVTKQDLAD YYAAIWPRMA PFITDRALAL LRCPNGIAGP RFFQKNLWKG AGGHLVPLRD PEGDPGTPLI GLRDRDGLID LAQAAALEIH PWGASAQAWD QPDMIVMDLD PGDGVPWSMV IEAAREIRAR LERSGLASFV KTTGGKGLHV VAPLKPGADW TAVKAFTRSM AQAMVSDSPQ RYVATITKSR RQGRILVDYL RNQRGATAVA PYSPRARPGA PVAMPLAWNE LGPDIGPAHF TIGTIAARLA VADPWAEIRD AARPLR
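
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a hedged illustration, not the database's own pipeline: it builds the codon table from the usual T, C, A, G ordering, relies on translation table 11 sharing its elongation codon assignments with the standard code, and ignores alternative start codons (this gene starts with ATG, so that simplification does not matter here). The helper name `translate` is hypothetical.

```python
# Codon ordering (T, C, A, G) used to lay out NCBI translation tables.
BASES = "TCAG"
# Amino acids in that order; table 11 matches the standard code for elongation.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGCGCTGG ATACCTATCG CAGCAAGCGC"))  # prints: MALDTYRSKR
```

The output matches the first block of the protein sequence. Passing the full gene sequence to the same function would be the way to check the remaining residues.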