Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1876 |
Symbol | ligD |
ID | 3908071 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2138794 |
End bp | 2141538 |
Gene Length | 2745 bp |
Protein Length | 914 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637883770 |
Product | ATP-dependent DNA ligase |
Protein accession | YP_485495 |
Protein GI | 86748999 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1793] ATP-dependent DNA ligase [COG3285] Predicted eukaryotic-type DNA primase |
TIGRFAM ID | [TIGR02776] DNA ligase D [TIGR02777] DNA ligase D, 3'-phosphoesterase domain [TIGR02778] DNA polymerase LigD, polymerase domain [TIGR02779] DNA polymerase LigD, ligase domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.907588 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGCCA GCAAGACCCT CACCCTGTAT CGCAACAAGC GCGACTTCGA ACAGACCGCC GAGCCGCGCG GCGATGCCGA GGTCGTGCCG TCGAAGCGGC GGCGCTTCGT AATTCAGAAG CACGACGCGA CGCGGCTGCA CTACGACCTG CGGCTCGAAT ATGACGGCGT GTTCAAGTCC TGGGCGGTGA CGCGCGGCCC ATCGCTCGAT CCACGCGACA AGCGGCTCGC GGTCGAGGTC GAGGACCACC CGCTCGACTA CGGCGACTTC GAAGGCACCA TTCCCAAGGG CCAGTACGGC GGCGGCACGG TGCAACTCTG GGACCGCGGC TATTGGGACT GCGATGATCC CGAGCGGGGG TTCAAGACCG GCGATCTGAA ATTCACGCTC GACGGCGAGA AGGTGCACGG CAGCTGGGTG CTGGTGCGGA TGCGCCACGA CCGCAACGGC GGCAAGCGGA CCAACTGGCT GCTGATCAAG CATCGCGACG ACGACGCCCG CGAGGGCAAG GCCAACGACA TTCTCGACGA GGATCGCTCG GTGGCGTCGG GCCGGACCAT GAAGCAGATC GCCGAAGGCA AGGGGCGGGC GCCGAAGCCG TTCATGACAG GCAAGGTGGC GCGGGTGAAG GCCGACGCGG TGTGGGACTC GAACAAGGGG CTGGCCGCGG ACGCGCGCGC GGCCGACGAC GCCGGAAAGG CGGCGAGACC CAAGCGCGCC GCGGCGAAGA AGGCGCCGAA GAAGGCCGCA GCGAAGTCCC GCACAACGAA GACGACGTCG AGGGCCACGC GCAAGCCGGT GAAGGTCGCC GCAATGCCGG ACTTCATCCC GCCGCAGCTC TGCACTTCGG TCGAGCGTCC GCCCGGCAGC GACGGCTGGC GCCACGAGAT CAAGTTCGAC GGCTATCGGA TGCAGTTGCG CATCGCGCAC GGCGAGGCTG CGCTCAGGAC CCGCAAGGGG CTCGACTGGA CCGCGAAATT TCGGGCGATC GCCGACGAGG CTGCGGGCCT GCCGGATGCG ATCATCGACA CCGAGATCGT CGCGCTCGAT CATCACGGCC ATCCCGACTT CGCGGCGTTG CAGGCGGCGC TGTCGGACGG CGACAGCGAC AAGCTGATCT GCTTTGCGTT CGATCTGCTC TATGCCGACG GTGAGGATCT ACGGTCTTTG CCGCTGTCCG AGCGCAAGCA GCGCCTGCAA GATCTGCTCA AAGCCGCGCG CGGACGTCGC AAGGAAGGCC TGATCCGCTA TGTCGAGCAC TTCGAGACCG GCGGCGATGC CATCCTGCAA TCGGCCTGCA AGCTGTCGCT CGAAGGCATC GTCTCCAAGA AGCTGGACGC GCCGTATCGC TCCGGCCGCA GCGACAACTG GACCAAGGCG AAGTGCCGTG CCGGGCACGA GGTGGTGATC GGCGGCTGGA AAACGACGGC CGGCAAATTC CGGTCGCTGA TGGTCGGCGT GCAGAGCGAC GATCGCGCGG GGAATCATCA TTTGGCTTAC GTCGGCCTGG TCGGCACCGG ATTCGGCCAG GACGTGGTGA AGCGCATCCT GCCCGAGTTG AAGGCGCGCG CATCAAAGGA CAATCCGTTC GCGGGCGAAA ACGCGCCGCG CAAGACCAGC GACGTGAATT GGGTGACACC GGACCTGGTC GCCGAGATCG AATTCGCCGG CTTCACCGGC GCCGGCATGG TGCGGCAGGC GGCGTTCAAG GGGCTGCGCG CCGACAAGCC GGCGGACGAA GTCGTGGCGG AGAAGCCTGC CCAGGTCGGG ATCGCGCGGC CGAAGCCGAA GCGCGGGACG AAAGCAGCCC CGGCTGTGCG GAAAGTCGCG TCCGCCGCCA GCCGCGCCGA GGTGATGGGA ATCTCGATCT CCAAGCCGGA CAAGGTGCTG TGGCCGGCGA GCGAGATCAG CGAGGCGATC ACCAAGCTCG ACCTCGCACA TTACTACGAA GCGGTCGGCG ACTGGCTGAT CGCGCACATC AAAGGGCGGC CGTGCTCGAT CGTGCGGGCG CCCGACGGCA TCGACGGCGA GCATTTCTTC CAGCGCCATG CGATGCCCGG GATGTCGAAC CTGATCGACC TCGCGAAGGT CTCCGGTGAC CGCAAGCCCT ATGTCCAGAT CGATCGTGTC GAAGGGCTGA TCGCGGTGGC GCAGATCGGC GGCCTCGAAC TGCATCCGTG GAACTGCGCG CCCGGCGCCT ACGACGTACC AGGGCGGCTC GTGTTCGATC TCGATCCCGC CCCGGATGTC GGCTTCGACG ACGTGGTCGC CGCGGCACGC GAAATGAAAG ACCGACTGGA GACGATCGGA CTGTCGACGT TCTGCAAGAC CACCGGCGGT AAGGGACTGC ACGTCGTGGT GCCGCTGCAG CCGAAGGACG ACGTCGACTG GAAGCAGGCC AAGATATTCG CGCAGACGGT GTGCGCGCAG ATGGCCGACG ACAGTCCTGA GCGTTATCTG CTCAACATGT CGAAGCAGCA GCGCAAGGGA AAGATCTTCC TCGACTACCT GCGCAACGAC CGGATGTCGA CGGCGGTCGC GGCGCTGTCC CCCCGGGCGC GCGAGGGAGC CACTGTATCG ATGCCGGTGA CCTGGAGTCA GGTCAAAGCG GGCCTCGATC CGAAGCGGTT CACGCTGGGT ACGGTGCCGG CCCTGCTTCG CAAGACCAAG GCGTGGGCGG ACTACGACGA ATCAGCGGCC CCGCTGAGGG CGGCGCTCGA GGCGCTCGCA TCCAGCGGGG CTTGA
|
Protein sequence | MAASKTLTLY RNKRDFEQTA EPRGDAEVVP SKRRRFVIQK HDATRLHYDL RLEYDGVFKS WAVTRGPSLD PRDKRLAVEV EDHPLDYGDF EGTIPKGQYG GGTVQLWDRG YWDCDDPERG FKTGDLKFTL DGEKVHGSWV LVRMRHDRNG GKRTNWLLIK HRDDDAREGK ANDILDEDRS VASGRTMKQI AEGKGRAPKP FMTGKVARVK ADAVWDSNKG LAADARAADD AGKAARPKRA AAKKAPKKAA AKSRTTKTTS RATRKPVKVA AMPDFIPPQL CTSVERPPGS DGWRHEIKFD GYRMQLRIAH GEAALRTRKG LDWTAKFRAI ADEAAGLPDA IIDTEIVALD HHGHPDFAAL QAALSDGDSD KLICFAFDLL YADGEDLRSL PLSERKQRLQ DLLKAARGRR KEGLIRYVEH FETGGDAILQ SACKLSLEGI VSKKLDAPYR SGRSDNWTKA KCRAGHEVVI GGWKTTAGKF RSLMVGVQSD DRAGNHHLAY VGLVGTGFGQ DVVKRILPEL KARASKDNPF AGENAPRKTS DVNWVTPDLV AEIEFAGFTG AGMVRQAAFK GLRADKPADE VVAEKPAQVG IARPKPKRGT KAAPAVRKVA SAASRAEVMG ISISKPDKVL WPASEISEAI TKLDLAHYYE AVGDWLIAHI KGRPCSIVRA PDGIDGEHFF QRHAMPGMSN LIDLAKVSGD RKPYVQIDRV EGLIAVAQIG GLELHPWNCA PGAYDVPGRL VFDLDPAPDV GFDDVVAAAR EMKDRLETIG LSTFCKTTGG KGLHVVVPLQ PKDDVDWKQA KIFAQTVCAQ MADDSPERYL LNMSKQQRKG KIFLDYLRND RMSTAVAALS PRAREGATVS MPVTWSQVKA GLDPKRFTLG TVPALLRKTK AWADYDESAA PLRAALEALA SSGA
|
| |