Gene Information |
Locus tag | Caci_5677 |
Symbol | |
ID | 8337037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 6549440 |
End bp | 6552157 |
Gene Length | 2718 bp |
Protein Length | 905 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644958780 |
Product | DNA polymerase I |
Protein accession | YP_003116376 |
Protein GI | 256394812 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
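The coordinates and lengths listed above are mutually consistent, which a short check can confirm. This is a minimal sketch, not part of the record itself: it assumes the start and end positions are 1-based and inclusive (the convention that reproduces the listed gene length) and that the protein length excludes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information rows above.
START_BP = 6549440
END_BP = 6552157
GENE_LENGTH_BP = 2718
PROTEIN_LENGTH_AA = 905


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(gene_length(START_BP, END_BP))   # 2718
print(protein_length(GENE_LENGTH_BP))  # 905
```

Both results match the Gene Length and Protein Length rows, which is what one would expect for a gene that is neither spliced nor a pseudogene.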
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.902906 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.469276 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCCCG CCAAGAAGAC CTCCGAGTCC ACCGCAAGCC GCCCCGTCCT GCTCCTGCTG GACGGCCACT CCCTGGCGTA CCGGGCGTAC TACGCGCTGA AGGAGGCGGA CCTGCGCACC ACGACCGGTC AGCCCACCGG CGCGGTGCAG GGGTTCACGT CCATGCTGAT CAACACGCTG CGCGACGAGC GGCCGACGCA CGTCGCCGTC GCCTTCGACG TGTCGCGGCA GACCTTCCGG ACCGAGAAGT TCCCGGAGTA CAAGGCCAAC CGCTCGGCCT CGCCGGACGA CTTCAAGAGC CAGGTCGGGC TGATCGACGA GCTGCTGGCG GGCCTGGGCA TCACGGTCAT CCGCAAGGAG GGCTTCGAGG CCGACGACGT CATCGCGACC CTGACCACGC AGGCCGCCGC CGACGGCTAC GAGGTCCGCA TCCTCACCGG CGACCGCGAC TCGCTCCAGC TGGTCACCGA GAACGTGACC GTGCTCTATC CCAAGCGCGG CGTGTCCGAC CTGTCGCGCT TCACCCCGGC CGCGGTCGAG GAGAAGTACG AGCTGACCCC GCAGCAGTAC CCCGACTTCG CCGCCCTGCG CGGCGACCCC TCGGACAACC TGCCGAACAT CCCCGGCGTG GGGGAGAAGA CCGCCGCGAA GTGGATCCGC GAGTTCGGTT CGCTGACCGA GCTGATCGAG CGCGCCGACG AGGTGAAGGG CAAGGCCGGG GAGACCCTGC GGGCGCATCT GGAGCAGGTG CGGCTCAACC GCGAGCTGAC CGAGCTGATC AAGGACGTGC CGCTGCCGGC CGGTCCGCCG GACCTGGCGT GGGCCTCCGA GGGCAATCCG GAGGCGGTGT ATCAGCTCTT CGACACCCTG GAGTTCTCCC GCTCCATGCG CGACCGCGTG GCGCCGCTGC TGGCCTCCGA CGCCGACGAC AGCCCGGTGG GCGAGGCGGT GGCGCTGCAG GGGCAGGTGC TGGAGGTCGG GCAGCTGGCC GACTGGCTGG CCGCGAACGC CACCGGTGAG GGCGCGACCG GCGTCTCGGT GCACGGAAGC TGGGGGCGCG GGACCGGGGA CGTCAGCGGC CTGGCTTTCG CCACCGCCGA CGGCACCGCG GCGTACGTCG ACGCCACGCA GCTCGACCCG GCGGACGAGA CCGCGCTGGC GGCGTGGCTG GCCGACCCGG CGCGCCAGAA GATCCTGCAC GACGCCAAGG GCCCGTCCCT GGCGCTGGCC GCCCGCGGCT TCACCCTGGC CGGCGTGGCC GCCGACACCG CGCTGGAGGC CTACCTCGCG CAGTCCGGCC GCCGCTCCTT CGACCTGGAG CCGCTCGCCG AGGAGGTGCT GGGCCGCCGC CTGTCGCCGG CCGGCGCGGA CGAGAACCAG GGCACGCTGT TCGCCGACGA GGACGCCGAG GCCGAGCGCC AGATGGCCGC CGCGCACGCC GTGCTGGACC TGGCCGACGC GCTGCGCGGC AAGCTCGCCG AGACCGGCGC CGAGCAGCTG CTCACCGACA TCGAGCTGCC GCTGGTCGGG GTGCTGGCCG AGATGGAGCA GGTCGGCATC GCCATCGACG AGCGGCTGTT CCAGGACCTG GAGAAGGGCT TCTCCGGCGA GGCGGCCAAG GAGGTCGACG CCGCCCGCGC CGAGGCGGGC GTGGAGACCC TGAACCTGGG CTCGCCCAAG CAGCTGCAGG AAGTGCTCTT CGAGAACCTG GGCATGCCCA AGACCAAGAA GATCAAGACC GGCTACACCA CCGACGCCGA CTCCCTGGCC TGGCTCCAGG CCCAGACCCA GCACCCGTTC 
CTGGACCACC TGCTCCGCTG GCGCGAGGTG AACCGGCTCA AGACCGTGGT CGAGGGCCTG TCCAAGTCGG TCTCGCCGGA CACCCGCATC CACACCACCT ACAACCAGAT GATCGCCGCG ACCGGCCGGC TGTCCTCGGT GGACCCCAAC CTGCAGAACA TCCCCATCCG CACCCTGGAG GGGCAGCAGA TCCGCAAGGC CTTCATCGCC GGTCCCGGCT ATGAGTCCCT GATGACCGCG GACTACAGCC AGATCGAGAT GCGCATCATG GCCCACCTCT CCGAGGACGC GGGCCTGATC GAGGCCTTCA CCTCCGGCGA GGACCTGCAC AACACCGTCG CGGCCAAGGT CTTCGACGTC GAGGCCACCG CCGTCGAGCC CGAGCACCGC CGCCGCATCA AGGCGATGAG CTACGGCCTG GCCTACGGCC TGTCCGCCTT CGGCCTGTCC CAGCAGCTCG GCATCGAGAC CGGCGAGGCG GCCAAGATGA TGGAGGACTA CTTCCAGCGC TTCGGCGGCG TCCGCGACTA CCTCCACGAC CTCGTCGTCC AAGCCCGGGC CACCGGCTAC ACCGAGACCA TGTTCGGCCG CCGCCGCTAC CTCCCCGACC TCGCCAGCGA CAACCGCCAA CGCCGCGAAA TGGCCGAACG CATGGCCCTG AACGCCCCGA TCCAGGGCTC CGCCGCCGAC ATCATCAAGG TGGCGATGAT CCGCGTCCGC GAAGGCCTGC GCGAGCAGCA GTTGAAGTCC CGCATGCTCC TCCAGGTCCA CGACGAACTC GTCCTCGAGA TCGCCCCCGG CGAAGCACCC CGCGTCGAGG AACTGGTGCG GCGCGAGATG GGGTCCGCGG CCGAGCTGCG AGTGCCGCTG GACGTTTCGG TGGGGATGGC TGAGAACTGG GCGGATGCCG CGCACTAG
|
Protein sequence | MSPAKKTSES TASRPVLLLL DGHSLAYRAY YALKEADLRT TTGQPTGAVQ GFTSMLINTL RDERPTHVAV AFDVSRQTFR TEKFPEYKAN RSASPDDFKS QVGLIDELLA GLGITVIRKE GFEADDVIAT LTTQAAADGY EVRILTGDRD SLQLVTENVT VLYPKRGVSD LSRFTPAAVE EKYELTPQQY PDFAALRGDP SDNLPNIPGV GEKTAAKWIR EFGSLTELIE RADEVKGKAG ETLRAHLEQV RLNRELTELI KDVPLPAGPP DLAWASEGNP EAVYQLFDTL EFSRSMRDRV APLLASDADD SPVGEAVALQ GQVLEVGQLA DWLAANATGE GATGVSVHGS WGRGTGDVSG LAFATADGTA AYVDATQLDP ADETALAAWL ADPARQKILH DAKGPSLALA ARGFTLAGVA ADTALEAYLA QSGRRSFDLE PLAEEVLGRR LSPAGADENQ GTLFADEDAE AERQMAAAHA VLDLADALRG KLAETGAEQL LTDIELPLVG VLAEMEQVGI AIDERLFQDL EKGFSGEAAK EVDAARAEAG VETLNLGSPK QLQEVLFENL GMPKTKKIKT GYTTDADSLA WLQAQTQHPF LDHLLRWREV NRLKTVVEGL SKSVSPDTRI HTTYNQMIAA TGRLSSVDPN LQNIPIRTLE GQQIRKAFIA GPGYESLMTA DYSQIEMRIM AHLSEDAGLI EAFTSGEDLH NTVAAKVFDV EATAVEPEHR RRIKAMSYGL AYGLSAFGLS QQLGIETGEA AKMMEDYFQR FGGVRDYLHD LVVQARATGY TETMFGRRRY LPDLASDNRQ RREMAERMAL NAPIQGSAAD IIKVAMIRVR EGLREQQLKS RMLLQVHDEL VLEIAPGEAP RVEELVRREM GSAAELRVPL DVSVGMAENW ADAAH
|
| |
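The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It assumes the gene sequence is given 5'→3' on the coding strand, uses the codon assignments of translation table 11 (whose amino-acid assignments match the standard code), and reads an initiating GTG or TTG as methionine. The set of start codons below is a deliberately reduced, illustrative subset, and `translate_cds` is a hypothetical helper rather than anything provided by the database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a reduced set of common bacterial initiation codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence listed above.
print(translate_cds("GTGAGCCCCG CCAAGAAGAC CTCCGAGTCC"))  # MSPAKKTSES
print(translate_cds("GCGGATGCCG CGCACTAG"))               # ADAAH
```

The two outputs correspond to the first ten and last five residues of the listed protein sequence. Passing the full gene sequence to the same function would be the natural next step for checking the whole 905-residue product.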