Gene Information |
Locus tag | Cpha266_0536 |
Symbol | |
ID | 4569073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 589218 |
End bp | 592058 |
Gene Length | 2841 bp |
Protein Length | 946 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 639765135 |
Product | DNA polymerase I |
Protein accession | YP_911017 |
Protein GI | 119356373 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
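The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function name `feature_lengths` is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Start bp and End bp taken from the record above.
gene_len, protein_len = feature_lengths(589218, 592058)
print(gene_len, protein_len)  # 2841 946
```

The strand field does not affect this check, since the length depends only on the span between the two coordinates.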
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATTG ACAATCAATT TGATTTTTTC CAGGCAGGTT GCGGCCAACC GTCTGAAGCA ACAAAAAAAA ACATACCGGA AAAAAAACCC GCACTGTTTC TGATAGATGG CATGGCCATG GTTTACCGGG CATATTATGC GTTGCAGTCT GCAAGAATGA AAACCCGCGA TGGCCTTCCA TCAGGAGCTG TGTTTGGCTT CACCTCTGCG CTGCTCAAGA TTTTCGAAAC CTATAAACCC GACTATCTTG CCGTTGCATT TGACAGCAGG GAAAAAACAT TTCGCCATGA CCTTTATGAC CTTTACAAAG CAAACCGTCC CTCCCCACCA GAGGATCTCA TAAGTCAGCT CGATGCCATT TATCAGCTTG TAACGCTCCT CGGCATACCG ATTATCAAAA CAGCCGGGTT TGAAGCCGAT GACCTTATCG GTTCACTTGC GCGCAAATTC GAAAAGAGTT GCGCTATCTA TATCGTAACG CCTGACAAAG ACCTTGCGCA ACTTGTCAAC GATGAAGTAT ACATACTTAA ACCGGGAAAA ATCCAGAATG AACTTGAACT TCTTGGCATT AAGGAAATTA CAATACAGTT TGGAGTCCGT CCTGAGCAGT TCACCGACTT TTTAGCACTC GCAGGAGATG CCTCGGACAA TATCCCCGGA GCAAAAGGTA TTGGCCCGAA AACCGCTGCA AGCCTGATAG GAAAATATGG ATCCATTGCC AGCATACTTC TTAATCTCAA TGAGATATCT CCAAAAAACC GACAGAGTCT CGAAGAATTT CAACCTCGCC TCAAACTGAT CAGGCAACTG GTAACCATAC GAACCGATCT CGACCTTCAC GTGAGCCTGC AAAACCTTGC AACAACGACT CCTGACGTCG AAAAAATACT TCCTTTCCTG AAGCAATACG AGCTGAAATC CATAGCAGCA AGACTTCCCG CTATTTTTCC GGAAATGAAT CTTCAGCCAG AATCACTACC CGCAGATAAC GAAAATAATG AGAAGAACGA CAGCCCGGAG CTTACTGCTC CCGGCTCCGC GAAAGATGGC GCTGATTACC GAATAGTCAC AAGGAAAGAG GAGCTGCAGG CACTCGTCGA ACAACTGAAC CATGCGGCAG CACTTGCTGT TGATACCGAG ACCACCAGCC TGGATACCTT TCAGGCTGAA CTTGTCGGGA TATCGCTCTC CATAAAACCA AAACAGGCAT GGTTCGTCTA TTTCGGCAAA GGCGGAGTGG ATAGACGAGT CGCTCTCGAC ATGCTCAAGC CGGTTCTTGA AAACCCATCT CTGCGAAAAA CGGGACAAAA CCTCAAATAT GACCTGCTCG TTCTGAAAAA ATATGGAATC GACATCAACC CTGTTGCATT TGACACCATG CTTGCGAGTT ACGTGCTCGA TCCGGAAGCG AAACATAATC TTGACGACCT TGCGCTGCGC CATCTCTCCA TAAAAACCAC AACGTACGAC GAACTGGTCG CTGACGGCAA GAAAAAAATG TCCATCCTTG ACGTTCCACC CGGCGAACTG TCCGATTATG CATGTCAGGA TGCCGATCTT GCGCTGCGCC TGCAGGAGGT TTTCAAAGAA AAACTCCTGC AGGAAAAAGA TCTGCTCTGG TTATGTGAAA ATATTGAATT CCCTCTTGTG CCGGTTCTCG CAACAATGGA GTATCATGGC ATTTCAATTG ATACCGATCA CCTGAAAAAA ACAGAAACAA CTGTTTCAAA GCAGATCGGC CAACTCTCCG AAAAAATTTT CGAAGCATCC GGCAGAGTTT TCAACCTCGA TTCGCCAAAA 
CAACTTGCTC ATATTCTGTT CGACATTCTC GGGCTTCCTT CCGGCAAAGC AACGAAAACC GGCTTTTCCA CCAACGTTCA GGTCCTTGAA GATCTTGCTC CGATCCACCC TGTCGCACAG GATCTCCTTG AATACAGAAG CCTGCAGAAA CTCAGAAACA CGTATATCGA AGCTTTACCA AAAATGATCA ATCCTCTGAC GGGAAAACTG CATACATCAT TCAATCAACA TGTTACCGCA ACAGGACGGC TTTCATCATC GCATCCTAAC CTGCAAAACA TCCCGATACG CACCCTGATT GGAAAAGAAA TACGGCGAGC ATTTATACCG TCCAACCCTG AAAACCTGCT TCTTTCAGCA GACTACTCCC AGATAGAACT CAGAGTGGCT GCCGAAATCA GCGGCGATGA AAAGCTTATG GAGGCGTTCA GAAACAGGGA AGACATCCAT TCCGCTACAG CAAGAACCAT TTTCAACACA ACCGAGATCA CCCCGGAGAT GCGACGCAAG GCAAAGGAGG TGAATTTCGG AGTATTGTAT GGAATCATGC CTTTTGGTCT TTCACGACGC CTCAACATCT CTCGCAATGA AGCAAAAAAT ATTATCGATA CGTATACCGA AAAGTACCCC GGCATATTCA ACGCTTTGCA ACAAATCATC AGTGACGGCA AGGAACGCGG TTATGTTTCG ACCCTGCTTG GCAGAAGACG ATACATCCCT GATCTGAACA GCAGAAACAA GAATATGCAG AAAGCTGCAG AAAGAGCAGC AATGAATACT CCGATTCAGG GAACGGCTGC AGACATCATC AAATGCGCAA TGAATCTCTG CAGCACGCAA CTTCGATTGC ATAAAATGAA ATCCGTTATG CTCCTTCAGG TTCATGACGA ACTTCTTTTT GAAACACCGG AAAACGAAAA ACATAGGCTG AAAACGCTTG TAGAAGAGGT AATGATTGAT GCGGCAAAGC GTTGCGGGTT ACATAACGTC CCTGTAGAAG TAGATACCGG TATCGGAAAA AACTGGCTCG AAGCCCATTA A
|
Protein sequence | MSIDNQFDFF QAGCGQPSEA TKKNIPEKKP ALFLIDGMAM VYRAYYALQS ARMKTRDGLP SGAVFGFTSA LLKIFETYKP DYLAVAFDSR EKTFRHDLYD LYKANRPSPP EDLISQLDAI YQLVTLLGIP IIKTAGFEAD DLIGSLARKF EKSCAIYIVT PDKDLAQLVN DEVYILKPGK IQNELELLGI KEITIQFGVR PEQFTDFLAL AGDASDNIPG AKGIGPKTAA SLIGKYGSIA SILLNLNEIS PKNRQSLEEF QPRLKLIRQL VTIRTDLDLH VSLQNLATTT PDVEKILPFL KQYELKSIAA RLPAIFPEMN LQPESLPADN ENNEKNDSPE LTAPGSAKDG ADYRIVTRKE ELQALVEQLN HAAALAVDTE TTSLDTFQAE LVGISLSIKP KQAWFVYFGK GGVDRRVALD MLKPVLENPS LRKTGQNLKY DLLVLKKYGI DINPVAFDTM LASYVLDPEA KHNLDDLALR HLSIKTTTYD ELVADGKKKM SILDVPPGEL SDYACQDADL ALRLQEVFKE KLLQEKDLLW LCENIEFPLV PVLATMEYHG ISIDTDHLKK TETTVSKQIG QLSEKIFEAS GRVFNLDSPK QLAHILFDIL GLPSGKATKT GFSTNVQVLE DLAPIHPVAQ DLLEYRSLQK LRNTYIEALP KMINPLTGKL HTSFNQHVTA TGRLSSSHPN LQNIPIRTLI GKEIRRAFIP SNPENLLLSA DYSQIELRVA AEISGDEKLM EAFRNREDIH SATARTIFNT TEITPEMRRK AKEVNFGVLY GIMPFGLSRR LNISRNEAKN IIDTYTEKYP GIFNALQQII SDGKERGYVS TLLGRRRYIP DLNSRNKNMQ KAAERAAMNT PIQGTAADII KCAMNLCSTQ LRLHKMKSVM LLQVHDELLF ETPENEKHRL KTLVEEVMID AAKRCGLHNV PVEVDTGIGK NWLEAH
|
| |
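The gene sequence is listed in coding orientation (it begins with ATG and ends with TAA), so the protein sequence can be reproduced by translating it codon by codon. The following is a minimal sketch, not the pipeline that produced this record. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding-orientation DNA sequence up to the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First three and last three 10-base blocks of the gene sequence above.
head = "ATGAGCATTG ACAATCAATT TGATTTTTTC"
tail = "AACTGGCTCG AAGCCCATTA A"
print(translate(head))  # MSIDNQFDFF
print(translate(tail))  # NWLEAH
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow a comparison against the Protein sequence and GC content fields.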