Gene Information |
Locus tag | Clim_0473 |
Symbol | |
ID | 6354468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | - |
Start bp | 535392 |
End bp | 538223 |
Gene Length | 2832 bp |
Protein Length | 943 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642668104 |
Product | DNA polymerase I |
Protein accession | YP_001942545 |
Protein GI | 189346016 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
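The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length(535392, 538223)
print(length)                  # 2832
print(protein_length(length))  # 943
```

Both results agree with the Gene Length (2832 bp) and Protein Length (943 aa) fields in the record.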
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAACG AACGGCAATT CGACTTTTTC AGCCCGCAGC AGACTCCGGC AGAGCCGGAC GACTCCGCTG CCCCACAACA AAAGCCCGGG CTCTTTCTTA TCGACGGCAT GGCCCTGGTT TACCGTTCCT ACTATGCCCT GCAACAGGCC GGAATGAAAA CCCGCGACGG AATACCTACA GGAGCCATTC ACGGATTTGC CTCTGCGCTC CTGAAAATTT TCGAAGTGTG GCATCCTGAA TACCTTGCCG TCACATTCGA CAGCAGGGAG AAAACCTTTC GGCACAACCT CTACGAACCC TACAAGGCCA ACCGTCCGGC CCCTCCTGAA GATCTTGTCC GGCAGATCGA GGCTATACTT GAACTGATCG ACGCGTTCTC CATTCCGCTC ATCAAACTAC CCGGTTACGA AGCTGACGAT CTTATCGGAT CTGCCGTCAA ACGATTCGAG AAACAGTGCT CTATCTACAT CGTTACTCCG GACAAGGATC TTGCGCAGCT GGTTCACGAG GGAGTAATCA TGCTGAAGCC CTCAAAAAAA CAGAACGAAC TCGAACCTTT CGGACCAGCG GAAATTATGG CACAGTTCGG CGTTGAACCG GAACGCTTTA TCGATCTTCT GACCCTTACC GGCGATACTT CCGACAATAT CCCCGGAGCA AAAGGCATTG GCCCGAAAAC CGCAGCCGCC CTGCTTCTTA AATACGGCAC ACTCGACAAC ATATACCGCA ACATCGGCGA ACTGACCCCG AAAACACGAG CAAGCCTTGA AGCCTTTCAA CCTCAGCTCG ATCTGATACG GCAGCTCGTT ACCATCCACT CCGATATCGA TCTCGGGCTG AATCTCGAAA CGCTTGCATG CAAAGCTCCT GTGCAAGGCA GAATCATGCC CTTCCTTGGA AAGTACGAGC TGAAAGCTGT CGCCGTTCGC CTGCCGTCCG TTTTTCCCGG TATGCTTTCA AACGGAACCC CTGCAGAAGA AGAGCGTGAC GAACAAGAAA ACAACAATGC ACCGTCTCTC GACTCGCCAT TAACCGGTGC GGACTATGCC ATCATCGATA CCCCAGAAGG GGTTGAAAAG CTTGTCCGGC AACTCGAGCA GGCCGACTCG TTTGCCATTG ACACCGAAAC GACAAGCCTC GACACCTTTC AGGCGGAACT GGCAGGTATC TCTTTTTCGC TCAAACCCAA AGAAGCCCGA TTTATCTATT TCGGAAAAGG CGGCCTCGAC ATAGGCAACA CTCTCGAACG ACTCAAGCCT GTTCTTGAAA ATCCCGATAT CCGCAAAACC GGTCAGAACC TCAAGTATGA TCTGCTCGTT CTGAAAAATT ACGGCATATC GCTTACACCG GTTGCATTCG ACACCATGCT GGCAAGCTAC GTGCTCGATC CGGAGGAAAA ACACAACCTC GACGATCTCG CAGCCCGCCA TCTCTCCATC AGAACGACAA CCTTTGACGA ACTGACCGCC GGTGAAGGGA AAAAACGAAC CCATATTCTC GACGTTCCGT CAGCTCAGCT TTCCGACTAT GGCTGTCAGG ATGCCGATAT CGCTCTCCGG CTTCAGGAGG TATTCGAAGC CAAGCTTCTC GAAGACCAGA GACTGCTGCA CCTCTGCAGA ACCATAGAAT TTCCCCTGGT GGAAGTGCTG GCCGGCATGG AATATCAGGG CATTTCCATC GATACGGCGC ATCTTGAAAA AACAGCGGAT CTCGTCAATG CGCAGATAGG CGATCTCTCC CGAAAAATCC ATGCGGCAGC AGGCACGCCC TTCAATCTCG ACTCGCCCAA ACAACTTTCC 
GGCATTCTCT TCAACACGCT GGGACTGCCA ACGAAAAAAA CCACGAAAAC CGGTTTTTCG ACCAATGTCG AGGTACTTGA AGAGCTTGCC CCGCTGCACC CGATAATCGG CGACCTGCTC GAATACCGGA GTCTCCAGAA ACTCAAGACC ACCTATATCG ACGCCCTGCC GAAAATGATC AATCCCGGAA CGGGAAGACT GCACACCTCG TTCAACCAGC ATGTCACCGC AACCGGCAGG CTCTCGTCAT CCAATCCGAA CCTGCAGAAC ATTCCGGTAC GGACTACTGC CGGCAAAGAG ATCAGAAGGG CATTCATACC CTCGAATCCC GATAATCTGA TACTCTCCGC CGATTATTCC CAGATCGAGC TCCGGATTGC CGCCGAAATT TCAGGCGACG AAAAACTCAT TGAAGCATTC CGCAACCGTG AAGATATCCA TCGAGCCACG GCAAAAGTCA TCTTCAGTAC CGACGAAATC ACTTCCGATA TGCGCCGCAA GGCAAAAGAG GTCAACTTCG GCGTACTGTA CGGCATTCAG CCATACGGTT TATCGAAACG GCTGAACATT CCCCAATCGG AAGCAAAAGC AATTATCGAA ACCTATACCA CGAAATATCC CGGTCTGTTC CACGCATTGC TGGAGATCAT CGAAGAAGGA AAACGAAAAG GATACGTAAC AACCCTCCTC GGAAGAAGAC GCTATATTCC CGATCTGAAC AGCCGTAACG GAAACATTCA GAAAGCTGCC GGCCGGGCGG CCATGAACAC TCCTATTCAG GGCACGGCCG CCGACATCAT CAAATGCGCG ATGAATCTCT GCGACCGGCA GATAAAAAAG AACAACATGA ACTCGGTTAT GCTGCTTCAG GTTCACGACG AACTGCTCTT TGAAACAACT GCCGAAGAAA AAAACCGGCT TTCGAAACTG GTTGAAACCG CTATGATCGA AGCCGCAGTC TATTGCGGCC TTAAAAAAGT ACCTGTCGAA GTAGATACCG GCACAGGAAA AAACTGGCTG GAGGCTCATT AG
|
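The gene sequence is displayed in space-separated blocks of 10 bases, so the spaces and line breaks need to be removed before any computation. The following is a minimal sketch of that cleanup together with a GC-content calculation. The helper names are hypothetical, and the example runs on only the first two blocks of the sequence, so its result is not expected to match the 52% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence above, used only as a small example.
fragment = "ATGACAAACG AACGGCAATT"
print(len(clean_sequence(fragment)))       # 20
print(round(gc_fraction(fragment) * 100))  # 40
```

Applying gc_fraction to the full gene sequence text would give the value to compare with the GC content field.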
Protein sequence | MTNERQFDFF SPQQTPAEPD DSAAPQQKPG LFLIDGMALV YRSYYALQQA GMKTRDGIPT GAIHGFASAL LKIFEVWHPE YLAVTFDSRE KTFRHNLYEP YKANRPAPPE DLVRQIEAIL ELIDAFSIPL IKLPGYEADD LIGSAVKRFE KQCSIYIVTP DKDLAQLVHE GVIMLKPSKK QNELEPFGPA EIMAQFGVEP ERFIDLLTLT GDTSDNIPGA KGIGPKTAAA LLLKYGTLDN IYRNIGELTP KTRASLEAFQ PQLDLIRQLV TIHSDIDLGL NLETLACKAP VQGRIMPFLG KYELKAVAVR LPSVFPGMLS NGTPAEEERD EQENNNAPSL DSPLTGADYA IIDTPEGVEK LVRQLEQADS FAIDTETTSL DTFQAELAGI SFSLKPKEAR FIYFGKGGLD IGNTLERLKP VLENPDIRKT GQNLKYDLLV LKNYGISLTP VAFDTMLASY VLDPEEKHNL DDLAARHLSI RTTTFDELTA GEGKKRTHIL DVPSAQLSDY GCQDADIALR LQEVFEAKLL EDQRLLHLCR TIEFPLVEVL AGMEYQGISI DTAHLEKTAD LVNAQIGDLS RKIHAAAGTP FNLDSPKQLS GILFNTLGLP TKKTTKTGFS TNVEVLEELA PLHPIIGDLL EYRSLQKLKT TYIDALPKMI NPGTGRLHTS FNQHVTATGR LSSSNPNLQN IPVRTTAGKE IRRAFIPSNP DNLILSADYS QIELRIAAEI SGDEKLIEAF RNREDIHRAT AKVIFSTDEI TSDMRRKAKE VNFGVLYGIQ PYGLSKRLNI PQSEAKAIIE TYTTKYPGLF HALLEIIEEG KRKGYVTTLL GRRRYIPDLN SRNGNIQKAA GRAAMNTPIQ GTAADIIKCA MNLCDRQIKK NNMNSVMLLQ VHDELLFETT AEEKNRLSKL VETAMIEAAV YCGLKKVPVE VDTGTGKNWL EAH
|
| |
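The relationship between the gene sequence and the protein sequence can be illustrated by translating codons with the amino-acid assignments of translation table 11, which are the same as those of the standard code. The sketch below is a simplified illustration, not the pipeline that produced this record: the names are hypothetical, translation simply stops at the first stop codon, and alternative start codons are not treated specially (this gene begins with ATG, so that case does not arise here).

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop spaces and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First five codons and last four codons of the gene sequence above.
print(translate("ATGACAAACG AACGG"))  # MTNER
print(translate("GAGGCTCATT AG"))     # EAH
```

The two fragments reproduce the first five residues (MTNER) and the last three residues (EAH) of the protein sequence shown above.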