Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0221 |
Symbol | |
ID | 5897495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 234502 |
End bp | 237423 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641560705 |
Product | DNA polymerase I |
Protein accession | YP_001681856 |
Protein GI | 167644193 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.978569 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.324106 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACA CGCTCGCCCC CGATTCCGCC GCTCCCCTTC TTCCGACCGA GGCGGAAGCA GACCGTCCCC TGACCCAGGA CGGCCCCACC GTGCGGCTGT TCCTGGTCGA CGGCTCGGCT TATCTGTTCC GCGCCTACCA CGCCCTGCCG CCGCTGACCC GCAAGAGCGA CGGCTTGCCG GTGGGCGCGG TCCAGGGCTT CTGCAACATG TTGTGGAAGC TGATGCGCGA CATGCAGGGC GACGCGCCCA CCCACCTGGC GGTGATCTGG GACCATTCCG AGAAGACGTT CCGGAATAAC CTCTATGACA AGTACAAGGC CCACCGCCCG CCGCCGCCCG AAGACCTGAT CCCGCAGTTC CCGTTGGTGC GCGAGGCCAC CCGGGCCTTC GGCGTGCCCG CCATCGAGCT GCCCGGCTAC GAGGCCGACG ACCTGATCGC CGCCTATGCC TGCAAGGTCC GTGACATGGG CGGCGAGGCG GTGATCGTCT CGTCCGACAA GGACCTGATG CAACTGGTCG GCGACGGTGT GTCGATGTTC GACCCGATGA AGGGTCTACG GATCGATCGT GACCAGGTGT TCGAGAAGTT CGGCGTCTAT CCCGACCGGG TGGTCGACGT GCAGTCGCTG TGCGGCGACA GCGTCGACAA CGTGCCCGGC GCGCCGGGCA TCGGCATCAA GACCGCCGCC CAGCTGATCA ACGAATACGG CGACCTGGAC ACGCTGCTGG AGCGGGCCGG CGAGATCAAG CAGCCCAAGC GCCGCGAGAC CCTGATCGAG TATGCCGACC AGATCCGTCT GTCGCGCCAA CTCGTCAAGC TCGACTGCGA CACGCCGCTG CCCGAGCCCA TCGATGATCT GGTGGTGCGC GAGCCGGACA AGACCGTGCT GGCCGACTTC CTGGAGCTGA TGGAGTTCCG CTCCCTGGCC CGCCGGGTCG GCGACGGCAA CGCCTCTGCC TCGCCGCGCA CTCTAGACCG CCCGGCCGCC GCGCCGACCG CGCCGGTGCT GGGCGTGTCC TACATGGGCG CGGCGGCCCG GGCCGCGGCC AATCCTGTCG CCGAGCCGGC CACGATCGAC CACGCCGCCT ATGTCCGCAT CCAGGATCTC GAGACCTTGA ACGCCTGGGT CGCCAAGGCC ACGGCCAAGG GGATCGTCGC CTTCGACACC GAGACCGACG CCCTGTCCTC GGCCACCGCC GGCCTGTGCG GCGTATCGCT GGCCATTTCG CCGGGCGAGG CCTGCTACAT CCCGGTCGGC CACTGCGAGA AGGACGGCCT GGCGCTGGAG GCCGCCGCCG ATCTGGTGCA GGTGCCGATG GAAGAGGTGA TCGCCGCGCT GAAGCCGCTG CTGGAAGACC CGGCGGTGCT GAAGATCGCC CAGAACGCCA AGTACGACAT CGCCGTCCTG GCCCGCTACG GGATCAATGT GGGGCCGATC GAGGACACGA TGCTGATCAG CTACGTCCTT GAGGCCGGTC TGCATGGCCA CGGGATGGAT GAGCTGTCGG AGCTGTGGCT AGGCCACAAG CCGATCTCCT TCAAACAGGT CGCGGGCTCC GGCAAGGGGC AGATCAGCTT CAAGCACGTG GGTCTGGCCG AGGCCACCGC CTATGCCGCC GAGGACGCCG ACGTCACATT GCGGCTCTAC AATGTGCTCA AGCCGCGCCT GGCGCGCGAG GGCCTGCTGA CGGTCTACGA GACCCTGGAG CGCCCGCTGC CCGCGGTGTT GGCGGCGATG GAGAACGACG GCATCCGCGT CGATCCCGAC ACCCTGCGCC GGCTGTCCAA CGAGTTCTCG ATGCGCATGG CCGATTTCGA GGCCAGGGCC CAGGAACTGG TCGGTCGTCC GTTCAACCTG GGCAGTCCCA AGCAGATCGG CGACGTGTTG TTCGGCGAGA TGGCCCTGAA GGGCGGCAAG AAGACCGCCA CGGGCCAGTG GTCGACCGAC AGCGACGTGC TCGAGGCTTT GGCCCTGGAG CATGAGCTGC CGCGCGTGCT GCTGGACTGG CGCCAGCTGT CCAAGCTGAA GGGCACCTAT ACCGAGAACC TGATCGCCGC GATCGCGCCG GGCGGCGGCA ACCGGGTCCA CACCTCCTAC GCCCTGGCCG CCACCACGAC GGGCCGGCTG TCGTCGTCGG ACCCCAACCT GCAGAACATC CCGATCCGCA CCGAGGAAGG CCGCAAGATC CGCAAGGCCT TCGTGGCCGC GCCGGGCAAG GTGCTGATCA GCGCCGACTA CAGCCAGATC GAGCTGCGCC TGCTGGCCCA TATCGGCGAC ATTCCCCAGC TGAAGAAGGC CTTCCAGGAG GGCCTGGACA TCCACGCCAT GACCGCGTCG GAGATGTTCA ACGTGCCGAT CGAAGGCATG CCCTCGGAAG TGCGCCGCCG GGCCAAGGCG ATCAATTTCG GCATCGTCTA CGGCATCAGC GCTTTTGGCC TGGCCAACCA GCTGTCGATC CCGCAAGGCG AGGCCGGAGC CTATATCAAG ACCTATTTCG AGCGCTTCCC CGGCATCCAG GCCTATATGG AGGCGACCAA GGCCTTCGTC CGCGAGCACG GCTACGTCAC CACGATCTTC GGGCGCAAGA TCAACATCCC GGACATCGGC GGCAAGTCCG TGGCCCATCG GCAGTTCGCC GAGCGCGCCG CGATCAACGC TCCGATCCAG GGCGCGGCGG CCGATGTGAT GCGCCGAGCC ATGGTCCGCA TGCCGGGCGC GCTGAAGGCG GCGGGGCTGT CGTCACGGAT GCTGTTGCAG GTGCACGACG AACTGGTCTT CGAAGCGCCC GAGGCCGAGG CCCAGGCCAC GATCGACGTC GCCCGTGAGG TCATGCAGGG CGCCGCCGCG CCCGCCGTGA CCATTTCGGT GCCGCTGACG GTGGAGGCCA GGGCCGCCGC CAACTGGGAC GAAGCCCACT AA
|
Protein sequence | MTDTLAPDSA APLLPTEAEA DRPLTQDGPT VRLFLVDGSA YLFRAYHALP PLTRKSDGLP VGAVQGFCNM LWKLMRDMQG DAPTHLAVIW DHSEKTFRNN LYDKYKAHRP PPPEDLIPQF PLVREATRAF GVPAIELPGY EADDLIAAYA CKVRDMGGEA VIVSSDKDLM QLVGDGVSMF DPMKGLRIDR DQVFEKFGVY PDRVVDVQSL CGDSVDNVPG APGIGIKTAA QLINEYGDLD TLLERAGEIK QPKRRETLIE YADQIRLSRQ LVKLDCDTPL PEPIDDLVVR EPDKTVLADF LELMEFRSLA RRVGDGNASA SPRTLDRPAA APTAPVLGVS YMGAAARAAA NPVAEPATID HAAYVRIQDL ETLNAWVAKA TAKGIVAFDT ETDALSSATA GLCGVSLAIS PGEACYIPVG HCEKDGLALE AAADLVQVPM EEVIAALKPL LEDPAVLKIA QNAKYDIAVL ARYGINVGPI EDTMLISYVL EAGLHGHGMD ELSELWLGHK PISFKQVAGS GKGQISFKHV GLAEATAYAA EDADVTLRLY NVLKPRLARE GLLTVYETLE RPLPAVLAAM ENDGIRVDPD TLRRLSNEFS MRMADFEARA QELVGRPFNL GSPKQIGDVL FGEMALKGGK KTATGQWSTD SDVLEALALE HELPRVLLDW RQLSKLKGTY TENLIAAIAP GGGNRVHTSY ALAATTTGRL SSSDPNLQNI PIRTEEGRKI RKAFVAAPGK VLISADYSQI ELRLLAHIGD IPQLKKAFQE GLDIHAMTAS EMFNVPIEGM PSEVRRRAKA INFGIVYGIS AFGLANQLSI PQGEAGAYIK TYFERFPGIQ AYMEATKAFV REHGYVTTIF GRKINIPDIG GKSVAHRQFA ERAAINAPIQ GAAADVMRRA MVRMPGALKA AGLSSRMLLQ VHDELVFEAP EAEAQATIDV AREVMQGAAA PAVTISVPLT VEARAAANWD EAH
|
| |