Gene Caul_0221 details


Gene Information

Locus tag: Caul_0221
Symbol:
ID: 5897495
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 234502
End bp: 237423
Gene Length: 2922 bp
Protein Length: 973 aa
Translation table: 11
GC content: 68%
IMG OID: 641560705
Product: DNA polymerase I
Protein accession: YP_001681856
Protein GI: 167644193
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
        [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
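
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon, neither of which the record states explicitly. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 234502
END_BP = 237423
GENE_LENGTH_BP = 2922
PROTEIN_LENGTH_AA = 973


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed coordinates, gene length and protein length agree with one another.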


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.978569
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.324106
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACA CGCTCGCCCC CGATTCCGCC GCTCCCCTTC TTCCGACCGA GGCGGAAGCA 
GACCGTCCCC TGACCCAGGA CGGCCCCACC GTGCGGCTGT TCCTGGTCGA CGGCTCGGCT
TATCTGTTCC GCGCCTACCA CGCCCTGCCG CCGCTGACCC GCAAGAGCGA CGGCTTGCCG
GTGGGCGCGG TCCAGGGCTT CTGCAACATG TTGTGGAAGC TGATGCGCGA CATGCAGGGC
GACGCGCCCA CCCACCTGGC GGTGATCTGG GACCATTCCG AGAAGACGTT CCGGAATAAC
CTCTATGACA AGTACAAGGC CCACCGCCCG CCGCCGCCCG AAGACCTGAT CCCGCAGTTC
CCGTTGGTGC GCGAGGCCAC CCGGGCCTTC GGCGTGCCCG CCATCGAGCT GCCCGGCTAC
GAGGCCGACG ACCTGATCGC CGCCTATGCC TGCAAGGTCC GTGACATGGG CGGCGAGGCG
GTGATCGTCT CGTCCGACAA GGACCTGATG CAACTGGTCG GCGACGGTGT GTCGATGTTC
GACCCGATGA AGGGTCTACG GATCGATCGT GACCAGGTGT TCGAGAAGTT CGGCGTCTAT
CCCGACCGGG TGGTCGACGT GCAGTCGCTG TGCGGCGACA GCGTCGACAA CGTGCCCGGC
GCGCCGGGCA TCGGCATCAA GACCGCCGCC CAGCTGATCA ACGAATACGG CGACCTGGAC
ACGCTGCTGG AGCGGGCCGG CGAGATCAAG CAGCCCAAGC GCCGCGAGAC CCTGATCGAG
TATGCCGACC AGATCCGTCT GTCGCGCCAA CTCGTCAAGC TCGACTGCGA CACGCCGCTG
CCCGAGCCCA TCGATGATCT GGTGGTGCGC GAGCCGGACA AGACCGTGCT GGCCGACTTC
CTGGAGCTGA TGGAGTTCCG CTCCCTGGCC CGCCGGGTCG GCGACGGCAA CGCCTCTGCC
TCGCCGCGCA CTCTAGACCG CCCGGCCGCC GCGCCGACCG CGCCGGTGCT GGGCGTGTCC
TACATGGGCG CGGCGGCCCG GGCCGCGGCC AATCCTGTCG CCGAGCCGGC CACGATCGAC
CACGCCGCCT ATGTCCGCAT CCAGGATCTC GAGACCTTGA ACGCCTGGGT CGCCAAGGCC
ACGGCCAAGG GGATCGTCGC CTTCGACACC GAGACCGACG CCCTGTCCTC GGCCACCGCC
GGCCTGTGCG GCGTATCGCT GGCCATTTCG CCGGGCGAGG CCTGCTACAT CCCGGTCGGC
CACTGCGAGA AGGACGGCCT GGCGCTGGAG GCCGCCGCCG ATCTGGTGCA GGTGCCGATG
GAAGAGGTGA TCGCCGCGCT GAAGCCGCTG CTGGAAGACC CGGCGGTGCT GAAGATCGCC
CAGAACGCCA AGTACGACAT CGCCGTCCTG GCCCGCTACG GGATCAATGT GGGGCCGATC
GAGGACACGA TGCTGATCAG CTACGTCCTT GAGGCCGGTC TGCATGGCCA CGGGATGGAT
GAGCTGTCGG AGCTGTGGCT AGGCCACAAG CCGATCTCCT TCAAACAGGT CGCGGGCTCC
GGCAAGGGGC AGATCAGCTT CAAGCACGTG GGTCTGGCCG AGGCCACCGC CTATGCCGCC
GAGGACGCCG ACGTCACATT GCGGCTCTAC AATGTGCTCA AGCCGCGCCT GGCGCGCGAG
GGCCTGCTGA CGGTCTACGA GACCCTGGAG CGCCCGCTGC CCGCGGTGTT GGCGGCGATG
GAGAACGACG GCATCCGCGT CGATCCCGAC ACCCTGCGCC GGCTGTCCAA CGAGTTCTCG
ATGCGCATGG CCGATTTCGA GGCCAGGGCC CAGGAACTGG TCGGTCGTCC GTTCAACCTG
GGCAGTCCCA AGCAGATCGG CGACGTGTTG TTCGGCGAGA TGGCCCTGAA GGGCGGCAAG
AAGACCGCCA CGGGCCAGTG GTCGACCGAC AGCGACGTGC TCGAGGCTTT GGCCCTGGAG
CATGAGCTGC CGCGCGTGCT GCTGGACTGG CGCCAGCTGT CCAAGCTGAA GGGCACCTAT
ACCGAGAACC TGATCGCCGC GATCGCGCCG GGCGGCGGCA ACCGGGTCCA CACCTCCTAC
GCCCTGGCCG CCACCACGAC GGGCCGGCTG TCGTCGTCGG ACCCCAACCT GCAGAACATC
CCGATCCGCA CCGAGGAAGG CCGCAAGATC CGCAAGGCCT TCGTGGCCGC GCCGGGCAAG
GTGCTGATCA GCGCCGACTA CAGCCAGATC GAGCTGCGCC TGCTGGCCCA TATCGGCGAC
ATTCCCCAGC TGAAGAAGGC CTTCCAGGAG GGCCTGGACA TCCACGCCAT GACCGCGTCG
GAGATGTTCA ACGTGCCGAT CGAAGGCATG CCCTCGGAAG TGCGCCGCCG GGCCAAGGCG
ATCAATTTCG GCATCGTCTA CGGCATCAGC GCTTTTGGCC TGGCCAACCA GCTGTCGATC
CCGCAAGGCG AGGCCGGAGC CTATATCAAG ACCTATTTCG AGCGCTTCCC CGGCATCCAG
GCCTATATGG AGGCGACCAA GGCCTTCGTC CGCGAGCACG GCTACGTCAC CACGATCTTC
GGGCGCAAGA TCAACATCCC GGACATCGGC GGCAAGTCCG TGGCCCATCG GCAGTTCGCC
GAGCGCGCCG CGATCAACGC TCCGATCCAG GGCGCGGCGG CCGATGTGAT GCGCCGAGCC
ATGGTCCGCA TGCCGGGCGC GCTGAAGGCG GCGGGGCTGT CGTCACGGAT GCTGTTGCAG
GTGCACGACG AACTGGTCTT CGAAGCGCCC GAGGCCGAGG CCCAGGCCAC GATCGACGTC
GCCCGTGAGG TCATGCAGGG CGCCGCCGCG CCCGCCGTGA CCATTTCGGT GCCGCTGACG
GTGGAGGCCA GGGCCGCCGC CAACTGGGAC GAAGCCCACT AA
 
Protein sequence
MTDTLAPDSA APLLPTEAEA DRPLTQDGPT VRLFLVDGSA YLFRAYHALP PLTRKSDGLP 
VGAVQGFCNM LWKLMRDMQG DAPTHLAVIW DHSEKTFRNN LYDKYKAHRP PPPEDLIPQF
PLVREATRAF GVPAIELPGY EADDLIAAYA CKVRDMGGEA VIVSSDKDLM QLVGDGVSMF
DPMKGLRIDR DQVFEKFGVY PDRVVDVQSL CGDSVDNVPG APGIGIKTAA QLINEYGDLD
TLLERAGEIK QPKRRETLIE YADQIRLSRQ LVKLDCDTPL PEPIDDLVVR EPDKTVLADF
LELMEFRSLA RRVGDGNASA SPRTLDRPAA APTAPVLGVS YMGAAARAAA NPVAEPATID
HAAYVRIQDL ETLNAWVAKA TAKGIVAFDT ETDALSSATA GLCGVSLAIS PGEACYIPVG
HCEKDGLALE AAADLVQVPM EEVIAALKPL LEDPAVLKIA QNAKYDIAVL ARYGINVGPI
EDTMLISYVL EAGLHGHGMD ELSELWLGHK PISFKQVAGS GKGQISFKHV GLAEATAYAA
EDADVTLRLY NVLKPRLARE GLLTVYETLE RPLPAVLAAM ENDGIRVDPD TLRRLSNEFS
MRMADFEARA QELVGRPFNL GSPKQIGDVL FGEMALKGGK KTATGQWSTD SDVLEALALE
HELPRVLLDW RQLSKLKGTY TENLIAAIAP GGGNRVHTSY ALAATTTGRL SSSDPNLQNI
PIRTEEGRKI RKAFVAAPGK VLISADYSQI ELRLLAHIGD IPQLKKAFQE GLDIHAMTAS
EMFNVPIEGM PSEVRRRAKA INFGIVYGIS AFGLANQLSI PQGEAGAYIK TYFERFPGIQ
AYMEATKAFV REHGYVTTIF GRKINIPDIG GKSVAHRQFA ERAAINAPIQ GAAADVMRRA
MVRMPGALKA AGLSSRMLLQ VHDELVFEAP EAEAQATIDV AREVMQGAAA PAVTISVPLT
VEARAAANWD EAH
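
The gene and protein sequences above can be spot-checked against each other by translating the nucleotide sequence. The sketch below is a minimal illustration, not the method used to produce this record: it builds the standard codon assignments (which translation table 11 shares for internal codons; table 11 differs in its permitted start codons) and translates only the first and last lines of the gene sequence as copied from above. The names `translate`, `FIRST_LINE` and `LAST_LINE` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
# Every full line holds 60 bases, so each line starts in frame.
FIRST_LINE = "ATGACCGACA CGCTCGCCCC CGATTCCGCC GCTCCCCTTC TTCCGACCGA GGCGGAAGCA"
LAST_LINE = "GTGGAGGCCA GGGCCGCCGC CAACTGGGAC GAAGCCCACT AA"

if __name__ == "__main__":
    print(translate(FIRST_LINE))
    print(translate(LAST_LINE))
```

The two translated fragments can then be compared by eye with the start and end of the protein sequence listed above.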