Gene Caul_4461 details

Gene Information

Locus tag: Caul_4461
Symbol:
ID: 5901922
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4830593
End bp: 4833946
Gene Length: 3354 bp
Protein Length: 1117 aa
Translation table: 11
GC content: 68%
IMG OID: 641564980
Product: DNA polymerase III, alpha subunit
Protein accession: YP_001686079
Protein GI: 167648416
COG category: [L] Replication, recombination and repair
COG ID: [COG0587] DNA polymerase III, alpha subunit
TIGRFAM ID: [TIGR00594] DNA-directed DNA polymerase III (polc)
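
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends with a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4830593
END_BP = 4833946
GENE_LENGTH_BP = 3354
PROTEIN_LENGTH_AA = 1117


def span_length(start, end):
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one stop codon (assumed to be present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 4833946 - 4830593 + 1 = 3354 bp, and 3354 / 3 - 1 = 1117 aa.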


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.681559
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACCC ACGCCTACGC CGAACTGCAA GCCACCACCA ACTTCTCCTT CCTGCGCGGC 
GCCTCGCACG CCCAGGAGCT GATGCTGACC GGCCAGGCCC TGGGCCTGAC CGCCGTGGGG
GTCGCCGATC GCAACAGCCT GGCCGGGGTG GTGCGGGCCT GGACGGCGGC CAAGGACCTC
AAGATCCGGG CGCTGACCGG CTGCCGGCTG GACTTCATGG ACGGCGCGCC CAGCCTGCTC
TGCTACCCGT CCGACCGCGA GGCCTTCGGT CGCCTGACCC GCCTGCTGAC CGTGGGCCAG
CGCCGGGCCG GGAAGGGAGA GTGCCACCTC TCCTGGAAGG ACTTCCTGGA GCACGCCGAG
GGCCAGCTGG CCCTGATCGT GCCGCCGGCC CGGCTGGACG AGGCCTTCGA GGCCGACCTG
GCGCGGATCG CCGGCGAACT GCGCGGCCGG GTCTGGCTGG CCGCCAGCCG GGCCTACGCC
GCCCAGGATC TCAAGCGCCT GTCGCGGCTG GCCGACCTGG CGCGCGAGGC CCGCGCGCCG
ATGGTGGCGA CCAACGACGT GCTCTATCAC GGCCCCGAGC GGCGCCCGCT GCAGGACGTG
ATGACCTGCA TCCGCGAGGG CTGCACGATC ACGCAGGCGG GCTTTCGCCT GGAGGCCAAC
GCCGAGCGCC ACATCAAGAC CCCCGCCGAG ATGGCCCGGC TGTTCGAGCG CTGGCCGCAA
GCGGTGGAGC GCACGATCGA GATCGTCGAG CGCATCGGCT TCGACCTGGG CGACCTGAAG
GAGCAATATC CCGACGAGCC CGTGCCGCCC GGCAAGACGG CGATGCAGCA CCTGACCGAC
CTGACCTGGC AGGGCGCGGC CTGGCGTTAT CCGAACGGCG TGTCGCCCAA GGTCGCCGAT
CAGCTGCGCG AGGAACTCCG GCTGATCGCC AAGATGGACT ATCCTAACTA TTTCATCACC
GTGCACGACA TCGTCCACAA GGCGCGTGAG ATGGGCATCC TGTGCCAGGG GCGGGGGTCG
GCGGCTAATT CCTCGGTCTG CTTCTGCCTG GGGGTGACGG CGATCGACCC CACCGAGCAC
CGCCTGCTGT TCACCCGCTT CATCTCCGAG AACCGCGGCG AGCCGCCGGA CATCGACGTC
GACTTCGAGC ACGAGCGGCG CGAGGAGGTG ATGCAATATG TTTACAAGCG CTATGGCCGC
GAATACGCCG CCATCTGCTG CACGGTGATC CACTATCGGC CGCGCAGCGC CATCCGCGAC
GTCGGCAAGG CCCTGGGCCT GACCGAGGAC ATCACCGCGA TCCTGGCCAA CACCGTCTGG
GGCAGCTGGG GCGACGGCCT GCCCGACGAA CACATCAAGC AGACGGGGCT TGATCCGAAC
AATCCGCAGA TCGCTCGGGC CATCGCCCTG GCCACCGAGC TGCTGCAGCA CCCCCGCCAC
CTGTCGCAGC ATGTGGGCGG CTTCGTGCTG ACCAAGCGGC GGCTGGACGA GACCGTGCCG
ATCGGCAACG CGGCGATGAA GGACCGCACC TTCATCGAGT GGGACAAGGA CGACATCGAC
AGTCTGAAAC TGATGAAGGT CGACATTCTG GCGCTCGGCA TGCTGACCGC CATCCAGCGG
GCGTTCGGCA TGCTGCGCGC CGATCACGGC CAGCCCATCA CCGACCTGGC CGACGTGCCG
GTCGAGGTGA AGGGCGTCTA CGACATGCTG TCGGTGGCCG ACAGCGTGGG GGTGTTCCAG
GTCGAGAGCC GGGCGCAGAT GTCGATGCTG CCGAGGCTGA AGCCGAACAA GTTCTATGAC
CTGGTCATCG AGGTGGCGAT CGTCCGGCCG GGGCCGATCC AGGGCGACAT GGTTCATCCC
TATCTGCGGC GGCGACAGGG GCTGGAGAAG GCCGAATGGC CCGCGCCGTC ACCTGAGCAT
GGGCACAAGG ACGAACTGCG AGAGATCCTT GGCAATACCT TCGGCGTGCC CCTCTTCCAG
GAACAGGCCA TGAGCCTGGC CATCGAGGCG GCCAAGTTCA CGCCTGACGA GGCCGACGGC
CTGCGCAAGG CCATGGCCAC CTTCAAGAAC CTGGGCGACC CCGGCGAGTA CCGTGACAAG
TTCGTCGAGG GCATGGTGCG GCGCGGCTAC GAACGCGACT TCGCCACCCG GTGCTTCAAG
CAGATCGAGG GCTTCGGCTC CTACGGCTTC CCGGAAAGCC ACGCGGCCAG CTTCGCCAAG
CTGGTCTATA TCTCGGCCTG GATCAAATGG GCCTGGCCGG ACGTGTTCTG CGCCGCCCTG
ATCAATTCCC AGCCGATGGG CTTCTACCAG CCGGCCCAAC TGGTGCGCGA CGCGCGCGAG
CATGGAGTCG AGGTGCGGGC GCCGGACGTG CTGATGAGCG ACTGGGACTG CACGCTGGAG
GGTGTATCCA TCGCCCAAAC TTCGACTGTC ATCCCGGACG AAGCGCAGCG AAGATCCGGG
ACCGACCCAG AGCGCCGCGT TTCCGGCGGT CCCGGATCTT CGCGCTCCGC GCTCGTCCGG
GATGACAGGG CTTTTCAGCC CGCCTCGGAC AAGCTCTTCA ACCCCGATAC CCGCCCGTTC
TGGAAGGCGG TCCGCCTGGG CCTGCGCCAG ATCAAGGGAC TCAAGGAAGA CGACGCCAAG
CTCATCGAAG AAGTCCGCGC CGAAGGCGCC CGCACGCCCG CCGACTTCGC CCGGGCCGGG
GTGTCGCAGC GCGGGTTGGA GTTGCTGGCC GAGGCCGACG CCTTCGCCTC GGCGGGCCTG
ACGCGGCGCG ACGCCCTGTG GGCGGTCAAG GGCCTGAAGG GCGAGGCCAA GGTCGACAGC
CAGGCGCCGC TGCTGGCCGG GCTGCCGCTG TTCGACACCG CCGTCGCCTT GCCGACCATG
GCCCTCCCCC AGCAGGTGGC CGAGGACTAC CGCACCACCA GCCTGTCGCT GAAGGCCCAC
CCGTTGCGGT TCTTCCGGCC AATGCTGGAC CAGTTGAACG TCACGCCCGC CGAGCGGCTG
AAGGGCGTCC GCAACGGCCG CAAGGTCAGC GTCGGCGGCT TGGTGCTGAT CCGCCAGCGG
CCGGGCACCG CCAAGGGCGT GGTGTTCCTG ACCCTGGAGG ACGAGACGGG CGTAGCCAAT
GCGGTGGTCT GGAAGGACTG TTTCGACGCT CACCGCCGCA CGGTGATGAG CGCCTCGTTC
CTGATCGTCC ACGGCAAACT ACAAGCGTCC GAAGGGGTTA TCCACGTGGT GGCCGAGCGC
TTCACCGACC TATCGGCCGA ACTGGCGCGG CTGCGGGAGT CCCCGGAAGC GCCGACGCCG
ACGGTGCGGA TGCGGACGTC CGGGCGGCTG CAGCGGAGCC GGGATTTCCA CTAG
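
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of how a value such as the GC content field could be recomputed from the sequence text; the helper names are hypothetical, and the record does not say how its own GC figure was derived or rounded.

```python
def clean_sequence(text):
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First printed line of the gene sequence, copied from this record.
first_line = "ATGAAAACCC ACGCCTACGC CGAACTGCAA GCCACCACCA ACTTCTCCTT CCTGCGCGGC"
seq = clean_sequence(first_line)
print(len(seq))
print(f"{100 * gc_content(seq):.1f}% GC")
```

Passing the full gene sequence text to clean_sequence and gc_content gives the percentage for the whole gene, which can then be compared with the GC content field.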
 
Protein sequence
MKTHAYAELQ ATTNFSFLRG ASHAQELMLT GQALGLTAVG VADRNSLAGV VRAWTAAKDL 
KIRALTGCRL DFMDGAPSLL CYPSDREAFG RLTRLLTVGQ RRAGKGECHL SWKDFLEHAE
GQLALIVPPA RLDEAFEADL ARIAGELRGR VWLAASRAYA AQDLKRLSRL ADLAREARAP
MVATNDVLYH GPERRPLQDV MTCIREGCTI TQAGFRLEAN AERHIKTPAE MARLFERWPQ
AVERTIEIVE RIGFDLGDLK EQYPDEPVPP GKTAMQHLTD LTWQGAAWRY PNGVSPKVAD
QLREELRLIA KMDYPNYFIT VHDIVHKARE MGILCQGRGS AANSSVCFCL GVTAIDPTEH
RLLFTRFISE NRGEPPDIDV DFEHERREEV MQYVYKRYGR EYAAICCTVI HYRPRSAIRD
VGKALGLTED ITAILANTVW GSWGDGLPDE HIKQTGLDPN NPQIARAIAL ATELLQHPRH
LSQHVGGFVL TKRRLDETVP IGNAAMKDRT FIEWDKDDID SLKLMKVDIL ALGMLTAIQR
AFGMLRADHG QPITDLADVP VEVKGVYDML SVADSVGVFQ VESRAQMSML PRLKPNKFYD
LVIEVAIVRP GPIQGDMVHP YLRRRQGLEK AEWPAPSPEH GHKDELREIL GNTFGVPLFQ
EQAMSLAIEA AKFTPDEADG LRKAMATFKN LGDPGEYRDK FVEGMVRRGY ERDFATRCFK
QIEGFGSYGF PESHAASFAK LVYISAWIKW AWPDVFCAAL INSQPMGFYQ PAQLVRDARE
HGVEVRAPDV LMSDWDCTLE GVSIAQTSTV IPDEAQRRSG TDPERRVSGG PGSSRSALVR
DDRAFQPASD KLFNPDTRPF WKAVRLGLRQ IKGLKEDDAK LIEEVRAEGA RTPADFARAG
VSQRGLELLA EADAFASAGL TRRDALWAVK GLKGEAKVDS QAPLLAGLPL FDTAVALPTM
ALPQQVAEDY RTTSLSLKAH PLRFFRPMLD QLNVTPAERL KGVRNGRKVS VGGLVLIRQR
PGTAKGVVFL TLEDETGVAN AVVWKDCFDA HRRTVMSASF LIVHGKLQAS EGVIHVVAER
FTDLSAELAR LRESPEAPTP TVRMRTSGRL QRSRDFH
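
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. As a hedged sketch: table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which start codons are permitted, so a standard codon table is sufficient for a CDS that begins with ATG, as this one does. The translate function below is illustrative, reads frame 1 only, stops at the first stop codon, and raises KeyError on ambiguous bases.

```python
from itertools import product

# Standard codon table in TCAG order; amino acid assignments are shared
# by NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGAAAACCC ACGCCTACGC CGAACTGCAA"))  # MKTHAYAELQ
print(translate("GATTTCCACTAG"))                      # DFH
```

The two fragments reproduce the first ten and last three residues of the protein sequence listed above.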