Gene Caul_5201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5201 
Symbol 
ID5897253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010335 
Strand
Start bp117916 
End bp121077 
Gene Length3162 bp 
Protein Length1053 aa 
Translation table11 
GC content68% 
IMG OID641555304 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_001676635 
Protein GI167621850 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0149235 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGC CCGATCCTGA TGATCTTGGC CCCTACGTCG AACTGCAGTG CGCTTCGCAT 
TTTTCGCTGC TGCGCGGCGC GTCCTCGCCC GGGGAGCTGT TCGACGAGGC TCAGCGCCTG
GGCTACCGCG CCCTGGCCAT TTGCGATCGC AACAGCGTCG CAGGCGTGGT GCGCGCCCAT
ATCGCCGCCA AGGCCACGGG GGTGCGGCTG ATCATCGGCT GCGAGCTGGT GCTGCGCTGC
GGTCTGACCC TGGTGGTCCT TCCCACCGAT CGCGCGGCCT ACGGCCGGCT TTGTCGCCTG
CTCTCGCTGG GCAAGTCCCG GGCGGGCAAG GGCCAATGCG ACCTTGGACT GGAGGACCTC
GCCGCCCACG CCCAAGGCCT GATCGCCATC TTGGTCCCCG ACGCGCCCGA TGATCTCTGC
GCGCTGCAGC TGAAGAAGGT GGCGGCGATC TTTGGCGTCG ACGCCCACCT GGCCCTGACC
TTGCGGCGTC GTCCGGGCGA TGCGTTACGG CTGCACCAGC TGGAGGCCAT GGCGCGCCAG
GCTGGGGTCA GGCCTGTGGT CACCAATCGC GTCCTTTTCC ACGAGAAGGG CCGGCGGCTG
CTGCAAGACG TGGTCACCTG CATCCGCGAA GGCACGACGA TCGACGATGT CGGGTTCAAG
CGCGACCGTC ACGCCGACCG CTATTTGAAG GCGCCCAGCG AAATCCAGCG GCTGCTGGCC
AAGCACCCCG ACGCCGTCAC CGCCTCGGTC GAAATCGCCC GACGCTGCCG ATTCAGCCTC
GACGAGCTGT CCTACCAGTA CCCCCACGAG GTCAGCGAGC CGGGTCGCAC GCCTCAGGAG
ACGCTGGAGC GCCTGACCTG GTTCGGCGCG GCCCAGCGCT ATCCCGACGG CTTGCCCGCC
GAGGTCGAAA AGGCCCTGGC CCATGAGCTG AGGCTGATCG GCCTGCTGGG CTATGCGCCC
TACTTCCTGA CCGTCAATTC GATCGTCCAG TACGCCCGCG GCCAGGACAT TCTTTGCCAG
GGGCGCGGCT CGGCCGCCAA CTCCGCGGTC TGTTACGTCC TGGGGGTCAC CTCGATCGAT
CCCGAGCGCA ACGATCTGCT CTTCGAACGC TTCGTCTCCC AAGAGCGCAA CGAGCCGCCA
GACATCGACG TCGATTTCGA GCACGAGCGG CGCGAGACGG TGATGCAGTG GATCTTCGAG
ACCTATGGCC GCCACCGCTG CGCCCTGGTC GCCGTCGTCC AGCGCTTTCG GCCACGCGGC
GCGGTGCGCG ACGTCGGCAA GGTCCTGGGT CTGCCCGAGG ACATGACCAA GGCCCTCTCC
AGCCAGATCT GGAGCTTTTC GCGCGAGGAC ATCGAGGAAA GCCATGCCCG CGACGTGGGC
CTGGACCTGT CCGACCGGCG CCTGCGGCTG ACCCTGGAGC TGGCCCAGCT GCTGCTCAAC
ACCCCTCGCC ACTTCTCCCA GCATCCGGGC GGCTTCGTGC TGACCGAGGA CCGACTGGAC
GAGCTCGTCC CGATCGAACC GGCGCGCATG CAGGATCGCC AGATCATCGA GTGGGACAAG
GACGACATCG ACGAGCTGAA GTTCATGAAG GTCGATGTGC TGGCCCTGGG CATGCTGACC
TGCCTCAAGC GCGGCCTGGA CCTGCTGGCC GCGCACAAGG ACATCCACAA GGACCTGGCG
ACCATTCCGC CGGAGGATCC GCGCACCTAT GCGATGATCC GCAAGGCCGA CACCCTGGGC
GTCTTCCAGA TCGAGAGCCG GGCCCAGATG GTTATGCTGC CGCGCCTCAA GCCCAGGTCC
TTCTACGACC TGGTCATCGA GGTGGCGATC GTGCGGCCCG GTCCGATCCA GGGCGATATG
GTTCATCCCT ATCTGCGTCG GCGCGAGGGC AAGGAGAAGG TCTCCTACCC CAAGCCCGAG
CTGGAGAAGG TGCTGGGCAA GACGCTCGGC GTGCCGCTGT TCCAGGAACA GGCCATGCGC
GTGGCCATCG AATGCGCCGG GTTCACGCCC GGCGAGGCCG ACCAGCTGCG CCGCTCGATG
GCCACCTTCA AGATGACCGG CGGGGTGAGC CACTTTCGGC AAAAGCTCTT GGGCGGCATG
GTCGCGCGCG GCTATCCCCA GGACTTCGCC GAGCAGACCT TCAGCCAGCT GGAGGGGTTC
GGCTCGTACG GCTTTCCAGA AAGCCACGCG GCCAGCTTCG CCCTGCTGGC CTACGCCTCC
TCCTGGCTCA AGTGCCATCA CCCCGACGTC TTTTGCGCCG CCCTGCTCAA CAGCCAGCCC
ATGGGCTTCT ATCAGCCGGC GCAGATTGTC CGAGACGCGG TCGAGCACGG CGTCCAGATC
CGTCCGATCT GCGTGAACGC CTCGCGTTGG GACTGCACGC TGGAGCCGGA CGGTGACACC
GGGCGCCTGG CCGTCCGGCT TGGGATGCGG TTGGTCAAGG GGCTGGCCGA CGGAGACGCC
GCCGCCCTGG TGTTGGCCCG GGGCGAGGAG CCCTATCGTT CGATCGACGA GGTTTGGCGC
CGGTCCGGCG CCAAACCTGG CGTTCTGGGC CGGCTGGCCG AGGCCGACGC CTTCCTGCCG
AGCCTGGGCT TGGCCCGCCG CGAGGCGGCC TGGGCGATCA AGGCTTTGCG CGACGACGCC
CTGCCGCTCT TTGACCAGCC CGGCGCCAGC GAGCTGAACG AGCCGGCCGT GGCCCTCAAG
GCCATGACCG AGGGCCGCGA GATCGTCGAG GACTACAGCC ATGTGGGCCT CTCACTGCGC
CGCCACCCGC TGGCCCTGCT GCGTTCGGAC CTCACCCGCC TGCGCCGCGC GCCCTGCCGC
GATGTCGCCC AGGGCCGCGA TGGCCGCTTC ATTCAGACCG CGGGCCTGGT GCTGGTGCGA
CAGATGCCCG GCAGCGCCAA GGGCGTTCTC TTCATGACCC TCGAGGACGA GACGGGCGTG
GCCAACCTGG TGGTCTGGAA GACGCTTTAC GAAAAGCAAC GCCGCATCGC CCTGGGCGCT
CATCTGCTGG GCGTGGACGG CCGCATCCAG CGCGAGGGCG AGGTCGTCCA CCTGGTAGCC
TACAAGCTCC ATGACCTCGG CCATGTGATG GCGGGTCTGG AGGACCGTTC GGGCGATCCG
GCCGACATGG CCTGGGCCAA GCGCTCGCGA AACTTCTGCT AG
 
Protein sequence
MSLPDPDDLG PYVELQCASH FSLLRGASSP GELFDEAQRL GYRALAICDR NSVAGVVRAH 
IAAKATGVRL IIGCELVLRC GLTLVVLPTD RAAYGRLCRL LSLGKSRAGK GQCDLGLEDL
AAHAQGLIAI LVPDAPDDLC ALQLKKVAAI FGVDAHLALT LRRRPGDALR LHQLEAMARQ
AGVRPVVTNR VLFHEKGRRL LQDVVTCIRE GTTIDDVGFK RDRHADRYLK APSEIQRLLA
KHPDAVTASV EIARRCRFSL DELSYQYPHE VSEPGRTPQE TLERLTWFGA AQRYPDGLPA
EVEKALAHEL RLIGLLGYAP YFLTVNSIVQ YARGQDILCQ GRGSAANSAV CYVLGVTSID
PERNDLLFER FVSQERNEPP DIDVDFEHER RETVMQWIFE TYGRHRCALV AVVQRFRPRG
AVRDVGKVLG LPEDMTKALS SQIWSFSRED IEESHARDVG LDLSDRRLRL TLELAQLLLN
TPRHFSQHPG GFVLTEDRLD ELVPIEPARM QDRQIIEWDK DDIDELKFMK VDVLALGMLT
CLKRGLDLLA AHKDIHKDLA TIPPEDPRTY AMIRKADTLG VFQIESRAQM VMLPRLKPRS
FYDLVIEVAI VRPGPIQGDM VHPYLRRREG KEKVSYPKPE LEKVLGKTLG VPLFQEQAMR
VAIECAGFTP GEADQLRRSM ATFKMTGGVS HFRQKLLGGM VARGYPQDFA EQTFSQLEGF
GSYGFPESHA ASFALLAYAS SWLKCHHPDV FCAALLNSQP MGFYQPAQIV RDAVEHGVQI
RPICVNASRW DCTLEPDGDT GRLAVRLGMR LVKGLADGDA AALVLARGEE PYRSIDEVWR
RSGAKPGVLG RLAEADAFLP SLGLARREAA WAIKALRDDA LPLFDQPGAS ELNEPAVALK
AMTEGREIVE DYSHVGLSLR RHPLALLRSD LTRLRRAPCR DVAQGRDGRF IQTAGLVLVR
QMPGSAKGVL FMTLEDETGV ANLVVWKTLY EKQRRIALGA HLLGVDGRIQ REGEVVHLVA
YKLHDLGHVM AGLEDRSGDP ADMAWAKRSR NFC