Gene Information |
Locus tag | Caul_5201 |
Symbol | |
ID | 5897253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010335 |
Strand | - |
Start bp | 117916 |
End bp | 121077 |
Gene Length | 3162 bp |
Protein Length | 1053 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641555304 |
Product | DNA polymerase III, alpha subunit |
Protein accession | YP_001676635 |
Protein GI | 167621850 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
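The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon; neither convention is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 117916
end_bp = 121077
gene_length_bp = 3162
protein_length_aa = 1053

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein length.
expected_aa = codons - 1

print(f"span: {span_bp} bp, codons: {codons}, expected protein length: {expected_aa} aa")
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.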
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0149235 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTGC CCGATCCTGA TGATCTTGGC CCCTACGTCG AACTGCAGTG CGCTTCGCAT TTTTCGCTGC TGCGCGGCGC GTCCTCGCCC GGGGAGCTGT TCGACGAGGC TCAGCGCCTG GGCTACCGCG CCCTGGCCAT TTGCGATCGC AACAGCGTCG CAGGCGTGGT GCGCGCCCAT ATCGCCGCCA AGGCCACGGG GGTGCGGCTG ATCATCGGCT GCGAGCTGGT GCTGCGCTGC GGTCTGACCC TGGTGGTCCT TCCCACCGAT CGCGCGGCCT ACGGCCGGCT TTGTCGCCTG CTCTCGCTGG GCAAGTCCCG GGCGGGCAAG GGCCAATGCG ACCTTGGACT GGAGGACCTC GCCGCCCACG CCCAAGGCCT GATCGCCATC TTGGTCCCCG ACGCGCCCGA TGATCTCTGC GCGCTGCAGC TGAAGAAGGT GGCGGCGATC TTTGGCGTCG ACGCCCACCT GGCCCTGACC TTGCGGCGTC GTCCGGGCGA TGCGTTACGG CTGCACCAGC TGGAGGCCAT GGCGCGCCAG GCTGGGGTCA GGCCTGTGGT CACCAATCGC GTCCTTTTCC ACGAGAAGGG CCGGCGGCTG CTGCAAGACG TGGTCACCTG CATCCGCGAA GGCACGACGA TCGACGATGT CGGGTTCAAG CGCGACCGTC ACGCCGACCG CTATTTGAAG GCGCCCAGCG AAATCCAGCG GCTGCTGGCC AAGCACCCCG ACGCCGTCAC CGCCTCGGTC GAAATCGCCC GACGCTGCCG ATTCAGCCTC GACGAGCTGT CCTACCAGTA CCCCCACGAG GTCAGCGAGC CGGGTCGCAC GCCTCAGGAG ACGCTGGAGC GCCTGACCTG GTTCGGCGCG GCCCAGCGCT ATCCCGACGG CTTGCCCGCC GAGGTCGAAA AGGCCCTGGC CCATGAGCTG AGGCTGATCG GCCTGCTGGG CTATGCGCCC TACTTCCTGA CCGTCAATTC GATCGTCCAG TACGCCCGCG GCCAGGACAT TCTTTGCCAG GGGCGCGGCT CGGCCGCCAA CTCCGCGGTC TGTTACGTCC TGGGGGTCAC CTCGATCGAT CCCGAGCGCA ACGATCTGCT CTTCGAACGC TTCGTCTCCC AAGAGCGCAA CGAGCCGCCA GACATCGACG TCGATTTCGA GCACGAGCGG CGCGAGACGG TGATGCAGTG GATCTTCGAG ACCTATGGCC GCCACCGCTG CGCCCTGGTC GCCGTCGTCC AGCGCTTTCG GCCACGCGGC GCGGTGCGCG ACGTCGGCAA GGTCCTGGGT CTGCCCGAGG ACATGACCAA GGCCCTCTCC AGCCAGATCT GGAGCTTTTC GCGCGAGGAC ATCGAGGAAA GCCATGCCCG CGACGTGGGC CTGGACCTGT CCGACCGGCG CCTGCGGCTG ACCCTGGAGC TGGCCCAGCT GCTGCTCAAC ACCCCTCGCC ACTTCTCCCA GCATCCGGGC GGCTTCGTGC TGACCGAGGA CCGACTGGAC GAGCTCGTCC CGATCGAACC GGCGCGCATG CAGGATCGCC AGATCATCGA GTGGGACAAG GACGACATCG ACGAGCTGAA GTTCATGAAG GTCGATGTGC TGGCCCTGGG CATGCTGACC TGCCTCAAGC GCGGCCTGGA CCTGCTGGCC GCGCACAAGG ACATCCACAA GGACCTGGCG ACCATTCCGC CGGAGGATCC GCGCACCTAT GCGATGATCC GCAAGGCCGA CACCCTGGGC GTCTTCCAGA TCGAGAGCCG GGCCCAGATG GTTATGCTGC CGCGCCTCAA GCCCAGGTCC 
TTCTACGACC TGGTCATCGA GGTGGCGATC GTGCGGCCCG GTCCGATCCA GGGCGATATG GTTCATCCCT ATCTGCGTCG GCGCGAGGGC AAGGAGAAGG TCTCCTACCC CAAGCCCGAG CTGGAGAAGG TGCTGGGCAA GACGCTCGGC GTGCCGCTGT TCCAGGAACA GGCCATGCGC GTGGCCATCG AATGCGCCGG GTTCACGCCC GGCGAGGCCG ACCAGCTGCG CCGCTCGATG GCCACCTTCA AGATGACCGG CGGGGTGAGC CACTTTCGGC AAAAGCTCTT GGGCGGCATG GTCGCGCGCG GCTATCCCCA GGACTTCGCC GAGCAGACCT TCAGCCAGCT GGAGGGGTTC GGCTCGTACG GCTTTCCAGA AAGCCACGCG GCCAGCTTCG CCCTGCTGGC CTACGCCTCC TCCTGGCTCA AGTGCCATCA CCCCGACGTC TTTTGCGCCG CCCTGCTCAA CAGCCAGCCC ATGGGCTTCT ATCAGCCGGC GCAGATTGTC CGAGACGCGG TCGAGCACGG CGTCCAGATC CGTCCGATCT GCGTGAACGC CTCGCGTTGG GACTGCACGC TGGAGCCGGA CGGTGACACC GGGCGCCTGG CCGTCCGGCT TGGGATGCGG TTGGTCAAGG GGCTGGCCGA CGGAGACGCC GCCGCCCTGG TGTTGGCCCG GGGCGAGGAG CCCTATCGTT CGATCGACGA GGTTTGGCGC CGGTCCGGCG CCAAACCTGG CGTTCTGGGC CGGCTGGCCG AGGCCGACGC CTTCCTGCCG AGCCTGGGCT TGGCCCGCCG CGAGGCGGCC TGGGCGATCA AGGCTTTGCG CGACGACGCC CTGCCGCTCT TTGACCAGCC CGGCGCCAGC GAGCTGAACG AGCCGGCCGT GGCCCTCAAG GCCATGACCG AGGGCCGCGA GATCGTCGAG GACTACAGCC ATGTGGGCCT CTCACTGCGC CGCCACCCGC TGGCCCTGCT GCGTTCGGAC CTCACCCGCC TGCGCCGCGC GCCCTGCCGC GATGTCGCCC AGGGCCGCGA TGGCCGCTTC ATTCAGACCG CGGGCCTGGT GCTGGTGCGA CAGATGCCCG GCAGCGCCAA GGGCGTTCTC TTCATGACCC TCGAGGACGA GACGGGCGTG GCCAACCTGG TGGTCTGGAA GACGCTTTAC GAAAAGCAAC GCCGCATCGC CCTGGGCGCT CATCTGCTGG GCGTGGACGG CCGCATCCAG CGCGAGGGCG AGGTCGTCCA CCTGGTAGCC TACAAGCTCC ATGACCTCGG CCATGTGATG GCGGGTCTGG AGGACCGTTC GGGCGATCCG GCCGACATGG CCTGGGCCAA GCGCTCGCGA AACTTCTGCT AG
Protein sequence | MSLPDPDDLG PYVELQCASH FSLLRGASSP GELFDEAQRL GYRALAICDR NSVAGVVRAH IAAKATGVRL IIGCELVLRC GLTLVVLPTD RAAYGRLCRL LSLGKSRAGK GQCDLGLEDL AAHAQGLIAI LVPDAPDDLC ALQLKKVAAI FGVDAHLALT LRRRPGDALR LHQLEAMARQ AGVRPVVTNR VLFHEKGRRL LQDVVTCIRE GTTIDDVGFK RDRHADRYLK APSEIQRLLA KHPDAVTASV EIARRCRFSL DELSYQYPHE VSEPGRTPQE TLERLTWFGA AQRYPDGLPA EVEKALAHEL RLIGLLGYAP YFLTVNSIVQ YARGQDILCQ GRGSAANSAV CYVLGVTSID PERNDLLFER FVSQERNEPP DIDVDFEHER RETVMQWIFE TYGRHRCALV AVVQRFRPRG AVRDVGKVLG LPEDMTKALS SQIWSFSRED IEESHARDVG LDLSDRRLRL TLELAQLLLN TPRHFSQHPG GFVLTEDRLD ELVPIEPARM QDRQIIEWDK DDIDELKFMK VDVLALGMLT CLKRGLDLLA AHKDIHKDLA TIPPEDPRTY AMIRKADTLG VFQIESRAQM VMLPRLKPRS FYDLVIEVAI VRPGPIQGDM VHPYLRRREG KEKVSYPKPE LEKVLGKTLG VPLFQEQAMR VAIECAGFTP GEADQLRRSM ATFKMTGGVS HFRQKLLGGM VARGYPQDFA EQTFSQLEGF GSYGFPESHA ASFALLAYAS SWLKCHHPDV FCAALLNSQP MGFYQPAQIV RDAVEHGVQI RPICVNASRW DCTLEPDGDT GRLAVRLGMR LVKGLADGDA AALVLARGEE PYRSIDEVWR RSGAKPGVLG RLAEADAFLP SLGLARREAA WAIKALRDDA LPLFDQPGAS ELNEPAVALK AMTEGREIVE DYSHVGLSLR RHPLALLRSD LTRLRRAPCR DVAQGRDGRF IQTAGLVLVR QMPGSAKGVL FMTLEDETGV ANLVVWKTLY EKQRRIALGA HLLGVDGRIQ REGEVVHLVA YKLHDLGHVM AGLEDRSGDP ADMAWAKRSR NFC
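The relationship between the two sequences can be spot-checked by translating the ends of the gene sequence. This is a minimal sketch rather than a full translation tool: it builds the codon-to-amino-acid assignments in the usual NCBI ordering (translation table 11 shares these assignments with the standard code and differs only in its permitted start codons), and the function name `translate` and the two excerpts are chosen here for illustration. The gene sequence is taken as listed, on the assumption that it is already given in coding orientation even though the gene lies on the minus strand.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, ignoring whitespace between blocks."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 12 nucleotides of the gene sequence listed above.
first_30 = "ATGAGCCTGC CCGATCCTGA TGATCTTGGC"
last_12 = "AACTTCTGCT AG"

print(translate(first_30))  # compare with the start of the protein sequence
print(translate(last_12))   # compare with the end of the protein sequence
```

The first excerpt should reproduce the opening residues of the protein sequence, and the second should end in a stop codon immediately after the final residues.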