Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1076 |
Symbol | |
ID | 5898531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1134506 |
End bp | 1136719 |
Gene Length | 2214 bp |
Protein Length | 737 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641561558 |
Product | catalase/peroxidase HPI |
Protein accession | YP_001682704 |
Protein GI | 167645041 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0376] Catalase (peroxidase I) |
TIGRFAM ID | [TIGR00198] catalase/peroxidase HPI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.257821 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGGAA GCGAACTCGT CGAGCCGACG ACCAAGTGCC CGCTGAAGCA CGGCGTCCGG TTCCACACCA GTTTCGGAGG CCGCTCGAAC CGCGACTGGT GGCCCAACCA ACTGAACTTG AAGATCCTCC ACCAGCACGC ACCGGCGTCC AACCCGATGC CCGCCGGGTT CAGCTACGCC GAGCAGGTCG AGACACTGGA TGTCGAGGCC CTGAAGCGGG ACCTGGCGGC GCTGATGACC GATTCCCAGG ACTGGTGGCC GGCCGACTAT GGCCACTATG GCCCCCTGTT CGTGCGGATG GCCTGGCACA GCGCCGGCAC CTATCGCACC GGCGACGGCC GCGGCGGCGC CGGCGGCGGC CAGCAGCGTT TCGCGCCGCT GAACAGCTGG CCGGACAACG GCAACCTCGA CAAGGCCCGC CGCCTGATCT GGCCAATCAA GCAGAAGTAC GGCGCCCGGA TCAGCTGGGC CGACCTGATG ATCCTGGCCG CCGACGTCGG CATGGAGACC ATGGGCTTCA AGACCTTCGG CTTCGGCTTC GGCCGCGAGG ACACCTGGGA GCCCGAGGAG GACGTCCACT GGGGGGCCGA GGACACCTGG CTGGGCGACG CTCGCTACAC CGGCGAGCGC GAGCTGGACA AGCCGCTGGG CGCCGTCCAG ATGGGCCTGA TCTACGTCAA TCCGGAAGGC CCCAACGGCA AGCCCGACCC GCTGGCCGCC GCCCATGACA TCCGCGAGAC CTTCGCGCGC ATGGCCATGA ACGACGAGGA GACTGTCGCC CTGATCGCTG GCGGCCACAC CTTCGGCAAG GCCCACGGCG CGGGCGACGC GGCCCACGTC GGGGTCGAGC CCGAAGCGGC CGGCATCGCC CTGCAGGGCC TGGGCTGGAA GAACAGCTTC GGCAGCGGGG TCGGCAGCGA CGCCATCACC AGCGGCCTGG AAGGCCCGTG GACCCCCAAT CCGATCAAGT GGGACAACGG CTTCTTCGAC ACGCTGTTCG GCCACGAGTG GGAACTGACC AAGAGCCCGG CCGGCGCCTT CCAGTGGACG CCCAAGGATC CGGAGGCCGG ACCCAAGGCG CCCGACGCCC ACGACCCGTC CAGGCAGGTG GCGCCAATGA TGCTGACCAC GGACCTGGCC CTGCGGCTCG ACCCCAACTA TGGGCCGATC TCCAAGCGGT TCCACGAGAA CCCGGACCAG TTCCAGGACG CCTTCGCCCG CGCCTGGTTC AAGCTGACCC ACCGCGACAT GGGTCCCAAG GCCCGCTACC TCGGCCCGCT CGTGCCCCAG GAAGAGCTGC TGTGGCAGGA CCCGCTGCCG GAGCCCCAAG GTCCGCCGAT CGACGCCAAC GACATCCGCG AGCTGAAGGC CAAGGTGCTG GCCACCGGGC TGTCCGTGCC CCAGCTGGTC GCGACGGCCT GGGCCTCGGC CTCGACCTTC CGGGGTTCGG ACAAGCGCGG CGGCGCGAAC GGCGCGCGCA TCCGCCTGTC GCCGCAGAAG GACTGGGCGG TCAACCAGCC GGCCCAGCTG GCGAACGTGC TGGCCACGCT GGAGGGCGTC CAGTCGGCGT TCAACGGCGG CCAAACCGAC GGCAAGACGG TCTCCCTGGC CGACCTGATC GTGCTGGCGG GCTGCGCCGC CGTCGAACAG GCCGCCAAGG CCGCCGGCCA CGACGTCGAG GTTCCGTTCA CGCCCGGCCG GGTCGACGCC TCGCAGAACC AGACCGACGT GGCGTCGTTC GGGGTGCTGG AGCCCAAGGC CGACGGCTTC CGCAACTACC TGAACACCGA CCTGCCGCTC ACCGCCGAGG AACTGCTGGT CGACAAGGCC CAGCTGCTGA CCTTGAGCGC GCCGGAAATG ACGGTCCTGG TCGGCGGCCT GCGGGCCCTG AACGCCAACA CCGACCAGTC GTCGCACGGC GTCTTCACGA CGCGGCCGGG CTCGCTGACC AACGACTTCT TCGTCAACCT GCTGGACATG CGCACGGTGT GGACCGCCAC CTCGGAGGAC GAAGCCCAGT TCGAGGGCCG CGACCGGACG ACGGGCGACC TGAAATGGAC CGCCACCCGG GTCGACCTGA TCTTTGGGTC CAATTCCCAG CTGCGCGCCC TGGCCGAGGT GTTCGCCCAG TCCGACTCGC AGGGCGCGTT CGTGGGCGCC TTCGTGGCGG CCTGGACCAA GGTGATGAAC CTGGATCGCT TCGACCTGGC ATGA
|
Protein sequence | MDGSELVEPT TKCPLKHGVR FHTSFGGRSN RDWWPNQLNL KILHQHAPAS NPMPAGFSYA EQVETLDVEA LKRDLAALMT DSQDWWPADY GHYGPLFVRM AWHSAGTYRT GDGRGGAGGG QQRFAPLNSW PDNGNLDKAR RLIWPIKQKY GARISWADLM ILAADVGMET MGFKTFGFGF GREDTWEPEE DVHWGAEDTW LGDARYTGER ELDKPLGAVQ MGLIYVNPEG PNGKPDPLAA AHDIRETFAR MAMNDEETVA LIAGGHTFGK AHGAGDAAHV GVEPEAAGIA LQGLGWKNSF GSGVGSDAIT SGLEGPWTPN PIKWDNGFFD TLFGHEWELT KSPAGAFQWT PKDPEAGPKA PDAHDPSRQV APMMLTTDLA LRLDPNYGPI SKRFHENPDQ FQDAFARAWF KLTHRDMGPK ARYLGPLVPQ EELLWQDPLP EPQGPPIDAN DIRELKAKVL ATGLSVPQLV ATAWASASTF RGSDKRGGAN GARIRLSPQK DWAVNQPAQL ANVLATLEGV QSAFNGGQTD GKTVSLADLI VLAGCAAVEQ AAKAAGHDVE VPFTPGRVDA SQNQTDVASF GVLEPKADGF RNYLNTDLPL TAEELLVDKA QLLTLSAPEM TVLVGGLRAL NANTDQSSHG VFTTRPGSLT NDFFVNLLDM RTVWTATSED EAQFEGRDRT TGDLKWTATR VDLIFGSNSQ LRALAEVFAQ SDSQGAFVGA FVAAWTKVMN LDRFDLA
|
| |