Gene Information |
Locus tag | Caul_0117 |
Symbol | |
ID | 5897829 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 127962 |
End bp | 129092 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641560601 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_001681753 |
Protein GI | 167644090 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
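The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The helper names `span_length` and `protein_length` are hypothetical.

```python
# Values copied from the record above.
START_BP = 127962
END_BP = 129092
GENE_LENGTH_BP = 1131
PROTEIN_LENGTH_AA = 376


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

# Compare the computed values with the values listed in the record.
print("gene length matches:", computed_gene_length == GENE_LENGTH_BP)
print("protein length matches:", computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons agree with the listed values of 1131 bp and 376 aa.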
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.145436 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACCCC TTACGAAGTT TGGGCTTACG AACGCGGGTC CGCCGAAGGG CCTCACGATC CTCGGCCTGG AGACCAGCTG CGACGAAACC GCCGCGTCCG TCGTGCGGCG GGACGAGGAC GGCCATGTCA CGGTGCTGTC CTCGATCGTC GGCACCCAGT TCGAGCAGCA CGCCCCGTTC GGCGGGGTGG TCCCCGAGAT CGCCGCCCGC GCCCATGTCG AAGCCATCGA CAGCGTGGCG GCGGAAGCCA TGCGCGTCGC CGGGATCGGC TTCGACGCCC TGGACGGCGT GGCCGCCACC GCCGGGCCGG GCCTGGTGGG CGGCGTGATG GTCGGGCTGG CGTTCGGCAA GGCCGTGGCC CTGGCCCGCG ACATGCCGCT GGTGGCGGTC AATCACCTGG AAGGCCACGC GGTCTCGGCC CGGCTGGGCG CGGATGTGGC CTATCCGTTC CTGCTGCTGC TGGTCTCCGG CGGACATTGC CAATTGCTGG AAGTGGCCGG GGTCGGGGCC TGCACGCGCC TGGGCACCAC CATCGACGAC GCGGCTGGCG AGGCCTTCGA CAAGATCGCC AAGAGCCTGG GCCTGCCCTA TCCCGGCGGT CCGGCCCTCG AGAAACTGGC GGCCAGCGGC GATCCGACCA AGTTCGACCT GCCCCGCGCC CTGCTGGGCC GCAAGGATTG CGACTTCTCG TTCTCGGGCC TGAAGACCGC CGCCGCGCGG CTGGCCGAGA AGGTCGCGAG CCAGACCGAG CGCGCCGACC TGGCCGCCGC CGTCCAGTTC GCCATCGCCC GCCAGCTCTC CGAACGCACC GACCGGGCGA TGAAGCTCTA CGCCGCCAGC CACCCCGGCC AGGACCTGCG CTTCGTCGTG GCCGGCGGGG TCGCGGCCAA TGGCGCGGTC AAGGACGCCT TGCGGAAGAA CTGCGCCGAC AACGGCTTCA GCTTCGACGC CCCGCCGCTG GCCTACTGCA CCGACAACGC CGCGATGATC GCCCTGGCCG GGGCCGAGCG GCTGGCGGCC GGGATCTCCG ACGACCTGGA CGCCGTGGCG CGTCCGCGCT GGCCGCTGGA CGAGGCGGCG GCGTTGTCCA ATCCGAGCAA CAGTTTCGGC CGCAAGGGAG CCAAGGCATG A
|
Protein sequence | MTPLTKFGLT NAGPPKGLTI LGLETSCDET AASVVRRDED GHVTVLSSIV GTQFEQHAPF GGVVPEIAAR AHVEAIDSVA AEAMRVAGIG FDALDGVAAT AGPGLVGGVM VGLAFGKAVA LARDMPLVAV NHLEGHAVSA RLGADVAYPF LLLLVSGGHC QLLEVAGVGA CTRLGTTIDD AAGEAFDKIA KSLGLPYPGG PALEKLAASG DPTKFDLPRA LLGRKDCDFS FSGLKTAAAR LAEKVASQTE RADLAAAVQF AIARQLSERT DRAMKLYAAS HPGQDLRFVV AGGVAANGAV KDALRKNCAD NGFSFDAPPL AYCTDNAAMI ALAGAERLAA GISDDLDAVA RPRWPLDEAA ALSNPSNSFG RKGAKA