Gene Caul_0117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0117 
Symbol 
ID5897829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp127962 
End bp129092 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content72% 
IMG OID641560601 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_001681753 
Protein GI167644090 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.145436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACCCC TTACGAAGTT TGGGCTTACG AACGCGGGTC CGCCGAAGGG CCTCACGATC 
CTCGGCCTGG AGACCAGCTG CGACGAAACC GCCGCGTCCG TCGTGCGGCG GGACGAGGAC
GGCCATGTCA CGGTGCTGTC CTCGATCGTC GGCACCCAGT TCGAGCAGCA CGCCCCGTTC
GGCGGGGTGG TCCCCGAGAT CGCCGCCCGC GCCCATGTCG AAGCCATCGA CAGCGTGGCG
GCGGAAGCCA TGCGCGTCGC CGGGATCGGC TTCGACGCCC TGGACGGCGT GGCCGCCACC
GCCGGGCCGG GCCTGGTGGG CGGCGTGATG GTCGGGCTGG CGTTCGGCAA GGCCGTGGCC
CTGGCCCGCG ACATGCCGCT GGTGGCGGTC AATCACCTGG AAGGCCACGC GGTCTCGGCC
CGGCTGGGCG CGGATGTGGC CTATCCGTTC CTGCTGCTGC TGGTCTCCGG CGGACATTGC
CAATTGCTGG AAGTGGCCGG GGTCGGGGCC TGCACGCGCC TGGGCACCAC CATCGACGAC
GCGGCTGGCG AGGCCTTCGA CAAGATCGCC AAGAGCCTGG GCCTGCCCTA TCCCGGCGGT
CCGGCCCTCG AGAAACTGGC GGCCAGCGGC GATCCGACCA AGTTCGACCT GCCCCGCGCC
CTGCTGGGCC GCAAGGATTG CGACTTCTCG TTCTCGGGCC TGAAGACCGC CGCCGCGCGG
CTGGCCGAGA AGGTCGCGAG CCAGACCGAG CGCGCCGACC TGGCCGCCGC CGTCCAGTTC
GCCATCGCCC GCCAGCTCTC CGAACGCACC GACCGGGCGA TGAAGCTCTA CGCCGCCAGC
CACCCCGGCC AGGACCTGCG CTTCGTCGTG GCCGGCGGGG TCGCGGCCAA TGGCGCGGTC
AAGGACGCCT TGCGGAAGAA CTGCGCCGAC AACGGCTTCA GCTTCGACGC CCCGCCGCTG
GCCTACTGCA CCGACAACGC CGCGATGATC GCCCTGGCCG GGGCCGAGCG GCTGGCGGCC
GGGATCTCCG ACGACCTGGA CGCCGTGGCG CGTCCGCGCT GGCCGCTGGA CGAGGCGGCG
GCGTTGTCCA ATCCGAGCAA CAGTTTCGGC CGCAAGGGAG CCAAGGCATG A
 
Protein sequence
MTPLTKFGLT NAGPPKGLTI LGLETSCDET AASVVRRDED GHVTVLSSIV GTQFEQHAPF 
GGVVPEIAAR AHVEAIDSVA AEAMRVAGIG FDALDGVAAT AGPGLVGGVM VGLAFGKAVA
LARDMPLVAV NHLEGHAVSA RLGADVAYPF LLLLVSGGHC QLLEVAGVGA CTRLGTTIDD
AAGEAFDKIA KSLGLPYPGG PALEKLAASG DPTKFDLPRA LLGRKDCDFS FSGLKTAAAR
LAEKVASQTE RADLAAAVQF AIARQLSERT DRAMKLYAAS HPGQDLRFVV AGGVAANGAV
KDALRKNCAD NGFSFDAPPL AYCTDNAAMI ALAGAERLAA GISDDLDAVA RPRWPLDEAA
ALSNPSNSFG RKGAKA