Gene Caul_4497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4497 
Symbol 
ID5901958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4869518 
End bp4870786 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content71% 
IMG OID641565016 
Productpeptidase M16 domain-containing protein 
Protein accessionYP_001686115 
Protein GI167648452 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.439835 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.512739 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAATC CCACCACCCT CCACACCCTC GAAAACGGCG TCCGCGTCGT CTGTGACCCG 
ATGCCCGGGC TCGAAACCAT CGCCCTGTCG GTCGTCGCCG GTCGCGGCGC GGCCTATGAG
GACCCCGCCC GGTCGGGCTG GTCGCACCTG CTGGAGCACA TGGTGTTCAA GGGCGCGGGG
CAGCGCAGCT CACGCGACAT CGTCGAGGTG ATCGAGGCGC AGGGCGGGCA GATCAACGCC
GCCACCGGCT ATGAGCGCAC CAGCTTCCAG GTCCGCGCCC TGAAGGGCGG TCTCGACCTC
GGCATGGGCG TGCTGGCCGA CCTGGTGCTG CGCCCCACCC TCGACGAGGC CGACCTGACC
CGCGAGAAGC AGGTGGTGGC CCAGGAGATC GCCGAGGCGG CCGACGCCCC GGACGACTAC
GTGTTCGACC TGGTCCAGGC CGCCGCCTGG GGCGATCATC CGCTGGCCCG GCCGATCCTC
GGCACGGTCG ACAGCGTCAA TTCCGCCACC GTCGAGGGCC TGGCCCATTG GCGCGGCGAG
CTCTACGCCG CCGACCGGCT GATCGTCAGC GCCAGCGGCG CGGTCGATCT GGACGAGGTG
CTGGACCTGG CTCGCCGCGC GTTCGGGTCG ATGCCCGCCG AGGCCGGCGC CCTGGCCTCC
GACCCGGCCG GCTTCGTCGG CGGTCGCAAG AGCCAGGCCC GCAAGCTGGA GCAGGCCCAG
TTGGTGTTCA TGCTGCCGGC CGTCAGCGCC CGCGAGGACG ACTATTTCGC CCTGCGGATC
TTCGCCGAGG CCTTGGGTGG CGGCATGTCG TCGCGGCTGT TCCAGGAGGC CCGCGAGAAG
CGCGGCCTCG CCTACAACAT CGACGCCTAT GCCGACACCT ATGCCGACGC CGGCGCCCTG
GGCGTCTATG CCGGCTGCGC GGCCAGCGAC GCGGCCGAGA CGGCCAAGGT CTGCGCCGAG
CAGATCCTGG GCCTGGCCGC CAGGATCGAG GACGCCGAAC TGGCCCGCGC CAAGGCCCAA
CTGAAGGCCC ACATGTTCAT GGCCCGCGAA CAACCCCTGT CGCGGGCCGA ACAGGCCGCC
GGCCAGACCC TGATGTTCGA CCGGTTGTAC ACGCCGGCCG AACTGGCCGA AGCGGTGGAC
GCGGTGAGCG TCGAGGACCT GCAACGCCTG GGTCGGATGA TGCTGGGGCC TGGCAAGGCC
GCCACCGCCA TCCTGGGCGC AAAGGCGGCG CTGAAGGCGG GGGACGTGTT CGAGAAGACG
CTGGGGTAG
 
Protein sequence
MTNPTTLHTL ENGVRVVCDP MPGLETIALS VVAGRGAAYE DPARSGWSHL LEHMVFKGAG 
QRSSRDIVEV IEAQGGQINA ATGYERTSFQ VRALKGGLDL GMGVLADLVL RPTLDEADLT
REKQVVAQEI AEAADAPDDY VFDLVQAAAW GDHPLARPIL GTVDSVNSAT VEGLAHWRGE
LYAADRLIVS ASGAVDLDEV LDLARRAFGS MPAEAGALAS DPAGFVGGRK SQARKLEQAQ
LVFMLPAVSA REDDYFALRI FAEALGGGMS SRLFQEAREK RGLAYNIDAY ADTYADAGAL
GVYAGCAASD AAETAKVCAE QILGLAARIE DAELARAKAQ LKAHMFMARE QPLSRAEQAA
GQTLMFDRLY TPAELAEAVD AVSVEDLQRL GRMMLGPGKA ATAILGAKAA LKAGDVFEKT
LG