Gene Caul_3112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3112 
Symbol 
ID5900567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3376799 
End bp3378700 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content71% 
IMG OID641563615 
Productpeptidase M23B 
Protein accessionYP_001684737 
Protein GI167647074 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.233689 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCAGT TGTGGACGCG CGCGGCGGTG ATCGCCCTGA CGGCTGGAAC GCTCGCGGCC 
TGCGAAAGCA CGGGCGGCGC GCAATATCCC ACCACGCGCC AGCCGCCCAC GCCGAACTTC
CCGATCGTCC AGGCGCCGGT GGCCAACCAG CAGCCTGAAG GCCCGTCGAC GCCGCCCGAA
GAGCCGCAGC CGAGCATTTC GACGCCAACC AGCAGCGTCG GCGTGTCCAG CCAGGCGCTG
GCGCCGGTGA CGACCGCCGA ACCCCCGCCG CCGCCTCCGC CCCCGGTCGA ATACCGCCCC
GCGCCGACTC CGCCGCCGCG GCCCGTGGTG GTCACCAGCG TGGCTGGCCC CGTGGTGACG
ATCCCGGGCC CGCCGAGGAC CTACAAGGTC AAGGCGGGCG ACAACATCGA CGCCATCGCC
CGCACGCTGG GCACGACGCG CGCGGATCTG GTGAAGGACA ACGACCTGAA GTCGCCGTAC
CGGATCCATC CCGGTCAGGT GCTGCAGGGA CCCGACGGCA AGGACGCCAA GGCCTATGTG
GTTCAGACCG GCGACACGAT GTTCGCCATC GCCAAGCGCT TCTCGGTCAC CGCCGCCGCC
ATCGCCGACG AGAACGACGT CGGCGCGAAC TCGGCCCTCA AGAAGGGCCA GAAGCTGCGC
CTGCCGTCCG GCTACAAGGA CAAGGGTCCG ACCAAGACCA CGGTGATGCA GGCCGCTCCC
CAAGGTTCGA ACAGCAGCCA GCGACCGATC GGTTCCCAGG CTCCGAGCCG CCCGCCCGTC
CAGAGCCAGC CCGCCTATAC GCCGGCGCCT TCACGTGGAC CCGTCGAGGA GCCCGAGGCC
GCGCCGGCCC GCCCGGTCAC CACCACGACG ACCAGCGTCA CCGGCCCTGT GGTCGAGGTG
GCCGGTCCGC GCCGCACCTA CACCGTCAAG GCGGGCGACG CGATCGACGC CATCGCCCGG
GGCCTGGACA CCACGCGCGC CGACCTGGTC GAGGACAACA AGCTCAAGCC GCCCTACCGC
ATCCATCCCG GCCAGAAGCT CAAAGGTCCG GCGACCACGG CCAGGGCCTA TGTCGCCAAT
AGCGGCGACA CCCTGTCCAA CATCGCCAAG CGCTTCAACG TCAAGCCGGC GGCCCTGGCC
GATGAGAACG ACATCAAGGT CTCGGCCACG ATCAAGAAGG GCCAGAAGAT CCGCCTGCCG
TCCGGCTACA AGGACAAGGG CCCGCTGAAG ACCACCACGA CCACGACCCC CGCGGCGCCG
CGTCCCGTGA CCCCGCGTCC GATCACGCCG GCTCCGATCT ACAATCCGCC GGCCGAGACC
CAGACGCCGC CGCCGGCCTA CACGCCGACC GGTCCCGGAC CGCGTCCCTA TACGCCGCCG
CCGGCCACGA ACTATCCGCG TTCCACCGGC CCGGTCTCGG CCCAGCCGGT GACGCCGCCG
CCATCGCCGG GCCAGATCAT CGGCAGCAGC CCGCCGCCGA CCGAGGCCGA GATCACCGCC
GCCGGCCGCG GCCGCTTCGT CTGGCCGCTG CGAGGCGACA CGATCTCCGA CTTCGGTCCC
AAGGGCACGG GCCAGCGCAA CGACGGCGTC AACATCCGCG CCGCCGCCGG CACGCCGGTT
CGCGCGGCCG CCGCCGGGGA AGTGGTCTAT GCCGGCAACC AGGTGCCGGG CTTCGGCAAT
CTGGTGCTGG TCAAGCACGC CGACGGCTGG GTGACGGCCT ATGCCCACTT GTCGGCGACC
GAGGTGAAGA TGCGCCAGCA GGTGTCGCAG GGCGACACCC TGGGCGCGGT CGGCCAGACC
GGCGGCGTCA CCGAGCCGCA ACTGCACTTC GAGGTCCGCT ACGCGCCCAC GCCCAAGGAC
AAGGCGCGTC CGGTGGATCC GGGCCTGGTG CTGCCGAGGT AG
 
Protein sequence
MRQLWTRAAV IALTAGTLAA CESTGGAQYP TTRQPPTPNF PIVQAPVANQ QPEGPSTPPE 
EPQPSISTPT SSVGVSSQAL APVTTAEPPP PPPPPVEYRP APTPPPRPVV VTSVAGPVVT
IPGPPRTYKV KAGDNIDAIA RTLGTTRADL VKDNDLKSPY RIHPGQVLQG PDGKDAKAYV
VQTGDTMFAI AKRFSVTAAA IADENDVGAN SALKKGQKLR LPSGYKDKGP TKTTVMQAAP
QGSNSSQRPI GSQAPSRPPV QSQPAYTPAP SRGPVEEPEA APARPVTTTT TSVTGPVVEV
AGPRRTYTVK AGDAIDAIAR GLDTTRADLV EDNKLKPPYR IHPGQKLKGP ATTARAYVAN
SGDTLSNIAK RFNVKPAALA DENDIKVSAT IKKGQKIRLP SGYKDKGPLK TTTTTTPAAP
RPVTPRPITP APIYNPPAET QTPPPAYTPT GPGPRPYTPP PATNYPRSTG PVSAQPVTPP
PSPGQIIGSS PPPTEAEITA AGRGRFVWPL RGDTISDFGP KGTGQRNDGV NIRAAAGTPV
RAAAAGEVVY AGNQVPGFGN LVLVKHADGW VTAYAHLSAT EVKMRQQVSQ GDTLGAVGQT
GGVTEPQLHF EVRYAPTPKD KARPVDPGLV LPR