Gene Caul_1027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1027 
Symbol 
ID5898482 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1087680 
End bp1089074 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content71% 
IMG OID641561509 
Productpeptidase M23B 
Protein accessionYP_001682655 
Protein GI167644992 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGAAT TCGATCCGAG GCGGCCGACC CTACGTCTGG CCCCGCGACT CTTCACCGCG 
ATCGCCGGCC TGGCCGCCCT GGCCCTGGGC TGGAAACTGA CCGCGCCCGA CGCGCCGGTC
CAGGTGGTCA AGGCGCCGCT CGACGCCAAC GCCCTGGCCG CGCTGCAACA CGCCGCCTTC
GCCAGCGCCG AAGCCCAGCC CGGCTTCGAG CGTCCGGAAT CCATCCCCGT CAAGGTGCGT
CCTGGCGAGA CCCTGGAAGG CGCCGTGCTG CGCGCCGGGG TCGCTCCCAA CGACGCCCGT
CAGGTGGTGG CCGCTCTGCA AGGCGCGATC GACACCGTCA ACATCAAGGC CGGCATGGCC
TTCGAGGCCG CCGTGGCCGA GCGCCGCGGC CACAACGGCC AGGGCGCGGC CGGCCCCGCC
CGGCTGATCG GCCTGTCGAT GCGCACCGGC CCGTCCTCGA CCCTGACCGT CTCGCGCACC
TTCGATGGCG CGATGAAGTT GCGGGAGCTG GAGGAAAAGG TCACCGACGA GACCAAGGTG
GCCTGCGGCC AGATGGAGGG CTCGTTCTAC GAGAGCGTCG CCAGCATCGG CGGCTCGCCG
GCCGTGGTCA GCCAGGCCGC CCAGCTGTTC GCCCACAAGA TCGACTTTTC GCGCGACATC
CACGAGGGCG ACCGCTTCTG CCTGGTGTTC GGCCGCAAGG TCACCGAGAG CGGCCGCACG
GTCGAGGCCG GCGACCTGGA ATATGCCGAG GTGAAGGGCC AGAAGTTCTA CGCGTTTGAT
CGCGATGGGC CGGACGGCAA GCCGCAGTTC TTCGACGAGC TGGGCAAGAA CATCAAGGGC
TTCCTGCTGC GCACGCCGGT CGACGGCGCG CGCATCACCT CCACCTTCGG CCAGCGCAAG
CACCCGGTGC TGGGCTATAC CCGCGCCCAC CAGGGCGTCG ATTTCGGGGC CGGCACCGGC
ACCCCGATCC TGGCGGCCGG CGACGGCGTG GTGCTGGAGG CCCGCCGCTG GAGCGGCTAT
GGCAACTGGC TGCGCATCCG CCATTCGGGC CAGTGGGACA CCGGCTACGG CCACATCTCG
CGCTACGCCC CCGGCATCCG TCCGGGCGTC CACGTGCGCC AGGGCCAGGT GGTGGCCTAT
GTCGGGGCGA CGGGCCTGGC GACCGGTCCG CACCTGCACT ACGAGGTCTG GCTGAACGGC
AAGCGGGTCA ATCCGATCGG CGCCAAGGTG CCCCAGGGCA CCATCCTGGC CGGCGGCGAG
CTGACCCGCT TCAAGGCCCA GCGCGCGCGC ATCGACCACC TGCTGGCCGA CGGCGGCGAC
GTGGTGCACG ACAAGACCAC CCCGAAACTG GCCCTGGCCT CGCTGGACAG GGGCAAGGGG
CCGGCGCTGC GCTGA
 
Protein sequence
MQEFDPRRPT LRLAPRLFTA IAGLAALALG WKLTAPDAPV QVVKAPLDAN ALAALQHAAF 
ASAEAQPGFE RPESIPVKVR PGETLEGAVL RAGVAPNDAR QVVAALQGAI DTVNIKAGMA
FEAAVAERRG HNGQGAAGPA RLIGLSMRTG PSSTLTVSRT FDGAMKLREL EEKVTDETKV
ACGQMEGSFY ESVASIGGSP AVVSQAAQLF AHKIDFSRDI HEGDRFCLVF GRKVTESGRT
VEAGDLEYAE VKGQKFYAFD RDGPDGKPQF FDELGKNIKG FLLRTPVDGA RITSTFGQRK
HPVLGYTRAH QGVDFGAGTG TPILAAGDGV VLEARRWSGY GNWLRIRHSG QWDTGYGHIS
RYAPGIRPGV HVRQGQVVAY VGATGLATGP HLHYEVWLNG KRVNPIGAKV PQGTILAGGE
LTRFKAQRAR IDHLLADGGD VVHDKTTPKL ALASLDRGKG PALR