Gene Caul_2579 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2579 
Symbol 
ID5900034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2802483 
End bp2803637 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content67% 
IMG OID641563070 
Productpeptidase M23B 
Protein accessionYP_001684204 
Protein GI167646541 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGATCA CGCGCTTCAA GCGACTGCGG CAATCCCTGG AAGAGATGTT CCCGGAACGT 
CATCTCTATG TCCGCTCCGG CGGCGAGATG CGCGGCTACG TCTTCTCCAC CGGCAAGCAA
CTGATCTGCG CCACTGGGAT CGCGGCGGCG GCCCTGTGGA TGGGCGTCTG CACCGCCTCG
ATGATGGTCA GCGCCCTGTC CGTCAGCTCG ACCGACCAGA TGGTCATCAA GCAGAAGGCC
TATTACGAGC GGCTCAACGC CGATCGCCAG GCGCGGCTGA ACAGCGCCGT TGCCCAGTTG
TCGGCCAGCA GCGGCTCGCT CGACGAATTG GCCAGCTCGG TCGAAAAGCG CCATGCCGCC
CTCGCCATGC TGGTCAGCGA CTTCCGTGGC GTGCCCGGCG CCGCCCAGGC TCTGCAGACC
GCCAAGCCGC GCTTGCCCGG CGCCTCCCCA GTCGAACGCA TCCAGGCCAC GCGGATGGAT
CAGGAGCGCC TGATCGACGC GGCCGAGACC TTCGCCAAGA GCCGCGCCGA ACGTTTGCGC
TTGGCGATGC GGATGGCCGG CCTCGACGCC AGCTCGTTCA CCGGTCGCGC CGGCTCATCG
CTTGGCGGCC CGCTGATCGA GGCCAAGGAT CCCCGCGCCC TGGCCGCCGT GCTCGACGTC
GACGAAGACT TCGCCAGCCG CATCCAGCAC GCCGCCACCG ACATGTCGGA CATGCGTCAG
CTGAGCGCCG CCTCGCAGAA ACTGCCCTTC TACCGGCCGA CCACCAACCC CGCCCTGAGC
AGCAGCTACG GCGTGCGGTT CGACCCCTTC ACCCATCGTC CCGCCTTCCA CTCCGGCCTC
GATTTCCCCG GCGCCTTCTA CACGCCGATC ATGGCCACCG CGCCGGGCGT GGTGTCGTTC
ACCGGCGTCC GCTCGGGTTA CGGCAATGTG GTCGAGATCG ACCACGGCAA CGGTTTCAAG
ACCCGTTACG CCCACCTGCA GGCCACATCG GTCAAGGTTG GTCAGCGGGT GGCCATCGGT
CAGCGTATCG CGGCCATGGG CTCGACGGGT CGTTCGACCG GTCCGCATCT GCACTACGAA
GTATGGGTCA ACGGGCGGGC GCAGAACCCG AACCGTTTCT TGAAGGCTGG TGAGTATGTT
CAGCAAGCAA GCTAA
 
Protein sequence
MAITRFKRLR QSLEEMFPER HLYVRSGGEM RGYVFSTGKQ LICATGIAAA ALWMGVCTAS 
MMVSALSVSS TDQMVIKQKA YYERLNADRQ ARLNSAVAQL SASSGSLDEL ASSVEKRHAA
LAMLVSDFRG VPGAAQALQT AKPRLPGASP VERIQATRMD QERLIDAAET FAKSRAERLR
LAMRMAGLDA SSFTGRAGSS LGGPLIEAKD PRALAAVLDV DEDFASRIQH AATDMSDMRQ
LSAASQKLPF YRPTTNPALS SSYGVRFDPF THRPAFHSGL DFPGAFYTPI MATAPGVVSF
TGVRSGYGNV VEIDHGNGFK TRYAHLQATS VKVGQRVAIG QRIAAMGSTG RSTGPHLHYE
VWVNGRAQNP NRFLKAGEYV QQAS