Gene Caul_3669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3669 
SymbolmurD 
ID5901124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3963749 
End bp3965158 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content71% 
IMG OID641564180 
ProductUDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase 
Protein accessionYP_001685294 
Protein GI167647631 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0771] UDP-N-acetylmuramoylalanine-D-glutamate ligase 
TIGRFAM ID[TIGR01087] UDP-N-acetylmuramoylalanine--D-glutamate ligase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00445508 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.10958 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCCGG TCCGCGGTTT CGAGGGCAAG ACCGTCGCCG TGTTCGGCCT GGGCCGGACG 
GGGCTGACGG CCGCGCGCGC GCTGATCGCC GGCGGGGCCA AGGTGGCGCT GTGGGACGAA
AAGCCCGAGA GCCGCCAGGC CGCCGTGGCC GAGGGGCTGA ACGTCGTCGA CCTGACCACC
AGCGACTGGA GCGACTACGC CGCCCTGATG CTGTCGCCGG GCGTGCCGCT GACCCATCCC
AAGCCGCACT GGACGGTGGG CAAGGCCAAG GCGGCCGGGG TCGAGGTGCT GGGCGACATC
GAGCTGTTCG CCCGCACGGT GAACGCCGCG CCCGAGCACA AGAAGCCCAA GATCATCGCC
ATCACCGGCA CCAACGGCAA GTCGACGACG ACGGCCCTGA TCGGCCATCT GTGCCGCCAG
GCCGGGCGCG ACACCCGGGT CGGCGGCAAT ATCGGCGAGG GCGTGCTGGG CCTGGAGGAC
ATGCACGGCG GCGCGGTCTA CGTGCTGGAG CTGTCGTCCT ACCAACTGGA CCTGACCTCC
AGCCTCAAGC CCGACGCCGT GGTGCTGCTG AACATCTCGC CCGACCACCT GGACCGGCAT
GGCGGGATGG ACGGCTATAT CGCCGCCAAG CGCCGGATCT TCCTCAACCA GGGCAAGGGC
GACACGGCGA TCATCGGGGT GGACGATCCC TGGTGCCAGC AGATCTGCAC CGAGATCACC
GCCGCCAACC GCCGCACCAT CTGGCCGATC AGCGCCGGCA AGGCCATGGG GCGCGGCGTC
TACGCCCTGC AGGGCGTGCT GTACGACGCG ACCGGCGAGC GCGTGACCGA GATGGCCGAC
CTGTTGCGGG CCCGCAGCCT GCCAGGCCGT CATAACTGGC AGAACGCCGC GGCCGCCTAC
GCCGCGGCCA AGGCCATCGG CATTCCCGCC CACCAGGCCG TCGACGGCCT GATGAGCTTC
CCGGGCCTGG CCCATCGCAT GGAGACGGTC GGCAAGCTGG GCAAGGTCCG CTTCGTCAAC
GACAGCAAGG CCACCAACGC CGACGCCGCC CGCCAGGCGA TGTCGAGCTA TCCCAAGTTC
TACTGGATCG CGGGCGGCGT GCCCAAGGCC GGCGGCATAG ACGACCTCGT CGACCTGTTC
CCGCGCGTGG CCGGAGCCTA TCTGATCGGC CAGGCGGCCG AGGACTTCGG CAAGACGCTT
GAGGGCAAGG CCCCGGCGCG CCAGTGCGGC GATATCGAGA CCGCTGTCGC CGCCGCCTAT
GCCGACGCCG TCGCCAGCGG GGAGGAGGCG GTCGTCCTGC TTTCGCCGGC CTGCGCCTCG
TTCGACCAGT TCGCCGACTT CGAGCAGCGC GGCGAGGCGT TCCGCGCGGC GGTCAACGGA
TTGGGCAAGC CGGCGGCGAA GCGGGCCTAG
 
Protein sequence
MIPVRGFEGK TVAVFGLGRT GLTAARALIA GGAKVALWDE KPESRQAAVA EGLNVVDLTT 
SDWSDYAALM LSPGVPLTHP KPHWTVGKAK AAGVEVLGDI ELFARTVNAA PEHKKPKIIA
ITGTNGKSTT TALIGHLCRQ AGRDTRVGGN IGEGVLGLED MHGGAVYVLE LSSYQLDLTS
SLKPDAVVLL NISPDHLDRH GGMDGYIAAK RRIFLNQGKG DTAIIGVDDP WCQQICTEIT
AANRRTIWPI SAGKAMGRGV YALQGVLYDA TGERVTEMAD LLRARSLPGR HNWQNAAAAY
AAAKAIGIPA HQAVDGLMSF PGLAHRMETV GKLGKVRFVN DSKATNADAA RQAMSSYPKF
YWIAGGVPKA GGIDDLVDLF PRVAGAYLIG QAAEDFGKTL EGKAPARQCG DIETAVAAAY
ADAVASGEEA VVLLSPACAS FDQFADFEQR GEAFRAAVNG LGKPAAKRA