Gene Caul_3666 details


Gene Information

Locus tag: Caul_3666
Symbol: murG
ID: 5901121
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3959304
End bp: 3960389
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 73%
IMG OID: 641564177
Product: undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase
Protein accession: YP_001685291
Protein GI: 167647628
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0707] UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase
TIGRFAM ID: [TIGR01133] undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase
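
The coordinate and length fields above are related by simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which matches the values listed) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3959304, 3960389)
print(length, protein_length_aa(length))  # prints: 1086 361
```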


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.231982
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.103527
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAGC TGGTGGTCGT CGCCGCCGGG GGCACCGGCG GACACCTGTT TCCCGCCCAG 
GCCCTGGCCG AGGTGCTGAA GGATCGCGGC TGGCGCGTGG TGCTGGCCAC CGACGAGCGC
GGCGCGCTGT TCGCCGACAA GTTCCCGGCC GAGGAGCGCC TGGCCCTGTC GGCCGCCACC
GCCAAGGCCG GCGATCCGAT CGGCATGGTC AAGGCGGGCT TCGCGGTCGC CCAGGGCGTG
CTGCAGGCCA AGGCCGCCTT CAAGCGCCTG GACCCGGCCG TCGTGGTCGG CTTCGGCGGC
TATCCCGCCC TGCCAGCCCT GCTGGCGGCC CTGTCCGAGG GCCGGCCGAC GGTGATCCAC
GAGCAGAACG CGGTGCTGGG CCGGGTCAAC CGCTTCCTGG CCTCGCGCGC CACCGAGGTG
GCCTGCGCCT TCCCGACCCT GGAAAAGGCC ACGCCCAAGG TGAAGGCCCG CGCCCACGTG
GTCGGCAATC CGGTGCGGCC CGAGATCCGC GCCCTCTACG ACGTGCCCTA CCTGCCGCCC
GAGGTGCAAC TGCGGGTGTT GGTCACCGGC GGCAGCCAGG GCGCGCGCCT GCTGTCGGAG
CTGGTGCCCG AAGCCATCGC CAAGCTGCCC GAGGAGATGC GCGGCCGCCT GAAGGTGCAG
CAGCAGAGCC GGGCCGAGTC GATGGAGAGC GCCCGCAAGA TCTATCGCAA CGCCATGGTC
GACTGCGAGG TCGCGCCGTT CTTCCGCGAC ATGGCCGGCC GTCTGCGCCA GGCCCACCTG
GTGGTCGGCC GGGCCGGCGC CTCGACCTGC TGCGAGCTGG CGGTGGCCGG CCGCCCGTCG
ATCCTGGTGC CCCTGAAGAT CGCCGCCGAC GACCACCAGC GCTTCAACGC CCGGCAGCTG
GAAGAGGCGG GCGGGGCGGC GGTGTGCCTG GAGGACGAAC TGACCGTCGA CGCCATGGCC
GGCGCCCTCA ACGCCCTGCT CAAGGACCCC GAGCGCCTGG CCCGCATGGC CGAGGGCGCG
CGCAAGGTGG CGACCCCCGA CGCGGCCGAG AAGCTGGCCG ACCTAGTCGT GAGGACCGCG
CGATAG
 
Protein sequence
MSKLVVVAAG GTGGHLFPAQ ALAEVLKDRG WRVVLATDER GALFADKFPA EERLALSAAT 
AKAGDPIGMV KAGFAVAQGV LQAKAAFKRL DPAVVVGFGG YPALPALLAA LSEGRPTVIH
EQNAVLGRVN RFLASRATEV ACAFPTLEKA TPKVKARAHV VGNPVRPEIR ALYDVPYLPP
EVQLRVLVTG GSQGARLLSE LVPEAIAKLP EEMRGRLKVQ QQSRAESMES ARKIYRNAMV
DCEVAPFFRD MAGRLRQAHL VVGRAGASTC CELAVAGRPS ILVPLKIAAD DHQRFNARQL
EEAGGAAVCL EDELTVDAMA GALNALLKDP ERLARMAEGA RKVATPDAAE KLADLVVRTA
R
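
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method: it builds the codon-to-amino-acid mapping shared by NCBI translation tables 1 and 11, ignores the alternative start codons that table 11 permits (the gene here begins with ATG, so this makes no difference), and stops at the first stop codon. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first 30 bases of the gene are used in the usage lines; the full sequence can be pasted in the same way.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
prefix = "ATGAGCAAGC TGGTGGTCGT CGCCGCCGGG"
print(translate(prefix))  # prints: MSKLVVVAAG
print(translate("CGATAG"))  # last two codons of the gene; prints: R
```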