Gene Caul_3034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3034 
Symbol 
ID5900489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3298404 
End bp3299471 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content69% 
IMG OID641563536 
Productputative glycerol-3-phosphate acyltransferase PlsX 
Protein accessionYP_001684659 
Protein GI167646996 
COG category[I] Lipid transport and metabolism 
COG ID[COG0416] Fatty acid/phospholipid biosynthesis enzyme 
TIGRFAM ID[TIGR00182] fatty acid/phospholipid synthesis protein PlsX 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.829882 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGGGG ATCACGGCCC GTCCGTGATC GTTCCCGCCG TGGCGCTCGC CGCCAAAAGC 
CTTCCCGACG TCCGTTTCCT GCTGCACGGC GACGAAGCCC AGCTGAACGC CCAACTGGCC
AAATCCCCGG ACGCGCGCGC CGTCAGCGAG GTGCGTCACA CCGACAAGGC CATCTCGATG
GAAGAAAAGC CCGCCCAGGC GATGCGCCGG GGCAAGGGCA CCAGCCTGTG GAACGCCGTC
GAGGCCCTGC GCAACAACGA GGCCGCCGCC GTCGTCTCGG CCGGCAACAC CGGCGCTCTG
ATGGCGATCT CCAAGCTGAT CCTGCGCATG GGGGCCAATC TGGAGCGCCC GGCGATCGTC
GCCAGCTGGC CGACCATGCG GGGCGTCTCG GCCGTGCTCG ACGTCGGCGC CAATGTCGAG
AGCGACGCGG GCCAGTTGAT CGAGTTCGCG ATCATGGGCG CGGCCTTCCA CCACGCGGTG
CACGGTTCCG AGCGTCCGAC CGTCGGCCTG CTCAATGTCG GCTCCGAGGA CCAGAAGGGT
CACGAGGAGG TGCGCGAGGC GCACGCCATC CTCAAGGAGA CCAAGCTCGA CTTCGACTAT
CGCGGCTTCG TCGAGGGCAC CGACATCGCC AAGGGCACGG TCGACGTGGT CGTCACCGAC
GGCTTCACCG GCAATGTCGC CCTGAAGACC GCCGAGGGTC TGGCGCGGTT CTTCGCGGCC
GAGATCAAGG CCACCCTGAC CTCCGGTCCG CTCGCCATGC TGGGGGCGGT GATCGCCTCC
GGCGCCCTGA AGAAGATGCG TCGACGCCTG GATCCGGGCC GAGTCAACGG CGGGCCGCTG
CTGGGCCTCA ACGGCATCGT GGTCAAGAGC CACGGCGGGG CCGACCCTAT CGGTTACGCC
TCGGCCATCC GCGTGGCCGT CGATCTGGCG CGCAGCGACT TCCAGGCCGA GATCGACCGT
AATCTGAAAC GTCTGACAGA AACCGGCCTG AAATCCGGCG CGGATCAAGC GGCCAATCCC
AGCCATGGGG GCGCGTCGGG CGCCGAGGGC CAAGGAGCTT CCGAGTGA
 
Protein sequence
MGGDHGPSVI VPAVALAAKS LPDVRFLLHG DEAQLNAQLA KSPDARAVSE VRHTDKAISM 
EEKPAQAMRR GKGTSLWNAV EALRNNEAAA VVSAGNTGAL MAISKLILRM GANLERPAIV
ASWPTMRGVS AVLDVGANVE SDAGQLIEFA IMGAAFHHAV HGSERPTVGL LNVGSEDQKG
HEEVREAHAI LKETKLDFDY RGFVEGTDIA KGTVDVVVTD GFTGNVALKT AEGLARFFAA
EIKATLTSGP LAMLGAVIAS GALKKMRRRL DPGRVNGGPL LGLNGIVVKS HGGADPIGYA
SAIRVAVDLA RSDFQAEIDR NLKRLTETGL KSGADQAANP SHGGASGAEG QGASE