Gene Caul_4542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4542 
Symbol 
ID5902003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4916069 
End bp4917076 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content73% 
IMG OID641565061 
Producttetraacyldisaccharide 4'-kinase 
Protein accessionYP_001686160 
Protein GI167648497 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1663] Tetraacyldisaccharide-1-P 4'-kinase 
TIGRFAM ID[TIGR00682] tetraacyldisaccharide 4'-kinase 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.187603 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTGG CCACGCCGCG CTGGTGGTAT CTTCGCGAGG GCGCGCCCAG CCCGATCACC 
CGCGCCCTGC TGACCCCGCT GTCGTGGATC TGGGCCGCCC AGACCGCCCG GCGCATCGCC
CGCACCACGC CGCGCGGCGC CGACTGCGCG GTGATCTGCG TCGGCAACTT CACGGTCGGC
GGGGTGGGCA AGACCCCGAT CGTCCGCGAG CTGCTGCTGA CCCTGACGAA GCGGGGCCGC
CGCGCCCACG GCCTGGCGCG CGGCTATGGC GGCAAGCTGA AAGGGCCGGT GCGGGTCGAG
CCGTCGCGCC ACACCGTCGC CGAGGTCGGC GACGAGCCGC TGATGCTGGC CCAGGACTTT
CCGATGTGGG TGTCGCGCGA CCGGGTGCTG GGCGCGCGCA AGGCCGCCGC GTCCGGCGCC
GAGGTGGTGG TCATGGACGA CGGCCACCAG AACCCCGACC TGCGCAAGAC CCTGTCGCTG
GTGGTGGTCG ATGGCGAGAC CCGCGAGGAC GAGTGGCCGT TCGGCGACGG TCGGGTGTTC
CCCGCCGGTC CGATGCGCGA GCCGCTGAAC GTCAGCCTGG GGCGCACCGA CGCGGTGATC
GTGCTGCTGC CGGCCGACCT GCCAGAGGCT GATCCGCGGC TGCTGGCGCT GTTTGGCGAC
ACCCCGGTGC TGATCGCCCG GCTGGAGCCC GCCGCCCCGC CGCCCAAGGG CCGCCAGGTC
GGCTTCGCCG GCATCGGCAA GCCCTGGAAG GTCGAGCGCG CCCTGAAGGC CGCCGGCTGC
CACCTGGTCG ACTTCGCGCC CTATCCCGAT CATGGCCAAT ATGACGAGGC GACGCTGAAC
TTCCTTTGGG AGCGGGCCCA GACCTACAGC GCCGGGCTGG TCACGACCGA GAAGGACTGG
GTGCGGCTGC CCCAGGCCTG GCGGGATCGG GTGACGCCTT GGCCGGTGCG GGCGCGGTTC
GAGGATGAAG GGGCGTTGGG GGCGTTGTTG GAGTCAGTGG GGCTGTAG
 
Protein sequence
MKLATPRWWY LREGAPSPIT RALLTPLSWI WAAQTARRIA RTTPRGADCA VICVGNFTVG 
GVGKTPIVRE LLLTLTKRGR RAHGLARGYG GKLKGPVRVE PSRHTVAEVG DEPLMLAQDF
PMWVSRDRVL GARKAAASGA EVVVMDDGHQ NPDLRKTLSL VVVDGETRED EWPFGDGRVF
PAGPMREPLN VSLGRTDAVI VLLPADLPEA DPRLLALFGD TPVLIARLEP AAPPPKGRQV
GFAGIGKPWK VERALKAAGC HLVDFAPYPD HGQYDEATLN FLWERAQTYS AGLVTTEKDW
VRLPQAWRDR VTPWPVRARF EDEGALGALL ESVGL