Gene Caul_3543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3543 
Symbol 
ID5900998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3820981 
End bp3822237 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content71% 
IMG OID641564050 
ProductO-antigen polymerase 
Protein accessionYP_001685168 
Protein GI167647505 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.981417 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCGA CCGCTCAGCT CTCGGACCGC CCGGCGCGAC ACACCCGCTG GCTGAGCGGG 
GTCGCGATCT TCGTCGTCGT GATGACGCCC TTGCTCGCCT ATCTGGCGCC GCTGGGCTTC
GCGCCGCTGA TGGCCCTGGC GGGGCTGCTG GCGCTGCCCG CCCTGAGGCT GAGCCGCGCG
GCCGCCCCGC CTCTGCTGAT CCTGGTGATC CTGGCCCTGT GGGCGGCGGT CAGCCTCGCC
TGGAGTCCCG CCGCGATCGA TCCCTCGACG CTCAAGGGCT ATGGCGACAT CGAGACCCTG
ACGGGCCTGA AGCTGTTCCT GCAACTGGCG ACCTATGGCG CGGCTGTGGT GGCCCTGCGT
GGCCTGTCCG AGCCTGGCGC GCGCCGGGCC GGGGCGGTGC TGGCTTGGGG CATGGTCGCC
CTGGCCGTCC TGACGGCGAT CGACTCGCTG GCCGGGGCGG CGATCTACCA GCAACTGCAC
GCCGTGACCG GCGAGGCGAT CCGGCCGGAC GTCGCCCTGG TCAAGGTCTC GCTATCGACC
TACGCGATGG TCCTGCTGTT CTGGCCCGTG TCGTTGATCC TCTGGCGACG GTCCGGCGCG
CGGCCGATCT TGGCGCTCGC CGCGGGGATG ATCATCACCT CGGTGATCGG CAGCTCGGAC
GCCTGCCTCG TCGCCCTGGC GGCGGGGGGC GCCGCCTGGC TGCTGGTGCG CTACCTGGGC
CGGAACGGCG CCAAGGTGCT GGTCGCCCTG GTGGCCGCGC CGTTCGTGCT GGCGCCCCTG
GCCGTTCTGA TCGGGGTCGA GACCGGCTTT GTCGCCTGGC TCCACAAGCT GGTCCCGCCC
TCCTGGGACG CGCGGCTGAA CATCTGGACC TTCGCGGCGG ACCATATCCA GAACCACCCC
TTCCGAGGCT GGGGCCTGGA CGCCAGCCGC ACCTTCGGCC CGGCCATTCC GCTGCACACC
CACAACGCCC AGCTTCAGCT GTGGCTGGAA CTGGGCGCGA TCGGGGCGGC CCTGGCGGGG
GTGTTCTTCT GCTGGCTGGC CTATGGCGTG GTGAGGATCA GCGAACGCTC GCGGGGCGAG
GCGGCGATGG CCGCCGGCGC CTTGGTCAGC TACCTGGTGA TCGGGGCCTT GAGCTTCGGC
GTCTGGCAGG AATGGTGGCT GGGCCTGGGC GCCCTGACGC TGATCGCCTG CGGCTTGGCG
CGGGCGACCG CGGAGCCTGA CTGGGGTTTG CGGGACGAAC TTACCCTAAT CGAGTGA
 
Protein sequence
MIATAQLSDR PARHTRWLSG VAIFVVVMTP LLAYLAPLGF APLMALAGLL ALPALRLSRA 
AAPPLLILVI LALWAAVSLA WSPAAIDPST LKGYGDIETL TGLKLFLQLA TYGAAVVALR
GLSEPGARRA GAVLAWGMVA LAVLTAIDSL AGAAIYQQLH AVTGEAIRPD VALVKVSLST
YAMVLLFWPV SLILWRRSGA RPILALAAGM IITSVIGSSD ACLVALAAGG AAWLLVRYLG
RNGAKVLVAL VAAPFVLAPL AVLIGVETGF VAWLHKLVPP SWDARLNIWT FAADHIQNHP
FRGWGLDASR TFGPAIPLHT HNAQLQLWLE LGAIGAALAG VFFCWLAYGV VRISERSRGE
AAMAAGALVS YLVIGALSFG VWQEWWLGLG ALTLIACGLA RATAEPDWGL RDELTLIE