Gene Caul_1531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1531 
Symbol 
ID5898986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1622345 
End bp1623604 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content70% 
IMG OID641562018 
ProductO-antigen polymerase 
Protein accessionYP_001683159 
Protein GI167645496 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.543111 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.517094 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCTTGG CGCGCGCGGC GGCCCATGCC CCGCCGGACC GGCTGACCTT TCTCCAGATC 
GCGCTGTTCG TGCTGCCGGT CCTGGCCCTG CTGCTCTACA GCGGCGGCTG GGAGCTGCCG
CTGGTCGGCG AGACCGCCAC CGAGGCCGGC TCGGCCCTGC TGCGGATCGG CTACCTGCCA
GCCTACGCCG CCGGCTTCCT GCTGATCGCC CTGCGGCCGG GCTCGACCTT CCGGGTGCTG
ATCCGCCAGC CGTTCCTGAT CGCCCTGCTG ATCATCGTCG TCGCCTCGAT GTTCTGGTCG
GTCAATCCAG ACCAGACCGC CCGGCGCGGC TTCGCCCTGG TCTGCACCAC CCTGGGCGGC
GTGGCCCTGG CGGCCCGTTT CCGCTGGCCG CAATTGGCCG AGGTGGTGGC CGCGGCCTTC
GCGGTGCTGA TCGTCGCCTG TTTCGTCGTC TGCCTGGCCT TTCCCCGCAT CGGGGTGATG
ACCGAGCTGT TCCCCGGCGC CTGGCGCGGC CTGTGGCGCG AGAAGAACGG CCTGGGCGGC
AACATGGCGT TCGGCTTCTG CATCCTGTCG GCCGCGGCCC TGCTCAACCC GCGCCGCGCC
CGGCTGTGGT GGACCTTCGC GGGCCTGGCC CTGGTGCTGG TGCTGATGTC GACCTCCAAG
ACCTCGCTGG TGTCGCTGAT GCTGGGCGTG GCGGCGATCG GCTTCGTCTG GATCGCCCGT
CGCAGCCCAG CCGCCGGCGC CGCCGCCACC TGGACCGGGG TGACCGGCGT GGTGCTGCTG
GGGGCCTTCA TCCTGTTCGC CTCGGACGTG TTCTTCGCCA TCCTCGGCAA GGACGCCACC
CTGACGGGCC GCACCAAGAT CTGGTCCGCG GTGATGCGCG AGATTGAGGG CCGACCGTGG
CTGGGCTACG GCTACCAGGC GGTGTGGGGC GACAAGTCCG GCTGGGGTCC GTTCGCCTGG
ATCAGCAAGA ACGCCGGCTT CCAGGCCCAG CACGCTCACA ACAGCTGGCT GGAGCAATGG
CTGGGCATGG GCTTGCTGGG CCTGATCGCC TGGGGGCTGT TCTATCTGCA GGCCATGACC
CTGGCGGTGA TCGCCGTGTT CCGCGACCGA GGGGCGCTGC TCGCCTTCCC GTTTCTGGTC
GTCTACAGCC TGGTCGCCCT GACCGAGAGC ATCGCGGTGA TCTACAACGA CTTTCGCTGG
GTGTTGTTCG TGGCCTTCGC CGCCAAGCTG GCGTTTCCGG ACCGCGAGGT CGAGGGGTAG
 
Protein sequence
MSLARAAAHA PPDRLTFLQI ALFVLPVLAL LLYSGGWELP LVGETATEAG SALLRIGYLP 
AYAAGFLLIA LRPGSTFRVL IRQPFLIALL IIVVASMFWS VNPDQTARRG FALVCTTLGG
VALAARFRWP QLAEVVAAAF AVLIVACFVV CLAFPRIGVM TELFPGAWRG LWREKNGLGG
NMAFGFCILS AAALLNPRRA RLWWTFAGLA LVLVLMSTSK TSLVSLMLGV AAIGFVWIAR
RSPAAGAAAT WTGVTGVVLL GAFILFASDV FFAILGKDAT LTGRTKIWSA VMREIEGRPW
LGYGYQAVWG DKSGWGPFAW ISKNAGFQAQ HAHNSWLEQW LGMGLLGLIA WGLFYLQAMT
LAVIAVFRDR GALLAFPFLV VYSLVALTES IAVIYNDFRW VLFVAFAAKL AFPDREVEG