Gene Caul_1595 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1595 
Symbol 
ID5899050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1684548 
End bp1685951 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content70% 
IMG OID641562083 
ProductO-antigen polymerase 
Protein accessionYP_001683223 
Protein GI167645560 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAGCAC AACAACACGA AGACCGGCCC GACCTCCCGC GCAAGCTGGA AGCGCTGGCC 
TGCGGCTTCG TGCTGTTCAT GCTGTCCAAC GCCTTCATCG GCCCGCTGCT CGACCCGCTA
CAGGCTGGCG GCGAGAACAT TCCGGTGCTG CGGCTCATGT GGCTGCCGGT CTACGCCCTG
ATCCTGGGCC TGGTCGCCTG GCGCGCCCCC CGGCTGATGC GCTTCTGGCT GCCGGCCGCC
ATGCTCAGCC TGCTGGTCTT CTGGGTGTTC GCCTCGGCCT CGTGGTCGCT GAACCCCGGC
GCCACCAACC GCCGGGCCTT GGCGGCGGCC TTCACCACCC TGTTCGGCTT CTATTTCGCC
GCCAGCTTCG ACGGCAGGCG GATGGCCGAG ATCATCGCCG CCACCTTCCT GCTGCTGGCG
ATCGGCGGGG CGCTGACGGC CGTGGCCTAT CCGACCATGG GCGTCCACCA CGACATCAAC
GCCGGCGACT GGCGCGGCCT CTGGTACGAG AAGAACCAGA TGGGCGCGAT GATGGTCTAC
GGCGCCCTGG CGGCGATGGC CGCCATCCTG GCCGGCTCGA CCCGGCGCAA ACAGCTTGTC
TTCACCATCG TGCTGTGCGC GGCCCTGATC GTCATGACCA AGTCCAAGAC CTCGCTGGTG
GTCCTGATGA TCGGCCTCCT GGGCTCGATG CTGCTGGCGG CCATGCGGCG CGGACCGGCC
ACGGCGGTGA TCGTCGTCTG GCTGGGCGTC ACGGTGATCG CCACCACCGT GATGGTCCTG
TGGCTGGCCC CCGACCTGGT GTTCAAGGCC CTGGGCAAGG ACCCCACCCT GACCGGCCGC
ACCGACATCT GGGCCGCCGT GCTGCGTCAG TCGGCCAAGG CCCCGCTGAC CGGCTACGGC
TACGCGGTGT TCTGGACGCT GGAGTCCCAG CCCGCCCAAT GGATCCGCAA GGAGACTGGT
TGGCTGGTGC CCACCGCTCA CAACGGCTGG CTCGACATCC TGGCCCAGCT GGGCTGGATC
GGCGTGGGCC TGTGCGCCCT GGTGCTAGGC GGGTCCCTGC TGGTCGCCCT GGTCCGCTTT
CGCAGGGTGC GGGACGGCTA TTGGGCCACC CTGTTCCTGG CCATCTTCCT GATGACCACC
TTTTCCGAGA GCTTCATCCT GGAGCGCAAC GGCATCGCCT GGGCCCTGGC CTGCGCGGCG
GTGACGCGGC TGCTGGGACC AGTGCTGGCG CTGGGCGCGC CGCGCGAGAA GGTCGTCCGC
GCGCCGCTGT TCGCCGAGCC GCCCCTGGCC TGGTCCCTGG CCCCGCCGGA CTCCGCGCCG
GAGATCTGGA CGCCCACGCC CGCCCGTCGG CCGGCCTTCA CGCCCACATT TGGCAAGCGC
GCGGTCTCGC CTTTCGCCGC TTAG
 
Protein sequence
MEAQQHEDRP DLPRKLEALA CGFVLFMLSN AFIGPLLDPL QAGGENIPVL RLMWLPVYAL 
ILGLVAWRAP RLMRFWLPAA MLSLLVFWVF ASASWSLNPG ATNRRALAAA FTTLFGFYFA
ASFDGRRMAE IIAATFLLLA IGGALTAVAY PTMGVHHDIN AGDWRGLWYE KNQMGAMMVY
GALAAMAAIL AGSTRRKQLV FTIVLCAALI VMTKSKTSLV VLMIGLLGSM LLAAMRRGPA
TAVIVVWLGV TVIATTVMVL WLAPDLVFKA LGKDPTLTGR TDIWAAVLRQ SAKAPLTGYG
YAVFWTLESQ PAQWIRKETG WLVPTAHNGW LDILAQLGWI GVGLCALVLG GSLLVALVRF
RRVRDGYWAT LFLAIFLMTT FSESFILERN GIAWALACAA VTRLLGPVLA LGAPREKVVR
APLFAEPPLA WSLAPPDSAP EIWTPTPARR PAFTPTFGKR AVSPFAA