Gene Smed_4944 details


Gene Information

Locus tag: Smed_4944
Symbol:
ID: 5318271
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1455769
End bp: 1457073
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 61%
IMG OID: 640776727
Product: O-antigen polymerase
Protein accession: YP_001313659
Protein GI: 150377063
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3307] Lipid A core - O-antigen ligase and related enzymes
TIGRFAM ID:
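
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming that the start and end coordinates are 1-based and inclusive and that the reported protein length excludes the stop codon; both assumptions are inferred from the numbers in this record rather than stated by it, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1455769
end_bp = 1457073

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be the stop
# codon, which does not contribute a residue to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1305
print(gene_length % 3)  # 0
print(protein_length)   # 434
```

Under these assumptions the computed values match the reported Gene Length (1305 bp) and Protein Length (434 aa).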


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.367217
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.330229
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAATAT CCAAGGCGAG CCTTGTCAGA CCTGGAGCGA ATGACGTCTT CGGCACCTTC 
GCCTTGGCCC TGTCGGTCTT CGTCTTCGCC TATTCGGCGC GCTTCGGACA GGTTTCGATT
CTCGCCTATT ATGGCCTGTG GCTGCCGCTG GTCCTTGTCG ACTACCGAAA GGTTCTCGGC
AACTATGCCA GCTATCTCTG GATCCTGTCG TTCACCATCT TCGCCTGCCT CACGATCTTC
TGGTCGGCGG CGCCATCGCT GTCGCTGCGG ACGGGAATAC AATATCTGAG CCACGTCGTC
TGCGCTCTCA TCGCAATGCG TACGATCGAC ATCCGCACGC TGACGCGCGG CATGATTGCA
GGCGTTGCGA TCGTGCTCCT CTATTCGCTG CTCTTCGGCT CGTATCATTA CGATCCCCTG
GATGGCACCT ATAGCTTTGT CGGTGCGTTC GCATCGAAGA ACCAGTTGGG CTTCTACGCC
TCGCTCGGCA TCTATTTCGC TTTCGCCGCC GTTTTTGTTC TCGGCGAAAG AGGCCTCTGG
ATGGGCGCGG CAGGGGGCGG CGGATTGCTT GCCGCCTATT GCCTCCTCAT GTCGCAATCG
GCAACGTCGG TGCTGACGAC GGCAGCGGTC ATCGGCCTCT GCCTCGGCAT GCGCGCGATC
ACGACCTTGC GTCCCGCGAG CAGAAAGATC CTCTTCACCG CCGCATCGGT GTTCGGCGGC
GTGGCTGCCG TGGCAATAAT CTACGCCGGC GGCATCGATA TGATCCTCGG CGCTTTCGGC
AAGGACTCGA CGCTGACCGG CCGCACCTAT CTCTGGCAGC AGGGCATCGA GGCGGCAAAG
GCGTCGCCCC TCGTCGGCAT CGGCTATCAG GCTTATTGGG TGCAGGGCTT CTCCGAAGCC
GAGCGGCTAT GGGAGGAGTT CTATATCGGC TCGCGCGCCG GCTTTCATTT CCACAACACC
TTCATAGAGA CGGCTGTGGA GACGGGTCTC ATCGGGCTTG TTCTGCTGAC AATGGTGCTG
GCCATGACTT TCTTCGGGCA GCTGAAGCGC CTGCTTTCGG AGGATTGCGA CCCGGAATCG
ATGGTCCTCT TCGGTGTTGG AGCACTGCTT TTGGTGCGCG CCTTCGTGGA GATCGATATC
CTCACGCCCT ACCATATCGG TTCCTTCCTC CTGTACTTCA CCGCTGGGAA GCTGACGATC
CCGCGCCGCC GGGTGGCGGC ATTGCCTTGG CCGCCGCAGC TTCATCCCGC ACCCTACGGA
CGGGCCTTCA CTCCGATGAT GCAGCGACCC GGGGAAAACC GTTGA
 
Protein sequence
MRISKASLVR PGANDVFGTF ALALSVFVFA YSARFGQVSI LAYYGLWLPL VLVDYRKVLG 
NYASYLWILS FTIFACLTIF WSAAPSLSLR TGIQYLSHVV CALIAMRTID IRTLTRGMIA
GVAIVLLYSL LFGSYHYDPL DGTYSFVGAF ASKNQLGFYA SLGIYFAFAA VFVLGERGLW
MGAAGGGGLL AAYCLLMSQS ATSVLTTAAV IGLCLGMRAI TTLRPASRKI LFTAASVFGG
VAAVAIIYAG GIDMILGAFG KDSTLTGRTY LWQQGIEAAK ASPLVGIGYQ AYWVQGFSEA
ERLWEEFYIG SRAGFHFHNT FIETAVETGL IGLVLLTMVL AMTFFGQLKR LLSEDCDPES
MVLFGVGALL LVRAFVEIDI LTPYHIGSFL LYFTAGKLTI PRRRVAALPW PPQLHPAPYG
RAFTPMMQRP GENR
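
The relationship between the gene sequence and the protein sequence above can be spot-checked by translating codons directly. The following is a minimal sketch, not the method used to produce this record: it builds a codon lookup for translation table 11, whose codon-to-amino-acid assignments are the same as those of the standard genetic code (the tables differ in which start codons are permitted), and translates the first 30 bases and the final line of the gene sequence. The helper names `clean` and `translate` are hypothetical.

```python
# Standard codon assignments laid out in TCAG order. Translation table 11
# uses the same codon-to-amino-acid mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Strip the spaces and line breaks that group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon.

    Characters other than A, C, G, T raise KeyError in this sketch.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
first_block = "ATGAGAATAT CCAAGGCGAG CCTTGTCAGA"
# Final line of the gene sequence; it starts in frame because the
# preceding 21 lines hold 1260 bases, a multiple of 3.
last_line = "CGGGCCTTCA CTCCGATGAT GCAGCGACCC GGGGAAAACC GTTGA"

print(translate(first_block))  # MRISKASLVR
print(translate(last_line))    # RAFTPMMQRPGENR
```

The two outputs correspond to the first ten and last fourteen residues of the protein sequence listed above, and the final codon TGA is a stop codon.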