Gene M446_4026 details

Gene Information

Locus tag: M446_4026
Symbol:
ID: 6132877
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 4488336
End bp: 4489835
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 69%
IMG OID: 641644183
Product: O-antigen polymerase
Protein accession: YP_001770823
Protein GI: 170742168
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3307] Lipid A core - O-antigen ligase and related enzymes
TIGRFAM ID:
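
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4488336, 4489835)
print(length, protein_length_aa(length))  # 1500 499
```

Both results match the Gene Length (1500 bp) and Protein Length (499 aa) fields reported in the record.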


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTTGC GTCTCTCGCC GATCGAGGCC GAAGGCGGCC GCCTGGCGCC GCTCCTGGCG 
GTCGGGCTGC TCGGGATCGC GAGCGTGCCG CTCGCCGGCT CCCTCGAGAC CGTGAGCCCG
CGTCTCCTGC TCGTCGGGGT CGCCGGCCTC GCCGCGCTCG CCGCGGTGGT GCTTGCCGGG
CTGTTGAGGC CGGTGCTGCT CGCCCTCTTC GTCGTGGCCC TGACCTACAA TCGCCAATAC
TATGACTTCG ACTGGCTGCA CGGAGACCTC GGCGGCCGTG GCCTGTACTG GTGCCCGGCG
GACGTCTGCC TGGTGGGGCT GTTGCTGCTG TGGCCAATAG AGCGGGCGCT GCGCCTGCCG
GCGCCACCGC GCCCGACCCG GGGAGGCCCG GTGCTGTGGC TGCTGCCGCT CATCGCGGTG
GGGCTGCTCT CGGCGGCCTC CAGCACGGAG CCGCTCGGCA GCACCGTCGA GATCGTCCGC
TACCTCAAGC TCGCCCTGCT CCTGCTCTAC CTGCAGTACA ATCTCGACGC GCGCGGCCTC
ACGATCATCG TCGCGGCCCT CGCGGCGGTC ATTCTCCTGC AAACGCCGAT CTCGGTCCTC
CAGGCGGCCT TCGCCTCCGG CCAGAACGGC CTCTCGCAGA TCTTCCAGTC CAGCGAGCCG
GCCGAACTCG CCCGCCGTGC GGCCGGCACC CTCGGGCATC CGAACTACCT CGCCCCCTAC
CTGCTCCTGA TCACGCCGCC CTTCATCGCC GTCGCGCTCG GCTTGCACGG CTCCTGGATC
GGCCGGGCGG CCGCACTGGT GGCGCTGTCC GGCACTCTCA CGATCTGTCT GACCCAGTCG
CGCGGTCCGA TCGCTCTCCT GCTCGTCACG ACCCTCGTTC TGATCGCCCT GATGACGGCG
CGCCGCGCGC TCCGAGCCCT GCGGGCGGTC GGCCTGATCG TGGCGGGTGC GGTGCTCCTC
GTCGCCGTGG TGGCCCCGCT CGCCCCTGCG ATCGAGAAGC GGCTCTCGGG CGATTTCGGG
GCCTCGGTGG ATTTCCGGGC TGCGTACAAC GATGCCGCAA CGCGCATGTG GGAGTCCAGC
CCCATACTCG GCGTCGGACC GAACAATTTC GGCCTGGAGA TCCGGCATTA TTCCCCGGAT
CTCTACGTGC TCATGGCCCT CGACGCGCTC TCGAGCAACG AATCCCGCAA CAAGGTCCAT
CTCCGCAGCA CCGCGCCCGT CCATAATGTC TACCTGCTCG TGCTGTCCGA GCTTGGACTT
CTCGGCCTGA TCGGCTTCGT GCTGTTCCTG CTGCGCGGTC TCGTGCTGGC TTGGCGGGCC
TCGGCGGCCT CGTCCCAGGT GACGGGTCTG TTCGCGCTTG GATTGTTCTG CGGTATCATC
GCCGAGTACG TGCAGCAGCT GATCGATTAC TCGCTGCTCT GGGATCCCTT GCTGTTCACC
ATCACCCTGC TGATGGGTGT CATGCACGCG ATCGTCGCCA CTCAGGAGGC TCACCCGTGA
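
The gene sequence above is printed in blocks of 10 bases separated by spaces, so it needs to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned and a GC percentage computed for comparison with the GC content field; the helper names are hypothetical, and only the first line of the sequence is used here for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGCTTGC GTCTCTCGCC GATCGAGGCC GAAGGCGGCC GCCTGGCGCC GCTCCTGGCG"
seq = clean_sequence(first_line)
print(len(seq), seq[:3])          # 60 ATG
print(round(gc_percent(seq), 1))  # GC percentage of this 60-base fragment
```

Pasting the full sequence into `clean_sequence` should give a 1500-base string that starts with the ATG start codon and ends with the TGA stop codon.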
 
Protein sequence
MSLRLSPIEA EGGRLAPLLA VGLLGIASVP LAGSLETVSP RLLLVGVAGL AALAAVVLAG 
LLRPVLLALF VVALTYNRQY YDFDWLHGDL GGRGLYWCPA DVCLVGLLLL WPIERALRLP
APPRPTRGGP VLWLLPLIAV GLLSAASSTE PLGSTVEIVR YLKLALLLLY LQYNLDARGL
TIIVAALAAV ILLQTPISVL QAAFASGQNG LSQIFQSSEP AELARRAAGT LGHPNYLAPY
LLLITPPFIA VALGLHGSWI GRAAALVALS GTLTICLTQS RGPIALLLVT TLVLIALMTA
RRALRALRAV GLIVAGAVLL VAVVAPLAPA IEKRLSGDFG ASVDFRAAYN DAATRMWESS
PILGVGPNNF GLEIRHYSPD LYVLMALDAL SSNESRNKVH LRSTAPVHNV YLLVLSELGL
LGLIGFVLFL LRGLVLAWRA SAASSQVTGL FALGLFCGII AEYVQQLIDY SLLWDPLLFT
ITLLMGVMHA IVATQEAHP