Gene M446_1075 details


Gene Information

Locus tag: M446_1075
Symbol:
ID: 6131530
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 1197448
End bp: 1199133
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 75%
IMG OID: 641641367
Product: WecB/TagA/CpsF family glycosyl transferase
Protein accession: YP_001768039
Protein GI: 170739384
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases; [COG1922] Teichoic acid biosynthesis proteins
TIGRFAM ID: [TIGR00696] bacterial polymer biosynthesis proteins, WecB/TagA/CpsF family
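
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues for a CDS, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1197448, 1199133)
print(length, protein_length_aa(length))  # prints: 1686 561
```

Both results agree with the Gene Length and Protein Length fields listed in the record.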


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.255325
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.693925
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGCTCG TCGTCTGCAC GGTGGGCCGG CTGGAACCCC TGGAGAGGCT GCTCGCCTCC 
CTGCGCCGCC AGACGCGCCG GCCCCTCGAG ATCCTCCTCG TCGACCAGAA CCCGGCCGGC
ACGCTCAGCG CCCTGCTCAC CCGCTTCCGC GACCTGCCCC TCGTGCACCT CGTCGACCTC
GCCGATGCGC GGGGCCTGTC GCGGGCCCGC AACCTCGGCC TGGCCTGCGC CCGCGGCAGC
GTGGTGGGCT TTCCGGACGA CGATTGCTGG TACGACCCCG AGGTCGTCGC GCGGGTCGCC
GACCTGTTCT CCGTCCCGGG AAGCCCGGGC CTGATCTGCG GGCGCACGGT CGATGCCGGG
GGCGCCGAAT CCGTCAGCGC GCATCTGCCG GTCCCGGCCG AGATCGCGCG CGACACCGTC
TTCCTGGCGG GCAATTCCAA CGGCCTGTTC GTGCGTCGGG GCCTCGCCAA GCGCGTCGGC
GGATTCGACG AGACGCTGGG AGTGGGCGCC GCGACGCCGT TCCAGTCCGG CGAGGAGACG
GACTTCATCC TGCGCGCCCT CGCTCTCGGC GCGTCCTGCC GCTTCGAGCC CGGCCTGGTC
GTCCGCCACG ACCAGCCCGA GGCGAATCCG GCCGCCGCGG CCGCGCGGGC CGCCCGCTAC
GCGCCGGGCT TCGGGCGGGT GCTGCGCCTG CACGGTTTCG GACCCGGCTA CGTCGGCAAC
CGCGTGCTGC GGGCCTTCGG GCGCGGGGCG CTCCTCCTCC TCGGCGGCCG CCGGGACGAC
GCGCGCCACC GCTTCGCCTG GGCGCTCGGC ACCCTGCGGG GCTACGCCGC CCCGGCCCGC
GCCCGCGCGG CGGCCCCGCC GCGCGGGGCC GCCGCGCGGG AGCCGGGCGC GCAGCCCAGG
CCCTTCGGCC TCTCCTTCGC GCCGCTCGAC GACGGGCAAC TCGCCCGGCA GCTGGCCGGC
CCGCTGGTTC CGGCCGGCGC GGGCCCGCGG ATCGTCGCCA CCGCCAACCT CGACCACATC
GTCCAGCTCT CGCGCAACAC CGTCTTCCGG GAAGCCTATC GCCGCGCCTG GATCGTCACC
GCCGACGGGA TGCCGGTCTA CCTCTACGCG AGGCTGCGCG GGGCGAAGCT GCCCGGCCGG
CTCACCGGCG CGGACCTGTT CGCGCGGCTG ATGACGATGC TCTCCCCGGC CCGGCACCGC
TGCTTCTTCG TGGCCTCCTC GGAGGAGACC GCCGCGCGGA TCGAGGCCCT GCTCCTCGCC
CGCGGCTTCT CGCGCGAGCA GCTCGCCTTC CGGGTGCCGC CCTTCGGTTT CGAGACCGAC
GCGGCCTATT CGGACGCCCT CGCGGGGGCG ATCCGGGCCC ACCGCGCCAC CCACCTGTTC
CTCGGCCTCG GCTCGCCGAA ATGCGAGATC TGGAGCCACC GCTACCGCGG CGCCCTCGGC
GACTGCTACG TGCTCAACGT CGGCGCCGGC CTCGACTTCT ACAGCGGGAC CAAGCGGCGC
GCCCCGGTCG TCCTCCAGCG GACCGGCCTC GAATGGGCGT GGCGCGTGGC CCAGGAGCCG
CGTCGGCTGT TCCACCGCTA CTTCGTCGCC TCCTGGCGCT TCCTCTGGAT CGCCGCCGCC
GACTTCGCCC GGTCGGACCG CACCCTGCCC CCCTCACGCG CCATCGAGGT GGAACGCCAT
CGATGA
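
The GC content field reported for this gene can in principle be recomputed from the gene sequence above. A minimal sketch follows, assuming GC content is defined as the fraction of G and C bases over all bases and that the blanks separating the 10-base blocks are ignored. The function name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence above: 4 G + 3 C out of 10 bases.
print(gc_content("GTGTCGCTCG"))  # prints: 0.7
```

Passing the full gene sequence, with or without its spaces and line breaks, gives the value to compare with the GC content field.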
 
Protein sequence
MSLVVCTVGR LEPLERLLAS LRRQTRRPLE ILLVDQNPAG TLSALLTRFR DLPLVHLVDL 
ADARGLSRAR NLGLACARGS VVGFPDDDCW YDPEVVARVA DLFSVPGSPG LICGRTVDAG
GAESVSAHLP VPAEIARDTV FLAGNSNGLF VRRGLAKRVG GFDETLGVGA ATPFQSGEET
DFILRALALG ASCRFEPGLV VRHDQPEANP AAAAARAARY APGFGRVLRL HGFGPGYVGN
RVLRAFGRGA LLLLGGRRDD ARHRFAWALG TLRGYAAPAR ARAAAPPRGA AAREPGAQPR
PFGLSFAPLD DGQLARQLAG PLVPAGAGPR IVATANLDHI VQLSRNTVFR EAYRRAWIVT
ADGMPVYLYA RLRGAKLPGR LTGADLFARL MTMLSPARHR CFFVASSEET AARIEALLLA
RGFSREQLAF RVPPFGFETD AAYSDALAGA IRAHRATHLF LGLGSPKCEI WSHRYRGALG
DCYVLNVGAG LDFYSGTKRR APVVLQRTGL EWAWRVAQEP RRLFHRYFVA SWRFLWIAAA
DFARSDRTLP PSRAIEVERH R
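
The protein sequence can be related to the gene sequence through the translation table listed in the record (table 11). The sketch below assumes the standard codon assignments, which table 11 shares, and treats an alternative start codon such as GTG as methionine at the first position. `START_CODONS` is only a subset of the initiators that table 11 permits, and all names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (standard code, also used by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a subset of the start codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(sequence: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGTCGCTCG TCGTCTGCAC GGTGGGCCGG"))  # prints: MSLVVCTVGR
```

The printed residues match the first block of the protein sequence, which begins with M even though the gene starts with GTG.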