Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_1075 |
Symbol | |
ID | 6131530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 1197448 |
End bp | 1199133 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641641367 |
Product | WecB/TagA/CpsF family glycosyl transferase |
Protein accession | YP_001768039 |
Protein GI | 170739384 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases [COG1922] Teichoic acid biosynthesis proteins |
TIGRFAM ID | [TIGR00696] bacterial polymer biosynthesis proteins, WecB/TagA/CpsF family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.255325 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.693925 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGCTCG TCGTCTGCAC GGTGGGCCGG CTGGAACCCC TGGAGAGGCT GCTCGCCTCC CTGCGCCGCC AGACGCGCCG GCCCCTCGAG ATCCTCCTCG TCGACCAGAA CCCGGCCGGC ACGCTCAGCG CCCTGCTCAC CCGCTTCCGC GACCTGCCCC TCGTGCACCT CGTCGACCTC GCCGATGCGC GGGGCCTGTC GCGGGCCCGC AACCTCGGCC TGGCCTGCGC CCGCGGCAGC GTGGTGGGCT TTCCGGACGA CGATTGCTGG TACGACCCCG AGGTCGTCGC GCGGGTCGCC GACCTGTTCT CCGTCCCGGG AAGCCCGGGC CTGATCTGCG GGCGCACGGT CGATGCCGGG GGCGCCGAAT CCGTCAGCGC GCATCTGCCG GTCCCGGCCG AGATCGCGCG CGACACCGTC TTCCTGGCGG GCAATTCCAA CGGCCTGTTC GTGCGTCGGG GCCTCGCCAA GCGCGTCGGC GGATTCGACG AGACGCTGGG AGTGGGCGCC GCGACGCCGT TCCAGTCCGG CGAGGAGACG GACTTCATCC TGCGCGCCCT CGCTCTCGGC GCGTCCTGCC GCTTCGAGCC CGGCCTGGTC GTCCGCCACG ACCAGCCCGA GGCGAATCCG GCCGCCGCGG CCGCGCGGGC CGCCCGCTAC GCGCCGGGCT TCGGGCGGGT GCTGCGCCTG CACGGTTTCG GACCCGGCTA CGTCGGCAAC CGCGTGCTGC GGGCCTTCGG GCGCGGGGCG CTCCTCCTCC TCGGCGGCCG CCGGGACGAC GCGCGCCACC GCTTCGCCTG GGCGCTCGGC ACCCTGCGGG GCTACGCCGC CCCGGCCCGC GCCCGCGCGG CGGCCCCGCC GCGCGGGGCC GCCGCGCGGG AGCCGGGCGC GCAGCCCAGG CCCTTCGGCC TCTCCTTCGC GCCGCTCGAC GACGGGCAAC TCGCCCGGCA GCTGGCCGGC CCGCTGGTTC CGGCCGGCGC GGGCCCGCGG ATCGTCGCCA CCGCCAACCT CGACCACATC GTCCAGCTCT CGCGCAACAC CGTCTTCCGG GAAGCCTATC GCCGCGCCTG GATCGTCACC GCCGACGGGA TGCCGGTCTA CCTCTACGCG AGGCTGCGCG GGGCGAAGCT GCCCGGCCGG CTCACCGGCG CGGACCTGTT CGCGCGGCTG ATGACGATGC TCTCCCCGGC CCGGCACCGC TGCTTCTTCG TGGCCTCCTC GGAGGAGACC GCCGCGCGGA TCGAGGCCCT GCTCCTCGCC CGCGGCTTCT CGCGCGAGCA GCTCGCCTTC CGGGTGCCGC CCTTCGGTTT CGAGACCGAC GCGGCCTATT CGGACGCCCT CGCGGGGGCG ATCCGGGCCC ACCGCGCCAC CCACCTGTTC CTCGGCCTCG GCTCGCCGAA ATGCGAGATC TGGAGCCACC GCTACCGCGG CGCCCTCGGC GACTGCTACG TGCTCAACGT CGGCGCCGGC CTCGACTTCT ACAGCGGGAC CAAGCGGCGC GCCCCGGTCG TCCTCCAGCG GACCGGCCTC GAATGGGCGT GGCGCGTGGC CCAGGAGCCG CGTCGGCTGT TCCACCGCTA CTTCGTCGCC TCCTGGCGCT TCCTCTGGAT CGCCGCCGCC GACTTCGCCC GGTCGGACCG CACCCTGCCC CCCTCACGCG CCATCGAGGT GGAACGCCAT CGATGA
|
Protein sequence | MSLVVCTVGR LEPLERLLAS LRRQTRRPLE ILLVDQNPAG TLSALLTRFR DLPLVHLVDL ADARGLSRAR NLGLACARGS VVGFPDDDCW YDPEVVARVA DLFSVPGSPG LICGRTVDAG GAESVSAHLP VPAEIARDTV FLAGNSNGLF VRRGLAKRVG GFDETLGVGA ATPFQSGEET DFILRALALG ASCRFEPGLV VRHDQPEANP AAAAARAARY APGFGRVLRL HGFGPGYVGN RVLRAFGRGA LLLLGGRRDD ARHRFAWALG TLRGYAAPAR ARAAAPPRGA AAREPGAQPR PFGLSFAPLD DGQLARQLAG PLVPAGAGPR IVATANLDHI VQLSRNTVFR EAYRRAWIVT ADGMPVYLYA RLRGAKLPGR LTGADLFARL MTMLSPARHR CFFVASSEET AARIEALLLA RGFSREQLAF RVPPFGFETD AAYSDALAGA IRAHRATHLF LGLGSPKCEI WSHRYRGALG DCYVLNVGAG LDFYSGTKRR APVVLQRTGL EWAWRVAQEP RRLFHRYFVA SWRFLWIAAA DFARSDRTLP PSRAIEVERH R
|
| |