Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2537 |
Symbol | |
ID | 5833221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 2847943 |
End bp | 2849892 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641368338 |
Product | capsule polysaccharide biosynthesis protein |
Protein accession | YP_001640002 |
Protein GI | 163851959 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3563] Capsule polysaccharide export protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACC TGCGCCCCAT CGAGCCGCGA GGCGATACCG GGCTGCCCCG CACCTACCTG CCGCTCGGCC GGACGGTCCG GCTTCTGCGC GCGCAGATCG AGGCGGCAAC GGGCGCGTCG CTGACCTTCC GCTCCCCCGG CCCGCCGGCC CTGTGGTCCG CGACGGCAGC CGACGGCAAG TCGGTTCTGC GGGTCGCGCC GGGCCCGCTC CTCAGCCCCT ACGATTCGCC GGAGACCGGC CTTTCGAGCC TTAGCCTCAG CTATGCCGGC CGGCCCCTCG GAGGCGAGCC CCCTTCCCCG GATTTCGGCG ACCTTCTGCG CGCACGCGGC CTCGCCGGCT ACAACCGCTT CGCCCGCTCG CTTCCGGATG GAGCGGCCGA ACTCTTCTCA GGACAGACGC GGCCTGCCCT CGCCCTGATC GATCCGCAAA CCGCCGCGTC CGGCGCGGTC CTGCGAAGCT TCCTGACGCG GCTTCTCGCC GAGAATCCGG GGACGCGCTT CGTCGCCGCC GGCCCCGACG GATGCCCGCG TGATGGCGCG CCGACCGATC CGCGGCTCAC CCTGATCGCG GGCCCGGCCG ATCCGTGGCT GCTGTTTCCC CTCGCGGCGC GCATCCATGT GGCGCGATGG GACGCGGCCT GCGAGGCGGC GCTGGCCAGC TTCGAGACCT ATTGCCCCGA CCCTGCAGCG GGTCCGCGCC CGGCGGATGC GGCGGCGTTC CTGGCGCTCC GTTACGGCAT CGGCGTGCAC GGCTTCGATC CGTGGACGCG CCGGCCGATC CCGCTCGCGG ATGCGGTGGA GCGCGTGGCG TGGCTGCGCG ACCGCTTCCT CGGCAACGAC CGCCGCGTCG TCCTCGTCGG CGTGTCCGGC TGGAAGCGCG CCGCGCTCGA TGTCTTCGCC ACCGGGCCGG CGGGACCGCC ACTCCACACC ATGATCGCGG ACGAGGCGGT GGCGCTCGCT CGCGCGCAGG GCGGCCGGGT TCAGGCCTGG GCGACCCGCT GCCCGGAGCA GCTCCCGCGT CTCTGCGCGG AGGCCGGCTT GCCCTTCGCG CGGATCGAGG ACGGATTCCT GCGCTCGGTC GGGCTCGGCG CTAGCCTGGA GCCGGGCGCC TCCATCGTCG TGGACGATCT CGGCATCTAC TACGACCCGC GGGTGGAGAG CCGGCTGGCC CGCACGCTGA AGGAAGCCGA GTTCCCGCCC GGACTGACCG CCCGCGCCGC CGCCTTGCGC GAATCGATCG TGGCGCGGCG CCTGAGCAAG TACAATGTCG GCCTCGAAGG CGTCGGCGAG GACTGGCCGA CGGACCGGCG GATCGTGCTG GTGCCGGGCC AAGTCGAGGA CGACGCCTCG GTGCTGACCG GATCGCCGCA GGTGCGCGGC AACCTCGCCC TGCTGCGGGC CGCCCGCGCC CGCAACCCGG ACGCCTTCCT GCTCTACAAG CCGCATCCCG ACGTCGAGGC CGGGTTCCGT CCCGGTGCGA TCCCGGAGGA GGAGGTGCGG CGGCTCGCCG ACCGTGTCGT CGGCGGCCTC TCCATCGTCG ACCTGCTCGA CCGTTGCCAC CATGTCGAGA CCATGACCTC GCTGGCAGGC TTCGAGGCGC TGATCCGGGG GCTGAGCGTC GCCGTCCACG GCCGCCCCTT TTATGCCGGC TGGGGGCTGA CCGAGGATCT GGCACCGGGG GCCGACCGCG GTCGCACGCT GTCCCTCGAC GCCCTGGTGG CGGGTGCGCT GATCCTCTAC CCGCTCTATC TCGATCCGGT GGCGATGAAG CCGTGCACGC CCGAGCAACT GCTCGACCGG CTCAGCGAGG CCCGTGCGGC CGCGCCGCCC TCGCGTCTCG CCCTCGGCGC GGTCCGTCAC GCGGCGATGC GGCTGCGCTA CGCCCTGATC AATCCGGTCA TCCGCCGCCT ACGCGCTCGC CGCGGCGTGC GCAGTGAATC CGGCCGCTGA
|
Protein sequence | MTDLRPIEPR GDTGLPRTYL PLGRTVRLLR AQIEAATGAS LTFRSPGPPA LWSATAADGK SVLRVAPGPL LSPYDSPETG LSSLSLSYAG RPLGGEPPSP DFGDLLRARG LAGYNRFARS LPDGAAELFS GQTRPALALI DPQTAASGAV LRSFLTRLLA ENPGTRFVAA GPDGCPRDGA PTDPRLTLIA GPADPWLLFP LAARIHVARW DAACEAALAS FETYCPDPAA GPRPADAAAF LALRYGIGVH GFDPWTRRPI PLADAVERVA WLRDRFLGND RRVVLVGVSG WKRAALDVFA TGPAGPPLHT MIADEAVALA RAQGGRVQAW ATRCPEQLPR LCAEAGLPFA RIEDGFLRSV GLGASLEPGA SIVVDDLGIY YDPRVESRLA RTLKEAEFPP GLTARAAALR ESIVARRLSK YNVGLEGVGE DWPTDRRIVL VPGQVEDDAS VLTGSPQVRG NLALLRAARA RNPDAFLLYK PHPDVEAGFR PGAIPEEEVR RLADRVVGGL SIVDLLDRCH HVETMTSLAG FEALIRGLSV AVHGRPFYAG WGLTEDLAPG ADRGRTLSLD ALVAGALILY PLYLDPVAMK PCTPEQLLDR LSEARAAAPP SRLALGAVRH AAMRLRYALI NPVIRRLRAR RGVRSESGR
|
| |