Gene Information |
Locus tag | Mext_1856 |
Symbol | |
ID | 5831624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 2075146 |
End bp | 2076756 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641367655 |
Product | exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
Protein accession | YP_001639326 |
Protein GI | 163851283 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase; [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
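The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `span_length` and `expected_protein_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table above.
START_BP = 2075146
END_BP = 2076756


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_length = span_length(START_BP, END_BP)          # 1611, matches Gene Length
protein_length = expected_protein_length(gene_length)  # 536, matches Protein Length
print(gene_length, protein_length)
```

Under these assumptions, both derived values agree with the Gene Length (1611 bp) and Protein Length (536 aa) fields of the record.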
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.508023 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.726353 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCACA CTCGACGACC GGGGAGGCTC GATGTTTTCA CGTGGAACAT TCCGATGAGC GCCATCGACG TTCGGGATCT GCTGAAGGTC GCGAGGGAAA GCGGCGCACT CACATCCGTG CCGACGCCGC TCGTGCTCCA CAGCGACGAG AATTCTGATT CCGCGCCGTC CGCGTCGAAC CCCGCGCCGA CCCCACCGAA AGGGGCTTGG CTGTCGCCGG TCGTGCTCGC CGGTTGCGTG CGGCTCGCCG AATTCTGCGG CTTGATCCTG CTCGGTTTGG CTCTGCATCA GGCCTTGCTG CGCGGCGTCG TGCCCCTCGC CCCGCGCTAC CACGCCGCCA TCTTGGCGGT GACCCTTGCG GCGCTCGGAC TGTTTCAGGC TTCGGGCAGT TACCGGATCA GCGCATTCCG CGATCTGCCG AGAACGGCGG TGAAGCTCGC CACCGGCTGG TCCATCGCGT TCCTGATGGT GGCCGCGGCC ATGGTGCTCG CCAAGGTCGC CGATCATTAC TCCCGGATCT GGCTGCTCAG CTACTACATG GCCGGCCTTG GCATCCTGCT CGGCGGCCGC GCGGCGCTCT CGGCCTTCGT CCGCCTGCAG ATGGCCAAGG GGCGCTTCGA CCGTCGCACC GCGATCGTCG GCGGCGGACC GGCGGCCGTG GAACTGATCC ATGCCCTGGA AGCGAGCGGC GACAACGGCA TCCGCATCAT CGGGATCTTC GATGACCGGG GCGACGACCG ATCCAGCACG GACGTCGCCG GCTACCCCAA GCTCGGCAAT GTCAGCGACC TCGTCACCTA TGCCCGCCAC GCGCCCGTCG ATCTCGTGGT GTTCACCCTG CCGATCTCGG CCGAGACGCG CATCCTGCAG ATGCTCGCCA AGCTCTCGGT TCTGCCGGTC GATATCCGCC TCTCAGCCCA TGCGACCAAG CTGCGCCTGC GCCCGCGCGC CTATTCCTAT CTCGGCGGCG TGCCGCTGCT CGACGTCTTC GACAAGCCGC TAGCCGATTG GGACGTCATC CTGAAGGGCG CGTTCGACCG CGTCGTCGGC CTGCTGCTGC TGCTGGCCCT CTCACCGGCG ATGATCGCTG TGGCGCTCGC GGTGAAGCTC ACTTCGCCGG GGCCGGTGCT GTTCCGGCAG AAGCGCTACG GCTTCAACAA CGAGCTCATC GAGATCTTCA AGTTCCGCTC GATGTACGTC GATCTCTGCG ACGCGGGCGC ATCGCAGCTC GTCACCAAGA CCGATGCCCG GGTGACGCCC GTGGGCCGCT TCATCCGCAA GACATCGCTG GACGAGCTAC CTCAGCTATT CAACGTGATC CGCGGCGATC TCTCGCTGGT CGGGCCGCGC CCGCATGCGG TCCAGGCCAA GGCGGCGAAC ACCCTCTACG ATCAGGTGGT GGACGGGTAC TTCGCCCGCC ACAAGGTCAA ACCCGGCATC ACCGGCTGGG CGCAGATCAA TGGCTGGCGC GGCGAGACCG ACACCAGCGA GAAGCTCCAG CGCCGGGTGG AGCACGACCT GCACTACATC GAGAATTGGT CGATCCTGTT CGACCTCAAG ATCCTGCTCA CCACGCCGCT CGCGCTCTTC AAGACCGACA ACGCGTATTG A
|
Protein sequence | MRHTRRPGRL DVFTWNIPMS AIDVRDLLKV ARESGALTSV PTPLVLHSDE NSDSAPSASN PAPTPPKGAW LSPVVLAGCV RLAEFCGLIL LGLALHQALL RGVVPLAPRY HAAILAVTLA ALGLFQASGS YRISAFRDLP RTAVKLATGW SIAFLMVAAA MVLAKVADHY SRIWLLSYYM AGLGILLGGR AALSAFVRLQ MAKGRFDRRT AIVGGGPAAV ELIHALEASG DNGIRIIGIF DDRGDDRSST DVAGYPKLGN VSDLVTYARH APVDLVVFTL PISAETRILQ MLAKLSVLPV DIRLSAHATK LRLRPRAYSY LGGVPLLDVF DKPLADWDVI LKGAFDRVVG LLLLLALSPA MIAVALAVKL TSPGPVLFRQ KRYGFNNELI EIFKFRSMYV DLCDAGASQL VTKTDARVTP VGRFIRKTSL DELPQLFNVI RGDLSLVGPR PHAVQAKAAN TLYDQVVDGY FARHKVKPGI TGWAQINGWR GETDTSEKLQ RRVEHDLHYI ENWSILFDLK ILLTTPLALF KTDNAY
|
| |
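The relationship between the Gene sequence, Protein sequence, Translation table, and GC content fields can be sketched in a few lines. This is a minimal illustration, not the database's own pipeline: the function names `clean`, `gc_content`, and `translate` are hypothetical. It assumes the gene sequence is displayed in coding orientation (it begins with ATG even though the gene is on the minus strand) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as starts.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 shares these
# amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
prefix = "ATGCGGCACA CTCGACGACC GGGGAGGCTC"
print(translate(prefix))  # MRHTRRPGRL, the first 10 residues of the protein
```

Passing the full Gene sequence field to `translate` and `gc_content` would allow the Protein sequence and GC content fields to be checked in the same way.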