Gene Information |
Locus tag | Mext_4129 |
Symbol | |
ID | 5833620 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4595458 |
End bp | 4597662 |
Gene Length | 2205 bp |
Protein Length | 734 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641369919 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_001641569 |
Protein GI | 163853526 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0489] ATPases involved in chromosome partitioning [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
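The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4595458
end_bp = 4597662
gene_length_bp = 2205
protein_length_aa = 734


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under those assumptions the span is 2205 bp, which is 735 codons, or 734 residues plus one stop codon, matching the Gene Length and Protein Length fields.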
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.62537 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCCGT TCCACGAAGC CGACCCGCCC GGCGGCGGGC GATCGCTGAC CTCCCATTAC GAGGCGGTCA CCGGACACGC CCTGGCGGTG CTCTGGCGGC GCCGGATCAT GATCGCCTCG ATCGCCGTGC TGTCGGCGCT GGCAGCGGGG GCGGCCACGC TCGCCCTGCC GCCGAGCTAC ACCGCCGAGG TGCTGCTCCA GTTCGATTTC GGCCAGACCC AGGATGTGCG CGGGACGGGC AAGTCCGGCC CGCAGATCGC CCTCGAACCG GCTTCGGTCG TCGAGAGCGA GGCGCGCATC ATCCGCTCGT TTGCCATCGC GCAATCCGTG GCGGGCCGGC TTGAGGACGC CGGGCAAGCC AAGCAGGCGG CCCAGGCGGG GCCTGCCGCC AAGGAGGCCT CCGCGGGATG GGTCGGCTCC CTGAGGCAGG CCCTCGAGAA CCGGGTTCAG GCCGCGCTGC CCGCCCTCCC CGGCACCCGT CCGGAAGAAA CCGCGCAGCA GGAGACCGAG CGGGACGCCC AGGCCCAGCG GCTCCTGCGC GGGCTTACCG TCGGCAACGA CGCCAAGGCC TACCTGATCA CCATCGGCTA CCGCGACCGC GACCCGAAGC GGGCGGCGGA GACCGCCAAC ACCTTCGCCA CCGAGTATCT CGCCCGCAAG ATGCAGGCGG CCTCGCTCGC CACCGAGCGG GCGAGCCGCT GGTACGCCGA GCAGATCCAG GTCTCGCGCG GCGAACTCGC CCGGCTGGAC AGCGAGATCG ACGCCTTCCG CAAGCGCACG GGCTTCGTCG AGTCGGTGCG CGAGGGCGTG GACCTGCGCG AGCAGCAGTT GCGCGACGCC CTCACCGAAC TCTCCAGCGC CTCGGCCAAG CGGCTGGCGG AGGAGGCCCG GCTGCAGCGC GCCGGGGACG TGATGCGCTT CGGCGGCGTG CCCAACGCCT CCGACACGAC GATCTCGCCG GTGATCCAGC AGATGCTGGC CGATGACAGC AAGCTGCGCC AGGAACTGGA GCAGCTCACC GCCGTCTCCG GCGACAACCA CCCCAGCGTG CTGCGGCTCA AGGCCTCGAT CGCCGCGCTC CAGCAGCGGC TTCAGCTCCG CATGGCCGAG GTGCTGAAAC TGGTCGAGGT CGATCTCGGC GCCGCCCGCG GGCTCGAAGC CTCGGCCGAG GCGCGGGTCG CGACCGTGCG CCGCTCGCTC ATCGAGAGCA AGGCGCTCGA ATCGAAGCTC CGCGCACTCC AGGCCGACGC GACGGCCGCC CGCGACCGCC TGCGCGTCCT CGACGAGGGC TACCAGACCG CCAACGCGCT GAGCGAACTC AAGCCGGTGA TGGCGCAGAT GCTGGCGCCG GCCAAGGTGC CGACCCTGCC CACCGGGCCC GGCCTCGGCC TGGTGCTCGG CCTCGGCCTG TTCGCCGGGG CCGGCGCCGG CTCGGTGCTG GCGCTCGGGC TCGACCGCCG CGACCGGGGC TACCGCAGCC AGGGCGAGAT GGAGGCCGAG ACCGGCCTGC CCTGCCTCGC GCTGCTGCCG AACCGGCCGG CGGGTGCGCA ATCGGGCGAG CGCCTGGAAC AGGATCTCCT CTTCCGCGAG GCGGTGCGGG CGGCCGTGGC CGAACTCGTC GCGGTGCGCG AGCCGCTCAA GGTCGTCCTC GTCACCTCGG CGATGCCCGG CGAGGGCAAG TCCGTCGTCG CCCGCTCGGT GGCACGCTCG CTCGCCGCGA TGGGGCGCCG GGTGCTGATC CTCGACGGCT CGCCCCGCCG CGCGGCGATC GAGGACGGTC GCCACGGCAA TCTCGGACAC 
GAGACGCCCA CCGACGACGA GGAGGCGGAG GGCCGCATCT CGACCATCCG CCGCGTCTCC GGCCTCAAGG ACGGGCACGA CATCTACGCC GAGCCGACCT TCGGCATGCT CGTGCGGGAG GCGCGCGAGC GCTACGACAT CATCCTCGTA GAGGTGCCCC CGGTGCTGTT GCTGGGCGAC GTCGCGCTCC TGCGCCAGCA CGCCGATGCC GTGGTCCATG TCGTGGCGTG GCACGGCACG CAGAAGGCCG CCGTCACCGC GAGCCTCGCC TATATGCGCC GGCTCGGGCT GACGGTCGCC GGGCTCGTCC TGAACAAGGT CGATCTGAAG CGCCACGAGG GCCGCGCCGC GGATCGCGGA ACGCTCTACC GCGACTTCTC GCACTACTAC CGGAACAGCG CGTGA
Protein sequence | MDPFHEADPP GGGRSLTSHY EAVTGHALAV LWRRRIMIAS IAVLSALAAG AATLALPPSY TAEVLLQFDF GQTQDVRGTG KSGPQIALEP ASVVESEARI IRSFAIAQSV AGRLEDAGQA KQAAQAGPAA KEASAGWVGS LRQALENRVQ AALPALPGTR PEETAQQETE RDAQAQRLLR GLTVGNDAKA YLITIGYRDR DPKRAAETAN TFATEYLARK MQAASLATER ASRWYAEQIQ VSRGELARLD SEIDAFRKRT GFVESVREGV DLREQQLRDA LTELSSASAK RLAEEARLQR AGDVMRFGGV PNASDTTISP VIQQMLADDS KLRQELEQLT AVSGDNHPSV LRLKASIAAL QQRLQLRMAE VLKLVEVDLG AARGLEASAE ARVATVRRSL IESKALESKL RALQADATAA RDRLRVLDEG YQTANALSEL KPVMAQMLAP AKVPTLPTGP GLGLVLGLGL FAGAGAGSVL ALGLDRRDRG YRSQGEMEAE TGLPCLALLP NRPAGAQSGE RLEQDLLFRE AVRAAVAELV AVREPLKVVL VTSAMPGEGK SVVARSVARS LAAMGRRVLI LDGSPRRAAI EDGRHGNLGH ETPTDDEEAE GRISTIRRVS GLKDGHDIYA EPTFGMLVRE ARERYDIILV EVPPVLLLGD VALLRQHADA VVHVVAWHGT QKAAVTASLA YMRRLGLTVA GLVLNKVDLK RHEGRAADRG TLYRDFSHYY RNSA
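The sequences above are displayed in space-separated groups of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names are hypothetical, and the example string is only the first three groups of the gene sequence, so its GC value is not expected to equal the 73% reported for the whole gene.

```python
def clean_sequence(raw):
    """Remove the whitespace used for 10-character grouping and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base groups of the gene sequence shown above.
example = "ATGGATCCGT TCCACGAAGC CGACCCGCCC"

seq = clean_sequence(example)
print(len(seq))
print(round(100 * gc_fraction(seq)), "% GC")
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` in the same way would give a value that can be compared against the GC content field.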