Gene Information |
Locus tag | Mext_4589 |
Symbol | |
ID | 5834888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 5128075 |
End bp | 5130198 |
Gene Length | 2124 bp |
Protein Length | 707 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641370383 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_001642028 |
Protein GI | 163853985 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
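The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database tooling.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, so it does not contribute an amino acid.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values copied from the record for Mext_4589.
print(cds_lengths(5128075, 5130198))  # (2124, 707)
```

The result matches the listed gene length of 2124 bp and protein length of 707 aa.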
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.260554 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATGA TCGAGCGGAT GCCTTCGCGG TTCTTCGTCG GCGCCGAGCC GGGCAAGCCG GACGTGACGC CTGAACCCTG GTTCCTCGAC CCGCGTGAGA TCGGACGGGC CCTGCGCGCG CGCTGGGCGC TCGTGCTGGC CCCGGCTGTG CTCCTCTTGG TGGCGGCCGT GGCGTGGCTC GCGCTGGTGC CGCCGCTCTA CGCCGCCGTG ACGCAGATCC TGATCGACCC GCGCGGCATC CAAGTGGTCA AGGACGGCGT GACGCCCTCG GACCAGGCGA GCGATGCGAG CCTGTTCCTC GTCGACAGCC AGATCCGCGT CCTCATCTCC GACGAGGTGC TGCGGCAGGT CGTGACTCGG TTCAAGCTCG ACCAGGACCC GGACTTCGTT CGTCCCGCCT CGCCGCTCGA GACGCTCAAG AGCCGCCTCT CCTCGCTGAT CGTCACCGCC GGCGGCCCTG CCGACGACAC GCTCACCGCC CTGCGCACGC TGCGCGATCG CACGACCGCG CGCCGCCTGG AGCGCAGCTT CGTGGTCGAA CTCGCCGTCT CCAGCGAGGA ACGCCGGAAA TCCGCCGAGC TCGCCCAGGC CATCGCCGAA ACCTACCTCA CCACCGTCTC GCAGGCGCAG GCGCAGGTCA CCCGCAAGGC CGGCGAGGCG GTGTCGAGCC GGCTCGGCGA GTTGCAGGAC GACCTCCGGC AGGCCGAGGA CAAGGCGCAG AAGTTCCGCG CCGCCAACAA CCTCGTCGGC ACCCGCGGCC AGCTCGTCAG CGAGCAGGCG CTGACCCAGC TCAACCAGCA GCTCGGCGCG GCGCGTGCCC GGGCCGGCGA GCTGCGCGGG CGGCTCGCCC AGATCGAGGC GGTCGCCAAC GGGCGGGCCG ACCTCAATTC GGTGACCGAG ATCGTCCAGT CCACGACGGT CGCGCAATTG CGCGCCCAGC TCGCCCAGAT CGAGGCGGCC CGGGCCGACA CCCTGTCCAA CCTCGGCCCC CGTCACCCCA CCCTGCGCAC CGGCGAGCTG CAGGTGCAGA CCCTGCGCAA CGACATCAAC GCCGAAATCC GCCGCATCGC CGCGGCCACC CGCAACGACT ACCGGTCGGC CTTGTCCAAC GAGGCCTCAC TCGCCGCCAC CCTGGAGAGC CGCAAGAAGG AGGCTCTGTC CGTCGACAAG AGCTTCGTGC GCCTGCGCGA ACTGGAGCGG CAGGTCGAAG CGAGCCGCGC GGTCTACGAG GCCTTCCTCG TCCGCGCCCG CGAGCTTCAG GAGCAGCAGC GCCTCGACAC CTCGACCTCG CGCGTCATCT CGCCCGCCTC GCTGCCGGAG CGCCGGCTCG GCCCGCCGAT CCCGGCCATC TTCGCCGCGG CGCTGGCGGC CGGGCTCGGC TTCGGCACCG CGCTCGCCCT CCTCGCCGTG CCGGCCGCGG GGCGGATCGG TTCGCGCCGC CGGTTTCAGC AGCTCGCGGG GCTCCCCGTG GTCGCCGCCC TGCCGGCCAA GGTGCCGACC CGGACGCGGA GCAAGGCCGG CAGCGAATCC CTGCGCGCCG ACACCGCCTA CGACGTGGCC GTGGCCCGTC TCGGCAGCCG TCTGCAGCGC GATTTCGGCG CCACGCGGCC GACGGTGGTC CTCGTCACCT CGGCGGACGA CCGGAGCGGC AAGTCGGAGC TGGCGCGCAG CCTCGCCGCC TCGGCTGCGC TCGACGGCCA GCGGGTGCTG CTCGTCGATG CCGACCCGGA GGCGATGATC TCGGGCGATC TCCGGAGCCA GGCCAAGCGC GGCGCCGCCG ACGTGCTGCG GACGCATTCG 
GGGCTCGGCG ACGCGTTGGT CGAGGGGCCG ACCGGGGTCA AGATCCTGCC CTACGACGAC GCGGCCCTGC GCCTCGGCAC CGCGGCCTAT ACCAGTGCGA TCCTGACGGC GGCTTCTGCC TTCGACACGG TGTTCGTCGA TATCGGGCTG ATCGGCACCG ACATCGCCGC CGAGCGTCTC GCCCAGGACC AGCGCTTCCC GGCGCTGCTT CTGACGGCCA GCGCCGCCCG CAGCGGCACC GCCCGGCTGC GGCGGGCGCT CGACGCCCTC GGCCGCGACC CGCGGGTGCA GCTCGTCATG ACCGACGCCG AGGCCGAGGG GTGA
|
Protein sequence | MTMIERMPSR FFVGAEPGKP DVTPEPWFLD PREIGRALRA RWALVLAPAV LLLVAAVAWL ALVPPLYAAV TQILIDPRGI QVVKDGVTPS DQASDASLFL VDSQIRVLIS DEVLRQVVTR FKLDQDPDFV RPASPLETLK SRLSSLIVTA GGPADDTLTA LRTLRDRTTA RRLERSFVVE LAVSSEERRK SAELAQAIAE TYLTTVSQAQ AQVTRKAGEA VSSRLGELQD DLRQAEDKAQ KFRAANNLVG TRGQLVSEQA LTQLNQQLGA ARARAGELRG RLAQIEAVAN GRADLNSVTE IVQSTTVAQL RAQLAQIEAA RADTLSNLGP RHPTLRTGEL QVQTLRNDIN AEIRRIAAAT RNDYRSALSN EASLAATLES RKKEALSVDK SFVRLRELER QVEASRAVYE AFLVRARELQ EQQRLDTSTS RVISPASLPE RRLGPPIPAI FAAALAAGLG FGTALALLAV PAAGRIGSRR RFQQLAGLPV VAALPAKVPT RTRSKAGSES LRADTAYDVA VARLGSRLQR DFGATRPTVV LVTSADDRSG KSELARSLAA SAALDGQRVL LVDADPEAMI SGDLRSQAKR GAADVLRTHS GLGDALVEGP TGVKILPYDD AALRLGTAAY TSAILTAASA FDTVFVDIGL IGTDIAAERL AQDQRFPALL LTASAARSGT ARLRRALDAL GRDPRVQLVM TDAEAEG
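The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the strand is listed as "-"), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGACGATGA TCGAGCGGAT GCCTTCGCGG"))  # MTMIERMPSR
```

The output agrees with the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` is one way to compare against the listed GC content.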