Gene Information |
Locus tag | Mchl_5050 |
Symbol | |
ID | 7118924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | - |
Start bp | 5402049 |
End bp | 5404172 |
Gene Length | 2124 bp |
Protein Length | 707 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643527744 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_002423743 |
Protein GI | 218532927 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
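
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the database record: the function name `check_gene_record` is hypothetical, and it assumes that start and end positions are 1-based and inclusive and that the protein length excludes the stop codon.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True when the record's lengths agree with its coordinates."""
    # Assumption: coordinates are 1-based and inclusive on the replicon.
    span = end_bp - start_bp + 1
    # Assumption: an unspliced CDS has one codon per residue plus one stop codon.
    expected_protein = gene_length_bp // 3 - 1
    return (
        span == gene_length_bp
        and gene_length_bp % 3 == 0
        and expected_protein == protein_length_aa
    )


# Values copied from the Gene Information table for Mchl_5050.
print(check_gene_record(5402049, 5404172, 2124, 707))  # True
```

Under these assumptions the record is internally consistent: the span is 2124 bp, which corresponds to 708 codons, or 707 residues plus a stop codon.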
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.376238 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.397595 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATGA TCGAGCGGAT GCCTTCGCGG TTCTTCGTCG GCGCCGAGCC GGGCAAGCCG GACGTGACGC CTGAACCCTG GTTCCTCGAC CCGCGTGAGA TCGGACGGGC CCTGCGCGCG CGCTGGGCGC TCGTGCTGGC CCCGGCTGTG CTCCTCTTGG TGGCGGCCGT GGCGTGGCTC GCGCTGGTGC CGCCGCTCTA CGCCGCCGTG ACGCAGATCC TGATCGACCC GCGCGGCATC CAAGTGGTCA AGGACGGCGT GACGCCCTCG GACCAGGCGA GCGACGCGAG CCTGTTCCTC GTCGATAGCC AGATCCGGGT CCTCATCTCC GACGAGGTGC TGCGGCAGGT CGTGACTCGG TTCAAGCTCG ATCAGGACCC GGACTTCGTT CGTCCCGCCT CGCCGCTCGA GACGCTCAAG AGCCGCCTCT CCTCGCTGAT CGTCACCGCC GGCGGCCCTG CCGACGACAC GCTCACCGCC CTGCGCACGC TGCGCGAGCG CACCACCGCG CGCCGCCTGG AGCGCAGCTT CGTGGTCGAA CTCGCCGTCT CCAGCGAGGA ACGCCGGAAA TCCGCCGAGC TCGCCCAGGC CATCGCCGAA ACCTACCTCA CCACCGTCTC GCAGGCGCAG GCGCAGGTCA CCCGCAAGGC CGGCGAGGCG GTGTCGAGCC GGCTCGGCGA GTTGCAGGAC GACCTCCGGC AGGCCGAGGA CAAGGCGCAG AAGTTCCGCG CCGCCAACAA CCTCGTCGGC ACCCGCGGCC AGCTCGTCAG CGAGCAGGCG CTGACCCAGC TCAACCAGCA GCTCGGCGCG GCGCGTGCCC GGGCCGGCGA GCTGCGCGGG CGGCTCGCCC AAATCGAGGC GGTCGCCAAC GGGCGGGCCG ACCTCAACTC GGTGACCGAA ATCGTCCAGT CCACGACGGT CGCGCAATTG CGCGCCCAGC TCGCCCAGAT CGAGGCAGCC AGGGCCGACA CCCTGTCCAA CCTCGGGCCC CGTCACCCCA CCCTGCGCAC CGGCGAGTTG CAGGTGCAGA CCCTGCGCAA CGACATCAAT GCCGAGATCC GCCGCATCGC CGCGGCCACC CGCAACGATT ACCGGTCGGC ATTGTCCAAC GAGGCCTCGC TCGCCGCCAC CCTGGAGAGC CGCAAGAAGG AGGCTCTGTC CGTCGACAAG AGCTTCGTGC GCCTGCGCGA ACTGGAGCGG CAGGTCGAAG CGAGCCGTGC GGTCTACGAG GCCTTCCTCG TCCGCGCCCG CGAGCTTCAG GAGCAGCAGC GCCTCGACAC CTCGACCTCA CGCGTCATCT CGCCCGCCTC ACTGCCGGAG CGCCGGCTCG GCCCGCCGAT CCCGGCCATC TTCGCCGCGG CGCTGGCGGC CGGGCTCGGC CTCGGCACCG CGCTCGCCCT CCTCGCCGTG CCGGCCGCGG GGCGGATCGG TTCGCGCCGC CGGTTTCAGC AGCTCGCGGG GCTCCCCGTG GTCGCCGCCC TGCCGGCCAA GGTGCCGACC AGGACGCGGA GCAAGGCAGG CAGCGAATCC CTGCGCGCCG ACACCGCCTA CGACGTGGCC GTGGCCCGTC TCGGCAGCCG TCTGCAGCGC GATTTCGGGG CCACGCGGCC GACGGTGGTC CTCGTCACCT CGGCGGACGA CCGGAGCGGC AAGTCGGAGC TGGCGCGCAG CCTCGCCGCC TCGGCCGCGC TCGACGGCCA GCGGGTGCTG CTCGTCGATG CCGACCCGGA GGCGATGATC TCGCGCGATC TCCGGAGCCA GGCCAAGCGC GGCGCCGCCG AGGTGCTGCG GACGCATTCG 
GGGCTCGGTG ACGCGCTGGT CGAGGGGCCG ACCGGGGTCA AGATCCTGCC CTTCGACGAC GCGGCCCTGC GCCTCGGCAC CGCGGCCTAT ACCGGGGCGA TCCTGACGGC GGCTTCTGCC TTCGACACGG TCTTCGTCGA TATCGGGCTG ATCGGCACCG ACATCGCCGC CGAGCGCCTC GCCCAGGACC AGCGCTTCCC GGCCCTGCTG CTGACGGCCA GCGCCGCCCG CAGCGGCACC GCCCGGCTGC GGCGGGCGCT CGACGCCCTC GGCCGCGACC CGCGGGTGCA GCTCGTCATG ACCGACGCCG AGGCCGAGGG GTGA
|
Protein sequence | MTMIERMPSR FFVGAEPGKP DVTPEPWFLD PREIGRALRA RWALVLAPAV LLLVAAVAWL ALVPPLYAAV TQILIDPRGI QVVKDGVTPS DQASDASLFL VDSQIRVLIS DEVLRQVVTR FKLDQDPDFV RPASPLETLK SRLSSLIVTA GGPADDTLTA LRTLRERTTA RRLERSFVVE LAVSSEERRK SAELAQAIAE TYLTTVSQAQ AQVTRKAGEA VSSRLGELQD DLRQAEDKAQ KFRAANNLVG TRGQLVSEQA LTQLNQQLGA ARARAGELRG RLAQIEAVAN GRADLNSVTE IVQSTTVAQL RAQLAQIEAA RADTLSNLGP RHPTLRTGEL QVQTLRNDIN AEIRRIAAAT RNDYRSALSN EASLAATLES RKKEALSVDK SFVRLRELER QVEASRAVYE AFLVRARELQ EQQRLDTSTS RVISPASLPE RRLGPPIPAI FAAALAAGLG LGTALALLAV PAAGRIGSRR RFQQLAGLPV VAALPAKVPT RTRSKAGSES LRADTAYDVA VARLGSRLQR DFGATRPTVV LVTSADDRSG KSELARSLAA SAALDGQRVL LVDADPEAMI SRDLRSQAKR GAAEVLRTHS GLGDALVEGP TGVKILPFDD AALRLGTAAY TGAILTAASA FDTVFVDIGL IGTDIAAERL AQDQRFPALL LTASAARSGT ARLRRALDAL GRDPRVQLVM TDAEAEG
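
The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way such a display could be normalized and sanity-checked before further use. All function names are hypothetical. The start-codon test is a simplification: translation table 11 also permits alternative start codons, whereas this check accepts only `ATG`.

```python
def normalize_sequence(blocks: str) -> str:
    """Join whitespace-separated display blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def looks_like_complete_cds(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """Check length divisible by 3, ATG start, and a terminal stop codon.

    Simplification: only ATG is accepted as a start codon here.
    """
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in stops


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# Toy input built from the first blocks of the gene sequence plus a stop codon.
toy = normalize_sequence("ATGACGATGA TCGAGTGA")
print(toy)                           # ATGACGATGATCGAGTGA
print(looks_like_complete_cds(toy))  # True
```

Applied to the full gene sequence, `gc_percent` could be compared with the GC content field in the table, allowing for rounding.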