Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_4578 |
Symbol | |
ID | 7117973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 4851240 |
End bp | 4852715 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643527276 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_002423281 |
Protein GI | 218532465 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.290602 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0783 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACCT ACACAGCCAC GCGCGGAGCC ATTCCGCTCT CCTCCTACGA TCGGTCTGTC GCGTCCGCCG GGCAGTCCCC TTCCATCGCG CCGGCGCAGC TCCTGGCGAT GCTCAGGCAC CGCCGTCGGC TCATCGCCCT TTCGATGATC CTCGGCACCC TGGGCGGCGT GCTCGTCGTC GCGCGGACGG AACCGAACTA CGTCGCCACC GCCCAGGTCA TCATCGATCC GCGCGCCTTG CGCGTGGTCG AGCGGGAGGT GACGCCGAGC ATCGATAACG CCGATGCGCA GATCGCATCG GTCGAGAACG AGATGCGCGT CCTGCGGTCG AGCACGGTGC TCAACGCCCT GATCACCCGC GACCGGCTCG ACGAGGATCC CGAATTCGCC GGCGAGCCGC CCGGCCTTCT GCGCTGGGCG AAGGCCGCGA TCCTCGACCG CGTCGGCCTG TCGCCGCCCC GCTCGCCCGA TCCGCGCCGC GCGGCGCTCC AGACCCTGGA GCGCAAGATC GCCGTCCGCC GGGCCGAGCG CAGCTTCGTC GTCGAGGTGC ATGTCGCAAC CCGTGATCCC GACAAGTCGG CCCGGATCGC GAATGAACTC GTGACGCTCT ACACCGAGCA GGCGAACCGG ACCCGGTCCG AGCTCGCGCG CCGCTCCGGC GCCTCCCTCG ACGATCGCCT CGCCGAATTG CGCTCCGCCG TGCGCGCCGC GGAGAACCGG GTCGCGACCT TCCGGAGCGA GCACGACCTC GTCAGCGCCG ACGGCTCCCT CACCAGGGAC CGGCGCCTGC GCGACCTCAA CACGCAGCTC GCCGCCGCCC GGACCCGGTC CACCGAGGCC CTGGTCCGGA TGGAGCAGGC GAGCGCCCTG CGCGGCCGGC TCGACGGCCT GTCGGAGGCC GTCCAGTCTC AGGCCATGAT CCAGTTGCGC TACCAGATCA CCGAGGCCCG TCGCCGCCGG GCCAACCTCG CCAACGTGCT GGGACCGCGC CATCCCGAGC TGAACTCGGT CGGCCGCGAG ATCGATGCCC TTCAGGATCA GCTCGGCCAG GAGCTGCAGC GGATCGGCGA CGCGGCACGG AACGATTATC GCCGAGCCAA GCAGACCGAG GAGGAGCTGC GTAGGACGGT GGAGACGATG TCCACGGCCT CGCTCGCCGA CGACCGCGCG CTCGCCGAGC TGCGCGCGCG GGAGGCGGAG GCAGAGGCCC AGCGCAAGCT GTTCGCCCAG TATCTCGTGC GCTCCCGCGA ACTCATCGAG CAGACGCAGG TGGACGTGAA CAACATCCGC GTGATCGGCG CGGCGATCCC GCCCGAGCTG CCGACGAACC TGAGCAAGGC CATCGTCCTC GTGCTCGGAA CGCTGATCGG CCTAGCGCTC GGCATCCTGG CGGCGATCAC CCTCGGCGTC CTGCGCGGCG AGGCGGCGTC CCTGCCGCGC TACCCGGTCG GGTCGGGCGA AGCGGTGATC GCCTGA
|
Protein sequence | MTTYTATRGA IPLSSYDRSV ASAGQSPSIA PAQLLAMLRH RRRLIALSMI LGTLGGVLVV ARTEPNYVAT AQVIIDPRAL RVVEREVTPS IDNADAQIAS VENEMRVLRS STVLNALITR DRLDEDPEFA GEPPGLLRWA KAAILDRVGL SPPRSPDPRR AALQTLERKI AVRRAERSFV VEVHVATRDP DKSARIANEL VTLYTEQANR TRSELARRSG ASLDDRLAEL RSAVRAAENR VATFRSEHDL VSADGSLTRD RRLRDLNTQL AAARTRSTEA LVRMEQASAL RGRLDGLSEA VQSQAMIQLR YQITEARRRR ANLANVLGPR HPELNSVGRE IDALQDQLGQ ELQRIGDAAR NDYRRAKQTE EELRRTVETM STASLADDRA LAELRAREAE AEAQRKLFAQ YLVRSRELIE QTQVDVNNIR VIGAAIPPEL PTNLSKAIVL VLGTLIGLAL GILAAITLGV LRGEAASLPR YPVGSGEAVI A
|
| |