Gene Information |
Locus tag | M446_6338 |
Symbol | |
ID | 6135055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 6961171 |
End bp | 6962358 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641646433 |
Product | hopanoid biosynthesis associated glycosyl transferase protein HpnI |
Protein accession | YP_001773037 |
Protein GI | 170744382 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | [TIGR03472] hopanoid biosynthesis associated glycosyl transferase protein HpnI |
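The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(6961171, 6962358)
print(length)                     # 1188
print(protein_length_aa(length))  # 395
```

Under these assumptions both results agree with the Gene Length and Protein Length rows.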
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0080021 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGGACTCCA CCTGGATCTC CACCCTGCTC CTCACCCTGG CGGCGGCGGG CTGCGCCTAC GCCCTCGCGG CGGCGTGGCT CGCTGGCCGC GCGGCGGGGC GCCCGACGCC GACGCTCCCG GCCGGCGCGG CCCGGCCCTC GGTCACGCTG ATGAAGCCGC TCTGCGGGGA CGAGCCGAAC CTCTACGAGA ACCTGACCAG CTTCTGCCGC CAGGATTACG CCGGCCCGGT CCAGATCATC TTCGGCGTGC AGAGCGCCGC CGACCCGGCC CTGGCGATGG TCGCCCGGCT CAAGGCCGAG CACCCGGACC TGCGCATCGA CCTCGCGCTC GACGCCCGCC AGCACGGTTC GAACCGCAAG GTGTCGAACC TGATCAACAT GGCCGGGCTG ATCGCCCACG AGGTCGTGGT GCTGGCCGAC AGCGACATGG TGGTGCGCCC GGACTACCTG GAGCGCATCG TCGCGGAGCT CGGCCGGCCG GGCGTGGCGG CGGTGACCTG CCTGTATCAC GGCGTGCCGG CCGAGCGGAG CGTCTGGGCG CAGCTCTCGA CGCTGGCCAT CGACACGCAG TTCCTGCCGA ACGTGCTCGT CGGCACCGGC CTCTCCCTGG CCGAGCCCTG CTTCGGCTCC ACCATCGCGT TCCGGGCCGA GGCGCTCGCG GCGATCGGCG GCTTCGAGCG GGTGAAGGAC GACCTCGCCG ACGATTACGC GCTCGGCGCG GCCCTGCGCG GGGCCGGGGG CGGGATCGTC GCCATCCCGA ACTTCACCAT CGGGCATACC TGCGTCGACA CCTCGCTCTC GGACCTGTGG CGCCACGAGA CGCGCTGGAA CCGGACCATC CGCAACGTCG ACCCGGCCGG CTACGCGGGC AGCCTCGTCA CCCACGCCTT CCCGCTCGCC CTGATCGGCG CGCTGATGCC CAACACCAGC CCGCAGGGCC TCGCCATCGC CGCCCTGGCG CTCACCTGCC GCATCGTCCT GTGCCTGCGG CTGGAGCGGG CCTTCGGCCT CGACCCGCAT CCCTACTGGC TGCTGCCGAT CCGCGACCTG GTCTCGTTCG CCGGCTTCGT GTGGTGCTTC GCCTCCGGCG CCGTGACTTG GAAAGGTCAC GATTATCGCG TTGTGGCTGA CGGCACGCTC ATCCCCGAGC CGGGCCTCGC GCAGGAGACC GGCGCCCCCA CGACCTGA
Protein sequence | MDSTWISTLL LTLAAAGCAY ALAAAWLAGR AAGRPTPTLP AGAARPSVTL MKPLCGDEPN LYENLTSFCR QDYAGPVQII FGVQSAADPA LAMVARLKAE HPDLRIDLAL DARQHGSNRK VSNLINMAGL IAHEVVVLAD SDMVVRPDYL ERIVAELGRP GVAAVTCLYH GVPAERSVWA QLSTLAIDTQ FLPNVLVGTG LSLAEPCFGS TIAFRAEALA AIGGFERVKD DLADDYALGA ALRGAGGGIV AIPNFTIGHT CVDTSLSDLW RHETRWNRTI RNVDPAGYAG SLVTHAFPLA LIGALMPNTS PQGLAIAALA LTCRIVLCLR LERAFGLDPH PYWLLPIRDL VSFAGFVWCF ASGAVTWKGH DYRVVADGTL IPEPGLAQET GAPTT
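The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how a GC percentage like the one in the GC content row might be recomputed, together with a rough completeness check for a coding sequence. The helper names are hypothetical. The check assumes an ATG start codon, which is a simplification because translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw):
    """Join the ten-character blocks by removing all whitespace."""
    return "".join(raw.split()).upper()


def gc_percent(raw):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(raw):
    """Rough check: length divisible by 3, ATG start (assumed), stop codon at the end."""
    seq = clean_sequence(raw)
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# The first two blocks of the gene sequence, used only as a small demonstration.
print(gc_percent("ATGGACTCCA CCTGGATCTC"))  # 55.0
```

Passing the full Gene sequence value to `gc_percent` gives a figure that can be compared with the GC content row.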