Gene M446_6338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_6338 
Symbol 
ID6135055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp6961171 
End bp6962358 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content72% 
IMG OID641646433 
Producthopanoid biosynthesis associated glycosyl transferase protein HpnI 
Protein accessionYP_001773037 
Protein GI170744382 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID[TIGR03472] hopanoid biosynthesis associated glycosyl transferase protein HpnI 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0080021 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACTCCA CCTGGATCTC CACCCTGCTC CTCACCCTGG CGGCGGCGGG CTGCGCCTAC 
GCCCTCGCGG CGGCGTGGCT CGCTGGCCGC GCGGCGGGGC GCCCGACGCC GACGCTCCCG
GCCGGCGCGG CCCGGCCCTC GGTCACGCTG ATGAAGCCGC TCTGCGGGGA CGAGCCGAAC
CTCTACGAGA ACCTGACCAG CTTCTGCCGC CAGGATTACG CCGGCCCGGT CCAGATCATC
TTCGGCGTGC AGAGCGCCGC CGACCCGGCC CTGGCGATGG TCGCCCGGCT CAAGGCCGAG
CACCCGGACC TGCGCATCGA CCTCGCGCTC GACGCCCGCC AGCACGGTTC GAACCGCAAG
GTGTCGAACC TGATCAACAT GGCCGGGCTG ATCGCCCACG AGGTCGTGGT GCTGGCCGAC
AGCGACATGG TGGTGCGCCC GGACTACCTG GAGCGCATCG TCGCGGAGCT CGGCCGGCCG
GGCGTGGCGG CGGTGACCTG CCTGTATCAC GGCGTGCCGG CCGAGCGGAG CGTCTGGGCG
CAGCTCTCGA CGCTGGCCAT CGACACGCAG TTCCTGCCGA ACGTGCTCGT CGGCACCGGC
CTCTCCCTGG CCGAGCCCTG CTTCGGCTCC ACCATCGCGT TCCGGGCCGA GGCGCTCGCG
GCGATCGGCG GCTTCGAGCG GGTGAAGGAC GACCTCGCCG ACGATTACGC GCTCGGCGCG
GCCCTGCGCG GGGCCGGGGG CGGGATCGTC GCCATCCCGA ACTTCACCAT CGGGCATACC
TGCGTCGACA CCTCGCTCTC GGACCTGTGG CGCCACGAGA CGCGCTGGAA CCGGACCATC
CGCAACGTCG ACCCGGCCGG CTACGCGGGC AGCCTCGTCA CCCACGCCTT CCCGCTCGCC
CTGATCGGCG CGCTGATGCC CAACACCAGC CCGCAGGGCC TCGCCATCGC CGCCCTGGCG
CTCACCTGCC GCATCGTCCT GTGCCTGCGG CTGGAGCGGG CCTTCGGCCT CGACCCGCAT
CCCTACTGGC TGCTGCCGAT CCGCGACCTG GTCTCGTTCG CCGGCTTCGT GTGGTGCTTC
GCCTCCGGCG CCGTGACTTG GAAAGGTCAC GATTATCGCG TTGTGGCTGA CGGCACGCTC
ATCCCCGAGC CGGGCCTCGC GCAGGAGACC GGCGCCCCCA CGACCTGA
 
Protein sequence
MDSTWISTLL LTLAAAGCAY ALAAAWLAGR AAGRPTPTLP AGAARPSVTL MKPLCGDEPN 
LYENLTSFCR QDYAGPVQII FGVQSAADPA LAMVARLKAE HPDLRIDLAL DARQHGSNRK
VSNLINMAGL IAHEVVVLAD SDMVVRPDYL ERIVAELGRP GVAAVTCLYH GVPAERSVWA
QLSTLAIDTQ FLPNVLVGTG LSLAEPCFGS TIAFRAEALA AIGGFERVKD DLADDYALGA
ALRGAGGGIV AIPNFTIGHT CVDTSLSDLW RHETRWNRTI RNVDPAGYAG SLVTHAFPLA
LIGALMPNTS PQGLAIAALA LTCRIVLCLR LERAFGLDPH PYWLLPIRDL VSFAGFVWCF
ASGAVTWKGH DYRVVADGTL IPEPGLAQET GAPTT