Gene Mext_3454 details


Gene Information

Locus tag: Mext_3454
Symbol:
ID: 5832089
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 3832393
End bp: 3833589
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 68%
IMG OID: 641369253
Product: putative glucosyltransferase
Protein accession: YP_001640911
Protein GI: 163852868
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID: [TIGR03472] hopanoid biosynthesis associated glycosyl transferase protein HpnI
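
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3832393, 3833589)
print(length)                     # 1197
print(protein_length_aa(length))  # 398
```

Under these assumptions both values agree with the Gene Length (1197 bp) and Protein Length (398 aa) reported in the record.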


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.250583
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.104768
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCTGA ACGAGCTGCC GGCCTGGATC GCATCTGTGC TACTGCTGAT GGCGCTCGCC 
GGCTGCGTCT ATGCCCTGCT CGCGGCCTGG CTCGTCAACC GTTTCGCCGC GCGGCCGTCG
CCCGCGCTCG CGGCCGATGC GCCGCGCCCC GGCGTGACGA TCCTCAAGCC CCTCTGCGGC
CTGGAGCCGG ACCTCTTCGA GAACCTCGGA AGCTTCTGCC GCCAGGATTA TGCCGGCCCG
GTGCAGATCG TGTTCGGCGT CCAGAACGCG GCCGACCCGG CGATTGCCGT GGTGCAGCGC
CTGCGCGAAG CCCATCCCGC CCTGCGCCTC GACCTCGTGG TGGATCCGAG CCAGCACGGC
TCGAACCGCA AGGTCTCCAA CCTCATCAAC ATGTCGGAGA AGATCGCCCA CGCCGTCGTG
GTGCTGGCCG ACAGCGACAT GTCGGTGAAG CCCGATTATC TCGAGCGCGT CGCCGCCGCC
CTGTCGCAGC CCGGCATTTC CGGCGTGACC TGCCTCTATC ACGGCGTGCC GGGCGACCGG
GGCCTGTGCG CCCAACTCGC GGCGCTCGCC ATCGACGTGC AGTTCGTGCC CAACGTCATC
CTCGGCACCA CCTTCGATCT CGCCCGGCCC TGCTTCGGCT CGACCATCGC GATGACGGCC
GAATCGCTGG CCCGCATCGG CGGCTTCCGC GCGTTCAAGG ATGATCTGGC CGACGATTAC
GCGATCGGCG AAGCGCTGCG CGCCGAGGGC GGCACGGTGG CGATTCCCGC CCTCACCATC
GGGCATGCCT GCGTCGATAC CGAGCTGTCG GGCCTGTGGC GGCACGAGCT GCGCTGGAAC
CGCACCATCC GCAACGTCGA TCCGAAGGGC TATGCCGGAT CGGTCGTGAC CCACGCCTTT
CCGCTGGCGC TGCTCGCCGC ACTGATGCCC GGCGCCGGCT CCGGCGCGCT CGCGGTCGCC
GCCCTGGCCC TTACCTGCCG CATCCTGCTG TGCCTGCGCA TCGAGCGGGC CTTCGGGCTC
TCCCCCCACG CCTACTGGCT GTTGCCGATA CGTGACATGC TGTCCTTCAT CAACTTCACC
TGGAGCTTCG TCTCGGGTGC GGTGACATGG AAAGGTCACG ATTACCGTGT GGTTGCGGAC
GGTACGCTGA TTCCGGAGCA CGGCCTCGGT CGCGAGTCGC GCGCGACTTC GGTCTAA
 
Protein sequence
MDLNELPAWI ASVLLLMALA GCVYALLAAW LVNRFAARPS PALAADAPRP GVTILKPLCG 
LEPDLFENLG SFCRQDYAGP VQIVFGVQNA ADPAIAVVQR LREAHPALRL DLVVDPSQHG
SNRKVSNLIN MSEKIAHAVV VLADSDMSVK PDYLERVAAA LSQPGISGVT CLYHGVPGDR
GLCAQLAALA IDVQFVPNVI LGTTFDLARP CFGSTIAMTA ESLARIGGFR AFKDDLADDY
AIGEALRAEG GTVAIPALTI GHACVDTELS GLWRHELRWN RTIRNVDPKG YAGSVVTHAF
PLALLAALMP GAGSGALAVA ALALTCRILL CLRIERAFGL SPHAYWLLPI RDMLSFINFT
WSFVSGAVTW KGHDYRVVAD GTLIPEHGLG RESRATSV
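
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they can be compared. The sketch below shows one way to do that, to compute a GC fraction, and to translate codons. It is a minimal illustration, not the database's own pipeline: the helper names are hypothetical, only the first 30 bases of the gene sequence are embedded, and the codon assignments used are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; stop codons are shown as '*'."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence, copied from the record above.
GENE_PREFIX = "ATGGATCTGA ACGAGCTGCC GGCCTGGATC"

dna = clean_sequence(GENE_PREFIX)
print(translate(dna))  # MDLNELPAWI
print(gc_content(dna))
```

The translated prefix matches the first ten residues of the protein sequence. Pasting the full gene sequence into `GENE_PREFIX` would allow the same check for the whole protein and for the reported GC content.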