Gene M446_3307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3307 
Symbol 
ID6133975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3663975 
End bp3665609 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content73% 
IMG OID641643494 
Productglycosyl transferase family protein 
Protein accessionYP_001770146 
Protein GI170741491 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.432845 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGCGC CGATTACCAC CCCGTCGAAT GAGAAGTCGA TCCAGCTCAT CGCCGGGAGC 
GACGTCACGG TCGTCATCAC CACCTACAAC CACGCCCACT TCTTGGGCGA GGCACTGGCG
AGCGTCGCCG CGCAGACCAG GGTGCCGGAG GAGGTGATCG TCGTCGACGA CGGGTCGAGC
GACGATCCGG CCGCGATCGT GGCCCGGTGG CCGGGGGTGC GGCTCATCCG TCAGCCCAAT
CAGGGCCTCT CCTCCGCCCG CAACACCGGG CTCGCGCAGG CCCGCACGAC CTACATCCTG
TTCCTGGACG CCGACGACAT GCTGCGCCCC GCCGCGATCG CGCAGGGCCT CGCCGCGTTC
GCCCGCTCGC CGGGCGTCGC CTTCGTGTAC GGTGCCCACG AGAGGGTGGA CCGCGATGGG
AATCCCCTCG GAGGAGCCGT CTACCACGCG ATGGCCGGTG ATCCCGTCGC CGCGCTGCTG
CACCACAACA TCGTCGGCAT GCACGCGACG GTCCTGTACC GCCGCGACAT CCTCTCGGCC
GCGGGCGGTT TCGACACCCG GCTGAGGCGA TGCGAGGATT ACGACGTCTA CCTGCGCCTC
GCGCAGGCCC ACGCGATCGC GTCGCACCCG ACGGTCGTCG CCCGCTATCG GTGGCACGAC
AGCAACATGT CCTACGACTT CCGCGCGATG CGGGACGCGG CCCTCGCCGT CCACGCGCGG
TACCGGCCCG CGGCGGAGGC CGACTCGCTG CGCAGGGAGA CCTGGTCCGA GGGGCGCCGC
TTCTGGCGCC GGTACTACGC GGGCGAGGCG CTGCGGGCCG GCGGACGCAT CTCGCCGCGC
GGCGTGGCGG CCGCCGCGCG TCTCGACCCG CTGTGGACCG CGGGCGAGCT GGCGCGCCGG
GCGGCGATCC GGCTCGGCGG TGCCGTCTCC CGCCGCGCGG GCTATCGGCT GCGCCGCCGC
CTCGTCGGTC GCGCCTCGCC CCCGGTCGGC CGGGTTGATT GGGGTGATTT CGGAACGCCC
TGGCCGGTGA GCGAGGATTT CGGCTGGGAT CGCGGGCTCC CGGTGGACAG GTTCTACATC
GAGCGATTTC TGGAGGGGGC GGCCGCCGAC ATCCGCGGCC GCGTGCTCGA GATCGGCGAC
GACGCCTACT CACGGCGCTT CGGCGGCGCG CGGGTCGAGC GGCAGGACAT CCTCCACGTC
GAGGCCGGCA ATCCCAGGGC GACGCTCATC GGCGACATCA GCCGGGCGGG CACGCTGCCG
CAGGCCGCCT TCGATTGCAT CGTCCTCACC CAGACGCTGC ACCTCATCTT CGACATGGCG
AGCGCCCTGC GCCAGCTGCA CGGCGCCCTC AGGCCCGGCG GCGTGCTCCT GCTGACCGTG
CCCGGCATCA GCCCGATCGA CCGCGGCGAG TGGTCCGGGA CCTGGTTCTG GTCCCTGACC
CCCGCCGCGC TGACGCGCCT CCTCGACGAG GTGTTCGGCG CCGGCGCGGC GACGGTCCGG
AGCCACGGCA ACGTCTACGC CGCGACGGCC TTCCTCCAGG GGCTCGCCCT CTCGGAGGTG
GATCCGACGA AGCTCGACGT CGCGGACGCA TCCTACCCGG TGATCGTGGC CGCCCGGGCG
GCGCGGAGGC CGTGA
 
Protein sequence
MEAPITTPSN EKSIQLIAGS DVTVVITTYN HAHFLGEALA SVAAQTRVPE EVIVVDDGSS 
DDPAAIVARW PGVRLIRQPN QGLSSARNTG LAQARTTYIL FLDADDMLRP AAIAQGLAAF
ARSPGVAFVY GAHERVDRDG NPLGGAVYHA MAGDPVAALL HHNIVGMHAT VLYRRDILSA
AGGFDTRLRR CEDYDVYLRL AQAHAIASHP TVVARYRWHD SNMSYDFRAM RDAALAVHAR
YRPAAEADSL RRETWSEGRR FWRRYYAGEA LRAGGRISPR GVAAAARLDP LWTAGELARR
AAIRLGGAVS RRAGYRLRRR LVGRASPPVG RVDWGDFGTP WPVSEDFGWD RGLPVDRFYI
ERFLEGAAAD IRGRVLEIGD DAYSRRFGGA RVERQDILHV EAGNPRATLI GDISRAGTLP
QAAFDCIVLT QTLHLIFDMA SALRQLHGAL RPGGVLLLTV PGISPIDRGE WSGTWFWSLT
PAALTRLLDE VFGAGAATVR SHGNVYAATA FLQGLALSEV DPTKLDVADA SYPVIVAARA
ARRP