Gene M446_5389 details


Gene Information

Locus tag: M446_5389
Symbol:
ID: 6133485
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 5920869
End bp: 5924666
Gene Length: 3798 bp
Protein Length: 1265 aa
Translation table: 11
GC content: 76%
IMG OID: 641645523
Product: glycosyl transferase group 1
Protein accession: YP_001772139
Protein GI: 170743484
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
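
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. Neither convention is stated in the record, and the variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 5920869
end_bp = 5924666
gene_length_bp = 3798
protein_length_aa = 1265

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one stop codon, and the gene is
# unspliced (the record says "Is gene spliced: No").
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(span_bp, codon_count, expected_protein_length)
```

Under these assumptions the span equals the reported gene length, and the codon count minus the stop codon equals the reported protein length.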


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.604421
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.386182
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCGCC GGCGCCTCAC CCGCGCCGGC CCCACGGACG TCCTCGCAGG GGTCAGCCAC 
GTGCCGAAGC GCCGGATCAA GCTCCTCGCC AAGCTCGGCC GGAAGATCCG GTCGGTGTCC
TGGCGGTTCG GGAGGGACGC CTTCGACCCC GACTTCTACG CGCAGCTCTA TCCGGACCTC
TCCGGCCTGA ACCGGCACGC CCTCCGGGAG CACTACGTCC GGCACGGCCG CGCCGAGGGC
CGCTTCCCGA CGCGCGACGC CTACCTGGCG GACCTGACGG CGCGCCACGG CGCGCTGCCG
GAGGATTTCA ACCCCGGCCT CTACGGCGAC CTGAATCCCG ACCTCGCCCG GCGCTTCACG
CGGGACTGGC AGTTCACCGC GCATTTCCTC AAGCACGGGC GTGAGGAGAA GCGGCTCTAC
GCCCTCGACA TCGCGGCGGA GGCCGAGACC TTCGCGCGGC TCGCCGGGGC CGAGCCGCGC
GAGGGCTTGG CGCTGTTCAC GGGCTTCCTC GCCGCGCACG GCCTGGCGCC GGGCCGCTGG
CTGCGGCGCT TCGACCTTGC CCAGTTCATC CACCTGAACG CCGACTGGCT CCCCGCGCCG
CCGCGGACCC GGGCGGAGGG CATCCGCCTC TTCGTCGAGC AGGGCATCGA CCGCCTCGCC
CCGATCCGCA CCGACGCGAT CTTCGATCCG GCCTTCTACC GGGCGACCTA CCGGGCGGAG
GGAGCGGCCG ACGACGCGGC GCTCTACCGG CACTGGCTGC GGCAGGGCAT CGAGCAGGAC
TTCGCGCCGA ACGAGGCGGC CCGCATCGCC CTGCTGATCG GCGAGAGCGC CTTCCCGGCC
GCCTTCGCGG TGGAGACCTA TCGGGCGTCC CTGCCCCGCG GCGCGATGAG CCGCACCGAC
CTCTTGGAGC ACTGGCTCGA CCACGGCTTC CCGGAGGGCC GCCTGGACGT GGTGACGGGG
CCGGAGGCCG CGGCCTTCCT CACCCTGGTG GGCCTGCGCT GCCTGCTGCG CCAGCGCTTC
GCGCTCGCCC GCCGGGCCTA CGACGAGGCG ATCGCGCGCG GCGGCCCGAC CGCCCGCCGG
CTGCACGGCC GCGGCGAGGC CGCGCGGGCC CTGGGCGACC TCGCTTCCGC GGCCGCCGAT
TACGCGGCCG CCGCGGCCGA CCCGCGGTCG AGCCTCTGGA GCCACATCCA CGCCGCCGAG
TGCCTGGCCG CCACCGGAGA TCTGCCGGCC GCCTGCGCGC AGGTCCGGGC GTCGTCGCCG
CGCTGGATCA AGGATGCGCG CTGGCGCGAG GCGGGGCTCG GCATCCTCGG CCGGGCCTTC
GACGCCGCCT GCGAGGCCGC CCGCGCGGAG TACCGCCAGG GCCGCCGCGC CGAGGCCGAC
GCCCTCCTGG ACGAGGCCCT GAACCGGCTC GCCGCCGACC TCCCCCTCGC CGCTCCCTTG
CCCGTCCCCT TGCCCGTCCC CTCGCCCGTC CCCTCGCCGC GGCCGACCGT CGCCATCCTG
GCCAACCTCG ACCTGCCCCA GTGCGTCCAC TACCGCGTCG AGCAGCGCCG CCGCCAGCTC
GAGCACGCCG GATGGACCGT CCGCGTCTTC CCCCCCGACC AGGCCCGGGC CTTCCGCCAG
GCCCTGCACA CCGCCCGCGC CGCCCTCTTC TACCGCCTCC CCGCCTTCCC CGACATCCTC
CACAGCATCC TCTACGCCCG CGCCCTCGGC CTGCCCACCT TCTACGACGT CGACGACCTC
ATCTTCGACG CGCGCTGCTA CCCCGACCCC TTCGACAGCT TCGAGGGCCA GATCTCCCCC
GAGGACTACG TCGGGCTGCA GTTCGGCGTG CCCCTGTTCC GCTTCGCCCT CGCCCAGTGC
GACGAGGGCC TCGCCTCCAC CCCGGCTCTC GCCGAGGCCA TGCGCCCGCT CCTGCGCACG
GGCCGCTGCC ACGTCCTGCG CAACGGCCTC GACCAGCGCA ACGCCCCCTT CCTCGACCGG
CCGGACCCCG CCCCCGCCCG CCCGGAGGCG CCCGTGACGC TGTTCTACGG CTCGGGCACC
AAGGCGCATA ACCGCGACTT CAACGCCCTG GCCGGCCCGG CCCTGCTGCG CCTGCTGGAG
CGCCACCCGC AGCTGCGCCT GCTGATCGCC GGCCACCTCA CCCTCGATCC CGCCTTCGCG
GCCCACCGGG ACCGGATCCG CCGCCTCGGC TTCACCGCGC AGGTGGAGGA TTACTGGGAG
ATCCTGTCGG GGGTCGACGT GAACCTGGCG GTGCTCGCCC CCGGCGCGGC GGCGGACGCC
AAGAGCGAGA TCAAGTGGCT GGAGGCGGCG GTGGCCGGGG TGCCCTCGGT GGTGAGCGGG
ACGCGCACCT ACCGGGAGAT CCTGGTCGAG GGCGAGGACG TGCTGTTTGC CGACACGCCC
GAGGAGTGGG CCTCGGCGCT GGAGCGGCTG GTGGGGGATG CCGGGCTGCG GCGGCGGATC
GGCCTGGCGG CGCGGCGCAA GGCGCGGCGG GATTACGGGC TGGCGGCGGG GGCGGGGCGG
GTGGCGTCCG TGCTGGCGCG GCCGGCGGCG CCCGCCGTGG CGGCGGCGCG GCCGCGGATC
CTGCTGGTGA ACGTGTTCTT CCCGCCCCAG ACGATCGGCG GGGCGACGCG GGTGGTGCGC
GACAACCTCG ACCACTTCCT GGCGGCGGCG GGGGAGCGCT ACGCCTTCGC GGTGGCGGCG
AGCGACGAGG GGGTGCAGCC GGCGGGCCGG GCGCGGCTGG ACGGCTACCG GGGGGTTCCG
GTGCTGCGGC TGTCGACGCC GCTGGAGCCC GGGATGGACT GGCGCCCGTT CAACCCGGCG
CTGGGGGCGC TGTTCGGGGA GTTCCTGGAC CGGCTGTGCC CGGACCTGGT GCACTTCCAC
TGCGTGCAGC GGCTGAGCGG GTCGGTGGTG GAGGCGGCGC AGGCGCGGGG CCTGCGGCAC
GTGGTGACGG TGCACGACGG CTGGTGGATC TCGGACCACC AGTTCCTGCT CGACCGGGAC
GGGCAGGTGG TGCGTCCCTC GCCGGACCTG CTGGAGGCGG CCTCGGACGC GCCGCACGGG
GCGGGGGCGC AGCTCCTGCG GCGGCGGCGG CTGGGGCGGC TTCTGTGTGG GGCGGACCGG
GTGCTGGCGG TGTCGGAGAG CTTCGCGGGC CTGTACCGGG CGGCGGGGTT CCCGGAGGTG
CGGAGCGTGC CGAACGGTCT GTCGCTGGGG CGGGTACCGG CGCGGCGGGC GGGGCGGGGT
CCGCGGGTGC GGCTGGGCCA CGTGGGGGGT CTGGAGGCGC ACAAGGGGGC GCCGCTGCTT
GAGGTGGTGC TGCGCACGAC GCCGTTCCGG TGCCTGGGCC TGACGCTGGT CGACCTGTCG
CGGGAGCCGG GCTCCGCGAG CGAGGAGGTG TGGGGGACGA CGCCGGTGCG GATCGTGGGC
CCCGTGGCGC AGGAGGAGAT CGGGGGGTTG TACGGGGAGC TTGACGTGCT GCTGGTGCCG
TCGCTGTGGC CGGAGAGCTT CGGTCTCGTG TCGCGGGAGG CGCGGGCGTT CGGTCTGTGG
GTCGTGGCGA GCGACCGGGG CGCGGTGGGG GAGGAGGTGC GGCCGGGGGT GGACGGGTTC
GTGATCGACG TCTCGACGCC GGGCCCGCTG CGGGAGGTGC TGGGGGCGAT CGACGCGGAT
CCGGCGCGGT TCCAGGCGCC GCCGCCGCCC GGCCCGCCGC TGCGCCGTGC GGCCGACCAG
GGCGACGACC TGCTGCGGCT CTACGACGAG ATCCTGGGTA CGAAGCGGCC GGCCGCGGGG
CGCCCGGCGG CCGAATAG
 
Protein sequence
MMRRRLTRAG PTDVLAGVSH VPKRRIKLLA KLGRKIRSVS WRFGRDAFDP DFYAQLYPDL 
SGLNRHALRE HYVRHGRAEG RFPTRDAYLA DLTARHGALP EDFNPGLYGD LNPDLARRFT
RDWQFTAHFL KHGREEKRLY ALDIAAEAET FARLAGAEPR EGLALFTGFL AAHGLAPGRW
LRRFDLAQFI HLNADWLPAP PRTRAEGIRL FVEQGIDRLA PIRTDAIFDP AFYRATYRAE
GAADDAALYR HWLRQGIEQD FAPNEAARIA LLIGESAFPA AFAVETYRAS LPRGAMSRTD
LLEHWLDHGF PEGRLDVVTG PEAAAFLTLV GLRCLLRQRF ALARRAYDEA IARGGPTARR
LHGRGEAARA LGDLASAAAD YAAAAADPRS SLWSHIHAAE CLAATGDLPA ACAQVRASSP
RWIKDARWRE AGLGILGRAF DAACEAARAE YRQGRRAEAD ALLDEALNRL AADLPLAAPL
PVPLPVPSPV PSPRPTVAIL ANLDLPQCVH YRVEQRRRQL EHAGWTVRVF PPDQARAFRQ
ALHTARAALF YRLPAFPDIL HSILYARALG LPTFYDVDDL IFDARCYPDP FDSFEGQISP
EDYVGLQFGV PLFRFALAQC DEGLASTPAL AEAMRPLLRT GRCHVLRNGL DQRNAPFLDR
PDPAPARPEA PVTLFYGSGT KAHNRDFNAL AGPALLRLLE RHPQLRLLIA GHLTLDPAFA
AHRDRIRRLG FTAQVEDYWE ILSGVDVNLA VLAPGAAADA KSEIKWLEAA VAGVPSVVSG
TRTYREILVE GEDVLFADTP EEWASALERL VGDAGLRRRI GLAARRKARR DYGLAAGAGR
VASVLARPAA PAVAAARPRI LLVNVFFPPQ TIGGATRVVR DNLDHFLAAA GERYAFAVAA
SDEGVQPAGR ARLDGYRGVP VLRLSTPLEP GMDWRPFNPA LGALFGEFLD RLCPDLVHFH
CVQRLSGSVV EAAQARGLRH VVTVHDGWWI SDHQFLLDRD GQVVRPSPDL LEAASDAPHG
AGAQLLRRRR LGRLLCGADR VLAVSESFAG LYRAAGFPEV RSVPNGLSLG RVPARRAGRG
PRVRLGHVGG LEAHKGAPLL EVVLRTTPFR CLGLTLVDLS REPGSASEEV WGTTPVRIVG
PVAQEEIGGL YGELDVLLVP SLWPESFGLV SREARAFGLW VVASDRGAVG EEVRPGVDGF
VIDVSTPGPL REVLGAIDAD PARFQAPPPP GPPLRRAADQ GDDLLRLYDE ILGTKRPAAG
RPAAE
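
The gene and protein sequences above are related by translation table 11. The following is a minimal sketch of how the two could be compared, not the database's own method. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the codon table covers elongation codons only, which are the same in table 11 as in the standard code. Alternative start codons allowed by table 11 are not handled, which is adequate here because this gene starts with ATG.

```python
# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 18 bases of the gene sequence, copied from the record.
gene_start = "ATGATGCGCC GGCGCCTCAC CCGCGCCGGC"
gene_end = "CGCCCGGCGG CCGAATAG"

print(translate(gene_start))
print(translate(gene_end))
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the listed protein sequence and the reported GC content.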