Gene M446_6257 details


Gene Information

Locus tag: M446_6257
Symbol:
ID: 6135762
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 6876121
End bp: 6879084
Gene Length: 2964 bp
Protein Length: 987 aa
Translation table: 11
GC content: 72%
IMG OID: 641646352
Product: glycosyl transferase family protein
Protein accession: YP_001772957
Protein GI: 170744302
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
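
The coordinates and lengths listed above are mutually consistent. The short check below is only a sketch: it assumes the start and end positions are 1-based and inclusive (the record does not say so, but the listed gene length agrees with that reading) and that the final codon is a stop codon that adds no residue. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 6876121
end_bp = 6879084

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")    # 2964 bp
print(protein_length, "aa") # 987 aa
```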


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0286803
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0257469
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAGGT GGTTGCGCTC GCCGAGAAAA CTGTCGGTCC GCGTGCATTT CGACGAGCGA 
TTCTATCGGC TCCGCTACCC CGATGTCGGG GTGGCGGGGG CCGATCCCTT CGCGCACTAC
ATGTCCTTCG GCTGGAAGGA GGGGCGGGAC CCGTCGCCCT CCTTCCCGAC CCTCCTCTAC
AAGGACAAGC ATCTCGGCCC GACGCCGCGG ACCAACCCGC TCGCCCATTA CGCGGCGCGG
CCCGAGGCGG AGCGGCAGCG GGCGGTCGCG TTCTCGGCCG AGGAGGCGGT CGCGATCCAG
GCCCGCGTCG TCCGGGACCA TTTCGACGAG GCCTTCTACA GGGACGCGGC GCGCCTCGAT
GCCGGGGCGG ACGCCCTCAC GCATTATCTC ACGGTCGGAT GGCGGGAGGG GTACGAACCG
GCGCCCGGCT TCTCGTCGGC CGCGCATGCG GGCAAGCACC GCCACATCGC GGCGACGGGC
CTGTGCCCCT TCTACCACTT CGTGTCCACC TACGGCCTGC TCGATCACCC GGAGCCGCTC
GAGGCCGCCC TGGCCTGCGG CGCGAGCGGA GGGAGGGCAC GGGTGTCCCG CCCCGTCGTC
CTCAGCACCA TCGCGCAGGA ATTCGACCGC GCCTCCTACC TGACCCGGTA CGCGGATGTG
CGGCAATCGG GCATCGACCC GGTCGAGCAC TACGTGGATT TCGGCTGGAA GGAGGGCCGC
GATCCCAGCG AGTGGTTCTG GACGGGCTTT TATCGGGAGG AGCAGGCCCC GCACCTCGGC
GACGCGGTGA ACCCCTTCTA CCACTACCTG ACGGAGGGGC GGCCGGCGGG GTTGCTGCCG
AACCCCTTCG GCTGCGGCGA ATGGCCGCCT CTGGAGGCGC CCCGCGCGGA TGAGTGGGAT
GCGGCGCGGC CGGCCGCCGA CCTTGCGGCG GCCGAGGTCG TCGTGATCGT CCCGGTCTAC
AAGGGACGGG CCGAAACCCT GCGGGCGATC CACGCCGTCC TGTCGAGCCG CCAAACCACG
TCCTTCGCCC TCGTCGTGGT GGACGATTGC GGGCCGGAGC CCCAGCTGCG CGCCGCCCTG
CAGGCCCTGG CGGCGAGGAA GTTCCTGATC CTCGTCGAGA ACGCGGAGAA TCTCGGCTTC
GTGCGCTCGG TCAACCGGGG CATCGCGGCG AGCGGCGACC GCGACGTGAT CCTGCTCAAT
TCCGACGCCG TGCCGTCCGG CGACTGGATC GACCGCCTGC GGGCCCATGC CCGCGCCAAC
CCGGACGCGG CGACCCTGAC CCCCCTCTCC AACAATGCCA CCATCTGCAG CTACCCGCAG
GCCAACATCG ACAACCGCCT CGCCCTCGAG ATCGGCCCGG CCGAGATCGA CGCCTGCGCG
CGCGCCTGCA ACCCGGGCCG CGCCGTCGAG GTGCCGACCG GGGTCGGCTT CTGCTTCTAC
ATCCGCGGCG AGGCGCTCGC GCGGATCGGT CCCTTCGACG CGGAGACCTT CGGGCACGGC
TACGGCGAGG AGAACGACTT CTGCATGCGG GCCAAGCAGG CCGGCTACCG CAACCTGCTC
GTCCAGGACG TCTTCGTGTA CCACGCGGGC GGCGTGTCCT TCTCGACCGC CTACATCGAC
AACGCGCCGC GCATCGAGCG GCGGCTGAGC CTCAAGCACC CCGACTATTT CGGGGCCGTG
CAGCGCTTCA TCGCGGCGGA TCCCGGCCGC GAGGGGCGGA TGCGCCTCGA TCTCTACCGC
ATCGCCCGGC AGGCCGGCCC GCGCGCGGCC CTGTTCGTCA CCCACGAGAG GGGCGGCGGG
ATCGAGACCC ACGTCCGGGA CGCGGCCGCG CGCCTCGCGG GCGAGGGCGT CCCGGTCGTG
CTGCTGCGCG TGGCGGGGCA CTCCACGGTC AAGGTGGAAT TCGCGCCCGA GAGCCGCCGC
GCCGTCCTGA CCTGCTCCTG CGACGCCATC CACGTGCTGC GGCACGCGGC CTCCCTCGAG
AGCTTCCTCG GATGGCTGCA GCCGCTCTTC GTCCACGTCC ATTCCCTGGT CGGGCTGGAG
TGGCGCGCGA CCCGCCGCCT GATGGAGATC CTCGCTCCCC TCCCGCGCCG CTACGTGACG
CTCCACGACT ACTCGCCCGT CTGTCACCGC AACGACCTCG TGACGAGGCT CGGCACCTAT
TGCGGCCTGC CCGGGGTCGA GACCTGCCGC GGCTGCTTGG CGGCCGACCA CGACGATCCC
GATTGCGTCG ACCCCGACGA GCGCCGGAGC GCCTACGCGG CCTTCGTGGC GGGGGCCGAG
GCGGTCTTCG CCCCTTCCCG GGACATCGCC GCCCGGATCG GGCCGCTCCT GCCCGGGGCG
CGGATCGCGC TGCGCCCCCA CGCCGAGACC CTGACGCCCC GCCCGCTCCC CGCCCTGCGC
CGGGGCGAGG TGCTGCGCGT GGCGGTCGTC GGCGCGATCG GGCCGCACAA GGGAGTCGCC
CTGCTCCACA GCCTGGGGCT CGACGCCAGG CTGCGGGATC TGCAGATCCG CTACGCGGTC
GTCGGCTACA CCTCCATGAC GGACCGGCTG TCGGAGATCG GCGTCACCGA GACCGGACGC
TACCGCTCGC CCGACGAGGC CATGGACCTC CTCGAGGCGG AAGGGGCAGA CGTCGTGCTA
ATCCCGTCGA TCTGGCCGGA AACCTACTGC TACGCGCTCT CCATCGCGCT CGCGGCCGGC
CTGCCGCCGC TCGTCTTCGA TCTCGGAGCT CCGGCCGAGC GCCTGCGGGC CCGCCAGGAG
GGGCTGCTGC TCGATCCCGC GCTGATCGAG CGGCCGCAGG AGGTCAACGA CAGGCTGCTC
GACCTGCCGC TCGCCGAGCT CTACGCGCGC CGCCGGCCCT ACCCGTCCCT GTCGTACGGG
TCGATGCTCA CGGAGTATTA CGGCCTGAAA GCCGAGGAGG TCTCCCCGGC GCAGCCGTCC
CGCGGGGCCG CGCGCCGGGC GTGA
 
Protein sequence
MLRWLRSPRK LSVRVHFDER FYRLRYPDVG VAGADPFAHY MSFGWKEGRD PSPSFPTLLY 
KDKHLGPTPR TNPLAHYAAR PEAERQRAVA FSAEEAVAIQ ARVVRDHFDE AFYRDAARLD
AGADALTHYL TVGWREGYEP APGFSSAAHA GKHRHIAATG LCPFYHFVST YGLLDHPEPL
EAALACGASG GRARVSRPVV LSTIAQEFDR ASYLTRYADV RQSGIDPVEH YVDFGWKEGR
DPSEWFWTGF YREEQAPHLG DAVNPFYHYL TEGRPAGLLP NPFGCGEWPP LEAPRADEWD
AARPAADLAA AEVVVIVPVY KGRAETLRAI HAVLSSRQTT SFALVVVDDC GPEPQLRAAL
QALAARKFLI LVENAENLGF VRSVNRGIAA SGDRDVILLN SDAVPSGDWI DRLRAHARAN
PDAATLTPLS NNATICSYPQ ANIDNRLALE IGPAEIDACA RACNPGRAVE VPTGVGFCFY
IRGEALARIG PFDAETFGHG YGEENDFCMR AKQAGYRNLL VQDVFVYHAG GVSFSTAYID
NAPRIERRLS LKHPDYFGAV QRFIAADPGR EGRMRLDLYR IARQAGPRAA LFVTHERGGG
IETHVRDAAA RLAGEGVPVV LLRVAGHSTV KVEFAPESRR AVLTCSCDAI HVLRHAASLE
SFLGWLQPLF VHVHSLVGLE WRATRRLMEI LAPLPRRYVT LHDYSPVCHR NDLVTRLGTY
CGLPGVETCR GCLAADHDDP DCVDPDERRS AYAAFVAGAE AVFAPSRDIA ARIGPLLPGA
RIALRPHAET LTPRPLPALR RGEVLRVAVV GAIGPHKGVA LLHSLGLDAR LRDLQIRYAV
VGYTSMTDRL SEIGVTETGR YRSPDEAMDL LEAEGADVVL IPSIWPETYC YALSIALAAG
LPPLVFDLGA PAERLRARQE GLLLDPALIE RPQEVNDRLL DLPLAELYAR RRPYPSLSYG
SMLTEYYGLK AEEVSPAQPS RGAARRA
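
The two sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they relate, the Python below strips the grouping whitespace and translates DNA with the amino-acid assignments of translation table 11, which are the same as those of the standard code. The helper names (clean, translate, gc_fraction) are hypothetical and not part of any database tool. The sketch does not handle the alternative start codons that table 11 permits, which is not an issue here because the gene begins with ATG.

```python
# Amino-acid assignments for translation table 11, with codons
# ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the grouping spaces and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence listed above.
first_line = "ATGCTGAGGT GGTTGCGCTC GCCGAGAAAA CTGTCGGTCC GCGTGCATTT CGACGAGCGA"
last_line = "CGCGGGGCCG CGCGCCGGGC GTGA"

print(translate(first_line))  # MLRWLRSPRKLSVRVHFDER
print(translate(last_line))   # RGAARRA
```

The two printed strings match the first 20 and last 7 residues of the protein sequence. Passing the full gene sequence to gc_fraction would give the value to compare with the listed GC content.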