Gene M446_3112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3112 
Symbol 
ID6130324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3443680 
End bp3446559 
Gene Length2880 bp 
Protein Length959 aa 
Translation table11 
GC content73% 
IMG OID641643303 
Productglycosyl transferase family protein 
Protein accessionYP_001769956 
Protein GI170741301 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.264403 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCG ATCCCGTGCC CCGCGCCCCG GCGGATGCGC CGTCCTCCCC GGCGCGCCCG 
CGCCTGGCGG TGATCGTCCC GGTCTTCAAG CACAGCGGGC TGGTGCGCGA GGCGGTCGCC
TCGCTCACGC GCCAGTCGCG GTTCGCGGAC GTGGACGTGG TCCTGGTCGA CGACGGGTGC
CCCGACCCGC AGACCTTCAC GACGCTGACC GCCTTCTGCG CGCGCTGGCC CAACATCCAC
TGCGTGAGGC AGGCCAATGG CGGCCTCAGC GCGGCCCGCA ACCGCGGCAT CGCCTTCGCG
CTCCGCCGGC TCCCGGCGGC GGAGGGCGTC TACTTCCTCG ATGCCGACAA CCTCCTCGCC
CCCTACGGCA TCGCGGCGAT GCAGGAGGCG CTGCTCCTTC ATCCCGAGGC GGATTGGTTC
TACCCGGACA TCCGGATGTT CGGTGTCCGG GCCTTCCACG ATTACAGCGG CGACTTCAAC
GCCTACGCGG CCTCGGTCGT CAACCTGTGC GAGGCCGGCA GCCTGATCCG GCGCCGCATG
ATCGAGGCGG GTCTGCGCTT CGACGAGGGG ATGCGGCTCG GCTACGAGGA TTGGGATTTC
TGGCTCTCGG CCGTGGGGCG CGGCTTCCGC GGCCGCCACC TGCCGAATCT CGGCCTGTCC
TACCGGAAGC GGGCGGAGAG CATGCTCGCC GACTCGACCC GCTCGGACAC GCAGATCCGG
GAATACCTGC GCCGCAAGCA CGAGGCGCTG TTCCAGGTGA ACGCCTTCGT CGCGCGGGAG
CACGACGAGC TGCCGCGCTT CGCCGTGCGC CTGTCCGACG AGGCGAGCGT GGAGATGGCG
AGCGACCTGC AGCGCGGCGG CCCGCGCATG GACCAGCAGG CCTTCGAGAC GCGGATCTGG
CAGGCGATCC GCGAGCCGGC CTTCGTCTGG GCGGGCCAGT ACCTCCTGTC GACGACCGGC
CGGACGCTCG CCCTGCTGCG CGGGGCGGGC CTGACGCGCT GGCTCTGCCT GGAGATCGAG
CGGGCCCTCG CCGGCCACAA CTTCGTGTCG CTGACCCTCG CGGCCTCGGA CGACGACACC
ATCCGGGTCC GGCGCGGGCA CGGCTTCTCG CCGCACTGCC ACCTCCTCGC CGTCTCGCAG
AAGCTCCTGC AGGCGATCGC CCTCGACGAG AAGGACGCCT GGATCACGGA ACTGCCGATC
CACGGCGCCA ATTACCGCGT CGCCACCCTG GAGATCCGCC TCCCGCCTCG GGCGATGGCG
GCGGAGACCC TGAAGGACGT GGCGGTGACG GACTTCGTCC AGTTCTGCCT GCACCTGCGC
TGCCACGGCC TGCGCGGCCG TCCGAGCAAC CTGGTCGAGG AGGTCTTCCT CGGCTCCCGC
CCGCTCGACG CGATGGCGCC GCGCGCCCGC GCCCAGTTCG ACGGCGCCCT GCTCCCGCCG
GTGGCCGAGG CGCGCGAGGG CCGCGTCGCC TGCGTGCTGC CGCATTGCGA TTTCGGCGGC
GTCGAGAAGG TCACGTTCTG CCTCGCCCGC GAGTTGCGCC GCCAGGGGCT GCGCACCAGC
CTGATCCTGC TCGGGAGCGA CGTCGCCTAC CGGGCGCACC GCGCGCTCGA GGCCTTCGAC
GACATCTACC TCGTCGACGC CGGGGGCCGC ATCGCCGCCT GGGCGGGCGA TTCCTTCCTG
GGCACGCAGC TGCCCAGGAT CCTGGACGAG GCCTGGGCCC GCGACTTCGC GAATGTCCTG
ACCACGTTCG AACTCGTGGT GAGCTGCCAC TCGGCCGAGA TCATGGGATT GTTCAGCGGG
CTGCGGCGCC GGGGCGTCAC CACGGCGACC TACCTGCACC TGTTCGACAA GTCGCGCATC
GGCGCCGCCT GCGGCCATCC GATGCTCGCC CTCGCCTACG AGCACGCGAT CGACCTCGTC
CTGACCTGCT CGGAGGGGAT GGCCTGCGAG ATGGCGAGCC TCGGCATCCC GCGCGACAAG
ATCCTGGCCC TGCCGAACGC GCCCTCGCTG GAGCCCGATC CGCACCGCGC CGTCGCGCCG
CGCGCCCCGG CCGGTCGCCC CCTGCGCCTG CTCTACCTGG GCCGCCTCGA TACCCAGAAG
GGGCTCGACC GGCTGGCCGA GATCATCGAC GCGCTCGACC CGGACCCGCT CTTCGAGATC
CGGGTGGTCG GGAAGGCGGT GTTGACCGAC GCCCACCTGA CCCTCAGCCG GCACGCGCAC
CTGGTCGAAC CGCCGGTCTA CGACGATGCC GGCCTCTCCG AGATCTACCG CTGGGCCGAT
ATCCTGCTGC CGTCCCGCTA CGAGGGGCTC CCGCTCACCG TGCTGGAGGC CATGGTCCAC
GGCGTGGTGC CGATCGTGGC CGCCTGCGGG GCGGTGGCCG AGGCGGTCGA GTCGGGCGTC
AGCGGCGTCG TCGTCCCGCA GGAGCGCTGC GTTCCGGGCT TCCTCGACCA CCTGCGGGCC
CTGGCGGCCG CGCCGGAGCG GTTGGAGGCG ATGAGCCGCG CGGCGATGGC GAGGGCCGCC
GGCCGGCCCT GGAGCGTGCT CGCGGAGCGC CTGCGCGCGC GGCTCGGCGC CGTCCGGGCG
GCCCGCGCCC GAGAGAGGGC CGGAGCGGGC CGCGCAGGAG CGGGAGCCGG GCTGATCGGC
GCCCGCGGCG GCCGAGCCGT GCCCCCACGG CGGGGCCGCC CCGCTCCCGG CGACCTTGCA
GGATCGCGCC GAATCGCTAG CTGGGAAGCG ACCCTCCGGA GAGTTCGGCA TGAATGCCCG
TCCCGACGCG CGCGACTTTG CGGCCCCCCA GCAGTTCGGC ATCGGTCAGC CGGTGCCGCG
GGCCGAGGAT CCCGTGCTGG TGCAGGGGCA GGGGCGCTAC ACGGACGACC TCGCCCTTGA
 
Protein sequence
MSADPVPRAP ADAPSSPARP RLAVIVPVFK HSGLVREAVA SLTRQSRFAD VDVVLVDDGC 
PDPQTFTTLT AFCARWPNIH CVRQANGGLS AARNRGIAFA LRRLPAAEGV YFLDADNLLA
PYGIAAMQEA LLLHPEADWF YPDIRMFGVR AFHDYSGDFN AYAASVVNLC EAGSLIRRRM
IEAGLRFDEG MRLGYEDWDF WLSAVGRGFR GRHLPNLGLS YRKRAESMLA DSTRSDTQIR
EYLRRKHEAL FQVNAFVARE HDELPRFAVR LSDEASVEMA SDLQRGGPRM DQQAFETRIW
QAIREPAFVW AGQYLLSTTG RTLALLRGAG LTRWLCLEIE RALAGHNFVS LTLAASDDDT
IRVRRGHGFS PHCHLLAVSQ KLLQAIALDE KDAWITELPI HGANYRVATL EIRLPPRAMA
AETLKDVAVT DFVQFCLHLR CHGLRGRPSN LVEEVFLGSR PLDAMAPRAR AQFDGALLPP
VAEAREGRVA CVLPHCDFGG VEKVTFCLAR ELRRQGLRTS LILLGSDVAY RAHRALEAFD
DIYLVDAGGR IAAWAGDSFL GTQLPRILDE AWARDFANVL TTFELVVSCH SAEIMGLFSG
LRRRGVTTAT YLHLFDKSRI GAACGHPMLA LAYEHAIDLV LTCSEGMACE MASLGIPRDK
ILALPNAPSL EPDPHRAVAP RAPAGRPLRL LYLGRLDTQK GLDRLAEIID ALDPDPLFEI
RVVGKAVLTD AHLTLSRHAH LVEPPVYDDA GLSEIYRWAD ILLPSRYEGL PLTVLEAMVH
GVVPIVAACG AVAEAVESGV SGVVVPQERC VPGFLDHLRA LAAAPERLEA MSRAAMARAA
GRPWSVLAER LRARLGAVRA ARARERAGAG RAGAGAGLIG ARGGRAVPPR RGRPAPGDLA
GSRRIASWEA TLRRVRHECP SRRARLCGPP AVRHRSAGAA GRGSRAGAGA GALHGRPRP