Gene M446_3495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3495 
Symbol 
ID6129104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3899374 
End bp3901359 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content73% 
IMG OID641643666 
Productglycosyl transferase family protein 
Protein accessionYP_001770314 
Protein GI170741659 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.204493 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.401757 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCG ACCGGGGCAT CTGCCTCTGC ATGATCGTGA AGGACGAGGC GCCCGTGATC 
CGGCGCTGCC TCGACTCCGT GCGGCCCCTG ATCGACCACT GGGTGGTCGT CGATACGGGC
TCGACCGACG GGACGCAGGA GATCGTGCGC GACGCCCTCG CCGGGCTGCC CGGCGCGCTG
GTGGAGCGGC CCTGGCGCGA CTTCGCCCAC AACCGCAGCG AGGCGCTGGA CCTCGCCCGG
CCGCGCGGCG CCTATTCGCT GATCATCGAC GCCGACGACA CGCTGGAGAT CCCGGACGGC
TTCGTCCTGC CGCCCCTCGA CGCGGATTCC TACACGCTCG ACATCCGGTT CGGGGCGATC
GCCTACCGGC GCCCGCAGCT CGTGCGCAAC GCCCTCCCGT GGCGCTACCG GGGCGTGCTG
CACGAGTTCC TGGCCTGCGA GGAGGCGCGC AGTTCCGGGC ACCTGCCCCT GACGATCCGC
GTGAGCGAGG ACGGGCGGCG GCGGCGCGAT CCGGCGACCT ATCGGCGCGA CGCGGCGGTG
CTGGAGCAGG CGCTCGCCGC CGAGACCGAC CCCTTCCTGG TCGCGCGCTA CACCTTCTAC
CTCGCGCAGA CCTACCGCGA TTGCGGCGCC GTCCAGCAGG CGGTCGAGGC CTATCTCCGG
CGGGCGACCC TGGGCTTCTG GGAGGAGGAG GTCTTCGTCA GCCTCTACCA GGCCGGCCGC
CTGATGGAGG CCCTGCGGGC GGATCCGGAC GAGATCCTGG CGGTCTACCG CCGGGCCGGC
GAGGTCCGGC CGGGCCGCGT CGAGGCGGCC CACGCGGCGA GCCGCTTCTG CCGCGGCATC
GGGCGCAACC GCCAGGGCCA CGCCATCGCC AAGGAGGCCC TGGGCCGGCC CGCCCCGGCC
GACGGGCTCT TCGTCGAGCG CTGGATCTAC GCCTACGGCC TCGCCGACGA GTACGCGATC
AACGCCTACT GGGCGGGCGC GATGCACGAT TGCATCGACG GCTGCCTGCG CGCCCTGCAC
AGCGAGATGC TGCCTGGCCC CGACCGCGAG CGGATCCTGC GCAACCTGCG CTTCGCCGCC
TCGGCCCTGG AGCAGGCCCC GCCGCGGCAG CCCTCCCCGC CCCTCGATCC GCCGGCGGCG
CTGCGGCCGC TGCGAAGCCG CCTGCCCGAG CCGGCCCCGC GCGTGCTCCT CGCCGTCCTG
GCCAAGCAGA AGGAGCCCGT GCTCGACCTC TACCTCGACT GCATCGAGGC CCTCGACTAC
CCGAAATCAT CGATCGTCCT GTGCGTGCGC ACCAACAACA ACACGGACCG GACGGGCGGG
ATGCTGCGCG CCTGGCTCGA CCGGGTCGGG GGGCTCTACG CCGGGATCGT CTTCGACGAC
GCGGACGTGC CCGAGCCGGT GCAGGACCTC GCGGTGCACG AGTGGACGCC CCAGCGCTTC
GCGGTCTTGG GCGCGATCCG CCAGCGCAGC CTCGCCCTCA CCCTGGCGCG GGACTGCGCC
TTTTACTTCG TGGCGGATGC CGACAACTTC CTGATCCCGT CGACCCTGCG CGATCTCGTG
AGCCTGAACC TGCCGATCGT GGCGCCGATG CTGCGGGAGG TGAAGCCCGG CTCGCGCTAC
GCGAATTTCC ACGCGGCGGT GGATGCGCAG GGCTACTTCG CGGAATCGCG GGACTACGAC
GCGCTGCTCG AGCGGCGGAT CCTCGGCGTG GTCGAGGTGC CGGTGGTCCA CTGCACCTAC
CTCGTGCGGG CGGACGCGAT CCCGCTCCTG CGCTACGAGG ACGGCAGCGG GCGGCACGAA
TACGTGGTGT TCTCCGACCA TGCGCGGCGC CGGGGCATCC CGCAATACCT GGACAATCGG
CGCTGCTACG GCTGCCTGAC CCTGGAGGAC GACGACCCGG AGGCGCTCGC CGCGCGGCTC
CCGGCGATCC TGGCCTTCCT GCGCGAGGTG CGGGCGGGTG ATCGGCCGGA GCCCGCGCGG
GCCTGA
 
Protein sequence
MTADRGICLC MIVKDEAPVI RRCLDSVRPL IDHWVVVDTG STDGTQEIVR DALAGLPGAL 
VERPWRDFAH NRSEALDLAR PRGAYSLIID ADDTLEIPDG FVLPPLDADS YTLDIRFGAI
AYRRPQLVRN ALPWRYRGVL HEFLACEEAR SSGHLPLTIR VSEDGRRRRD PATYRRDAAV
LEQALAAETD PFLVARYTFY LAQTYRDCGA VQQAVEAYLR RATLGFWEEE VFVSLYQAGR
LMEALRADPD EILAVYRRAG EVRPGRVEAA HAASRFCRGI GRNRQGHAIA KEALGRPAPA
DGLFVERWIY AYGLADEYAI NAYWAGAMHD CIDGCLRALH SEMLPGPDRE RILRNLRFAA
SALEQAPPRQ PSPPLDPPAA LRPLRSRLPE PAPRVLLAVL AKQKEPVLDL YLDCIEALDY
PKSSIVLCVR TNNNTDRTGG MLRAWLDRVG GLYAGIVFDD ADVPEPVQDL AVHEWTPQRF
AVLGAIRQRS LALTLARDCA FYFVADADNF LIPSTLRDLV SLNLPIVAPM LREVKPGSRY
ANFHAAVDAQ GYFAESRDYD ALLERRILGV VEVPVVHCTY LVRADAIPLL RYEDGSGRHE
YVVFSDHARR RGIPQYLDNR RCYGCLTLED DDPEALAARL PAILAFLREV RAGDRPEPAR
A