Gene M446_5190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5190 
Symbol 
ID6130928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5714193 
End bp5716208 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content68% 
IMG OID641645325 
Productglycosyl transferase group 1 
Protein accessionYP_001771949 
Protein GI170743294 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00612346 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0328949 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCT CCCCCATCCT CCAGGTCGCC AATCCGGAGC GCGACCCGGA AGGTCTCATC 
ACGAAGTACG TTCAGTTCGA GAGCTTCCGC CTCGTGCGGG ACCACCACCT CGACGACATG
ACCGGCTCGC TGCAGGATCG GCTCAAGGTC CTGATGTGGT ACCTGACGGA CTATCAGGCG
CACCGCAAGG CGCAGGGCAC GCGCTTCGAG ATGCCGCTGT CGCGCGCGCA GGTGGCGTTC
CTGAACCGGC CGATGCCGCT CGCCGGCCTG TCGCCGGCCG TGACCGTCGC CCTCTACAAT
TCCGTCGTGC GGGAACTGCC GAGCCACCTC AACCTGGCCG ATGTGCGCGT GCTGCGCGAA
GCCGTCTACT GGTGGTGCTT CGAGCGGACG ATCGACGCCA AGCTGCATCG CGCGCTCGTG
ACGCGGGAGC AGATCGCGGT CCTGCAGGCG CCGTCGAGCG CGCCGTACGC CGACTTCCCC
TTCAACATCT TCATGGAGAT CCAGTTCGAG CGCGACCGGG CCAAGCTCGA CCTGACCAGG
GACAAGGCCT CGGACCGCGC CGCCTATCTC TGCTACCTGA TCCTGTCGAG CTACACGCGG
CCCTACCTCA GGAGCTTCCT GCCGGGCAGT CAGGTCCGGC AGCTGCTGCG GGCGAGGGCG
GAGGCCGCCT CGCTCTTCGA CGAGATCATC GCGGCCGTCG CCCTGCCGCC GGGCGCGCCG
CCCTCCCGCG TGGCGGCCCT GCGCGGGCAG GCGGAGGCCC TCGCGGCCAA GCTGGCGAGG
CAGGGGGCCG GCGAGAGCAC GGCCGAGGTT CGGGATTACG GCGAGTTCCT GCTGCCGAAG
CGGGATTTCC CGCGCTCCCA CCCCGAACCC GGCGTCGCCC TGATCGGTCC GCTCCTCCAG
ACGTCGGGCC TCGGCCAGGC GACGCGCATC TCCTACGAGA TCCTGACGGC CGCCGAGCGC
GTGCGTCCCA CCGCGCTGCC CTTCGGCCTC GACAACCCGG CGCCCATCGG CTTCGCCACC
GAGCTGAACT TCGAGACCTT CACCGCTCCG CGCGAGATCA ACCTCATCCA CCTCAACGCC
GAATCGATTC CCCTGGTCTA CGCGTTCGAG CAGCGCGAGA TCGTGGCCAA CAGTTACAAT
ATCGGATATT TCTTCTGGGA GCTGAATCAG ATCCCGAAGT GCCATAATCT GGCGCTCGAT
CTGCTCGATG AGATCTGGGT CTCTTCGGAG TACAACCGCG AGATCTACGC CCGGTTCACC
GACAAGCCGG TGGTGAACGT GGGCATGGCG GTCGAGCCGC TGCCCGAGGT GGAGGCGATG
GACCTCGGCA GCCTCGGGCT GGAGCGCGAC GCGACAATCT TCCTCACCAC CTTCGACTCC
TTCTCCTTCA TCGAGCGCAA GAACCCGCTG GCCGCCGTCG AGGCGTTCCG GCAGGCCTTT
CCGCTCGGCA CCGAGGCGGT CGCCCTCGTC ATCAAGACCC AGAACAGGAC CCGGGTCGGC
GATCCGCACC AAGTGGCGAT CTGGAGGAAG ATCGACGACG CCTGCAGGGC GGACCCGCGG
ATCCTGATCG TCGACGAGAC GCTCAAGTAC AGGGACCTCC TCGCCCTCAA GAAGGCCTGC
GACTGCTACG TCTCGCTGCA TCGGTCCGAG GGCTGGGGCT TCGGCATGAT CGAGGCCATG
CAGCTCGAGC GGCCGGTGAT CGCGACGGCC TATGGCGGCA ACATGGATTT CTGCAGCGAG
GAGAGCGCCT ACCTGATCGG GTACGACCTC GTCGGGGTGC AGAGGGACGA GTACATCTTC
GTCGAGCGCG GCAGCGTCTG GGCCGATGCC GACCTCCGGC AGGCCGCCGC CGCCATGCGC
CACGTCGCGA CCGACCAGGC CGCGGCGCGC GCCAAGGGCG TGAGCGCGGC CCGTCTCGTC
AAGGCGCGTT TCAGCATCCC GGCGATCGCG AAGCGCTACG GCGCGCGGCT GGAGGAGATC
CGCGCCGCGC CCGCTCGCCG CCTGGCCGCC TCGTGA
 
Protein sequence
MSSSPILQVA NPERDPEGLI TKYVQFESFR LVRDHHLDDM TGSLQDRLKV LMWYLTDYQA 
HRKAQGTRFE MPLSRAQVAF LNRPMPLAGL SPAVTVALYN SVVRELPSHL NLADVRVLRE
AVYWWCFERT IDAKLHRALV TREQIAVLQA PSSAPYADFP FNIFMEIQFE RDRAKLDLTR
DKASDRAAYL CYLILSSYTR PYLRSFLPGS QVRQLLRARA EAASLFDEII AAVALPPGAP
PSRVAALRGQ AEALAAKLAR QGAGESTAEV RDYGEFLLPK RDFPRSHPEP GVALIGPLLQ
TSGLGQATRI SYEILTAAER VRPTALPFGL DNPAPIGFAT ELNFETFTAP REINLIHLNA
ESIPLVYAFE QREIVANSYN IGYFFWELNQ IPKCHNLALD LLDEIWVSSE YNREIYARFT
DKPVVNVGMA VEPLPEVEAM DLGSLGLERD ATIFLTTFDS FSFIERKNPL AAVEAFRQAF
PLGTEAVALV IKTQNRTRVG DPHQVAIWRK IDDACRADPR ILIVDETLKY RDLLALKKAC
DCYVSLHRSE GWGFGMIEAM QLERPVIATA YGGNMDFCSE ESAYLIGYDL VGVQRDEYIF
VERGSVWADA DLRQAAAAMR HVATDQAAAR AKGVSAARLV KARFSIPAIA KRYGARLEEI
RAAPARRLAA S