Gene M446_3122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3122 
Symbol 
ID6132556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3457049 
End bp3458836 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content61% 
IMG OID641643312 
Productcapsular polysaccharide biosynthesis protein-like protein 
Protein accessionYP_001769965 
Protein GI170741310 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4421] Capsular polysaccharide biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.043835 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGATG TGAACCTTGC CGCGTCTCGA CGCGAGTCGC TCGATCAGAT CGGACTTCGT 
CTCGGCACTC TTCGGGCGTC CAACGTTCAC GACTATCTGC GGCGCTACGA AGCATTATTG
CGCCCGAAAC TTTCGCGCCC GATCAAGATT CTCGATCTCT CGTTGTTGGA GATCACCGGC
GCGCGCGCGC TGGCGGAGTT CATCGAAACA GGATTGATCG TCGTCAGCGT CGGTCCGGAG
CGCGAGATCC CGGACCTCGA GCTCGCCGAT CATCCGCGCC TCCTCCTGAC GCGCGGCGAT
TGCCGCGATC CGGCTTACCT CTGCAGCCTT CACGCGTACG GCCCGTTCGA CCTCATTCTG
GAGAACGACC GGCACGTGGT CGAGGATCAG CTGATCGCAC TGGAATACCT GTTCCCCGCG
TTGGCGCCGG GCGGATCCTT CGTCTTCGAG AGCGCGTTCG CCTCGACAGC GGAAACGCCG
CGCCTGAGCG AGGCGGGGAT CACGGGCATC ATCGACGTCG CGCGAGACCT CGGGACCAGC
TTGACGGCCA GAGAGCCGAG GATCCGGCAG TTCGACGACG AGATCGTCAC GAAGGCGCTC
GATGCCGTGA TCTTCGAGAG GTCGAACATC ACCCTGAGGC GCACCGACAA GCCGAAGAAC
GCGCCGGTGA TCCTGCACGC CAAACCGTTC GCGGAGATCG GCGACGGCGT CGTGGAAGCG
CTCGAGAGCA AGCCGTACAC GCGCACGGAC CCGGTTGTCC AAACCCGGCT GGCTTGGATG
ACCGAGCGGC TGCTGGAGCG CGTCGGAAAG GTCGAGCATC CCCCGGCGGG CCAGATCGGC
ACCGTGAGCA ACGCCATCAT CTTCGGCGAA GGCATCATCG TCGATCGCGC CGGCCGCCTG
GTGGTCGAGA GCTTGATGAA CGAGCGGGAC GTTCCGCGTC TTCCCTACAT CAAGAAGCTG
CACGGCGACC GCTACGCGAT GCTGGACCAC GATCAGGTCG AGCATCTGAG CGGCGAGAAC
ATCGTTGCCG TCAAGCAGCG CTGGGATACC AATTACGGCC ATTGGCTTGT CGAAACCCTG
CCCAGGGTCG GCCTGCTGGC GGAGCGCATG CCGCTGGACA GCTGCAAGCT GCTCATCACC
GCTTGGTCGG ACGCAATGGC GAGCGTGATG CGGCAGTCGC TCGTCCTATG TGGCGCGACG
AACGAAAACA TCCTCCAGGT GTCTGGCGCC CCCCTGTCGG TCGACAAACT GATCTATGTG
ACGCCGATCT CCAGCCATCC ATTTGTTTTC CACCCTTACT CGGCGAGATT TTTGCGGTCG
CTCGCCGTCA AGCACATGGA GCGCATCGGG GTGTCCGGCG CCCAGCCGAC CAAGGTGTAC
GTCTCTCGAA ACAAGGGAAA TTCGCGCCGC ATCGTCAACG AGGACGAGAT CGCTGCGATC
CTCACGTCGC GTGGATACAG GATCGTTTAT CCTGAGAATC ACACTTTCTA TGAGCAACTC
GAGATTTTCG CAAATTGTAC GCATATCGTG GGAAACCTCG GTGCCGCGTT GACGAACGTT
GCCTTCGCAC GCGACGGAGT CGGCCTTCTT GCGCTCGCCT CGGAGTTCAT GCCGGACGAT
TTCTTCTGGG ATCTCACCAG TCAGCGCGGG GGGAGGTATT TCTCCATTCA CGGCACAGCG
GTGGAGAAAC ACGATGAAGG TCATCGATCG GAAATGAATG CTTGGTTCGT GCTGGATATC
CCGAATTTTG TCGAGATGCT CGACCGGTTC GAAGCCGAAG TTGCTTGA
 
Protein sequence
MEDVNLAASR RESLDQIGLR LGTLRASNVH DYLRRYEALL RPKLSRPIKI LDLSLLEITG 
ARALAEFIET GLIVVSVGPE REIPDLELAD HPRLLLTRGD CRDPAYLCSL HAYGPFDLIL
ENDRHVVEDQ LIALEYLFPA LAPGGSFVFE SAFASTAETP RLSEAGITGI IDVARDLGTS
LTAREPRIRQ FDDEIVTKAL DAVIFERSNI TLRRTDKPKN APVILHAKPF AEIGDGVVEA
LESKPYTRTD PVVQTRLAWM TERLLERVGK VEHPPAGQIG TVSNAIIFGE GIIVDRAGRL
VVESLMNERD VPRLPYIKKL HGDRYAMLDH DQVEHLSGEN IVAVKQRWDT NYGHWLVETL
PRVGLLAERM PLDSCKLLIT AWSDAMASVM RQSLVLCGAT NENILQVSGA PLSVDKLIYV
TPISSHPFVF HPYSARFLRS LAVKHMERIG VSGAQPTKVY VSRNKGNSRR IVNEDEIAAI
LTSRGYRIVY PENHTFYEQL EIFANCTHIV GNLGAALTNV AFARDGVGLL ALASEFMPDD
FFWDLTSQRG GRYFSIHGTA VEKHDEGHRS EMNAWFVLDI PNFVEMLDRF EAEVA