Gene Mchl_5044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_5044 
Symbol 
ID7113647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp5394282 
End bp5395634 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content71% 
IMG OID643527738 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_002423737 
Protein GI218532921 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.740829 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.449869 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAAG CCCACACCGC GCCCGTCCCG CCGGCAGGCC TGCTCCGGCG CGCCGCCGGT 
CTCCTCCGGC GCCCGCCGGC CGCGCTCGCC GCCCTGGCCG ACCAGGGCGT CGTCAGCGGC
TTCGGCTTCC TCAGCGGCAT CGTCGCCGCC CGTCTGCTCG GCATCGCCGA GTTCGGGCAT
TTCGCGATGA TTCTGATCGT GCTTACCTTC GCGCAGGCCC TGCACAACGC CCTCATCACC
GCGCCGATGA TGACGCTGGT CGGCGCCCGC AGCGGCGTCT CGAAGGCCTA TGCCGCGACC
ATCCTCACGG GCGCCTTCCT GCTCTGCGTG CCCGGCGCGG TCTTCGTCGT GATCGCCCTC
CTCATCGGCG GGATGTCGGG CGAGACGCTT GTGGCGGCCT GCGCCCTGAT GCTGGCGCAG
AACCTCCAGT TCACCCTGCG CCGCCTCCTG TTTGCGAAAG GTCGGGGCGT GCAGGCCCTG
CTCATGGATT TCGCCCGCGC CGCGAGCTTC CCCTTCATCG CCCTGCTGAT CTGGCTTGAG
CACGACGTCA TCGGCAGCAA TGGCTTCGTC TGGCTGCTCG CCGCGACCTC TTTCGCGACC
TGCCTGCCCT TCATCGTCGC GTTCGGCCGG CCGATCCTGC GCCGGCCCGG CTGCGTGCAG
ACCGGCGCGG TCTTTCGGCG CCACATCCCG CTCGCGCGCT GGCTCCTGCC GATCGTCTTC
GTCACCTTCG TCCAGGAGCA GCTCATCTGG CTGGTGGCGG GCGCGACGCT GGGCCTCGAG
GAACTCGGCG GCCTGCGGGC GGCGCAGTAC CTCGTCGGGA CCGTGCTGCT GCTGCTCGCC
GCCACGGAGA ACGTCCTGCC GGTGGCCGCC GCGCGCGCGC ATTCCGAGGG CGGAGAGGCG
GCCCTGCGCC GCTACCTCAT GCGCACGGGC ATCAAGCTCG GGGTGCCGAT CATCGCGATC
CTCGCGGTCC TCGCCATTCC GGGCGCGATG TGGCTGCGCC TGATCTTCGG GGCGGAATAT
GCGGCCTATG CCAACTGCCT GCACATCCTC TCGGTCAGCG TCGTGATCGT GCTGGCCCGC
GACCTCACCA CGAACTACTT CCGCGCCAAG CAGAACACCC GGGTGCTGTT TGCCTCGCTC
TGCGTGAGCA TGGTCGTGTC GCTCGCCGTA GTGGTCCCGC TGATGCAGGC CGGCGGCGTC
AGCGGCGCCG CGGCGGCGGT GGGGGCAGGC CACCTCGCCT CCCTCATCTA CCTCGTGCTG
GCCGCACGGC GGCAATCGCG CCCGGCCTCG GCCTGGCCGA TGCCGGGCCG GTGGCGGCGC
TCGCTCAGGC CGGCCAAGTC GGCGCAGACC TGA
 
Protein sequence
MPEAHTAPVP PAGLLRRAAG LLRRPPAALA ALADQGVVSG FGFLSGIVAA RLLGIAEFGH 
FAMILIVLTF AQALHNALIT APMMTLVGAR SGVSKAYAAT ILTGAFLLCV PGAVFVVIAL
LIGGMSGETL VAACALMLAQ NLQFTLRRLL FAKGRGVQAL LMDFARAASF PFIALLIWLE
HDVIGSNGFV WLLAATSFAT CLPFIVAFGR PILRRPGCVQ TGAVFRRHIP LARWLLPIVF
VTFVQEQLIW LVAGATLGLE ELGGLRAAQY LVGTVLLLLA ATENVLPVAA ARAHSEGGEA
ALRRYLMRTG IKLGVPIIAI LAVLAIPGAM WLRLIFGAEY AAYANCLHIL SVSVVIVLAR
DLTTNYFRAK QNTRVLFASL CVSMVVSLAV VVPLMQAGGV SGAAAAVGAG HLASLIYLVL
AARRQSRPAS AWPMPGRWRR SLRPAKSAQT