Gene Mchl_5050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_5050 
Symbol 
ID7118924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp5402049 
End bp5404172 
Gene Length2124 bp 
Protein Length707 aa 
Translation table11 
GC content73% 
IMG OID643527744 
Productlipopolysaccharide biosynthesis protein 
Protein accessionYP_002423743 
Protein GI218532927 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.376238 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.397595 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATGA TCGAGCGGAT GCCTTCGCGG TTCTTCGTCG GCGCCGAGCC GGGCAAGCCG 
GACGTGACGC CTGAACCCTG GTTCCTCGAC CCGCGTGAGA TCGGACGGGC CCTGCGCGCG
CGCTGGGCGC TCGTGCTGGC CCCGGCTGTG CTCCTCTTGG TGGCGGCCGT GGCGTGGCTC
GCGCTGGTGC CGCCGCTCTA CGCCGCCGTG ACGCAGATCC TGATCGACCC GCGCGGCATC
CAAGTGGTCA AGGACGGCGT GACGCCCTCG GACCAGGCGA GCGACGCGAG CCTGTTCCTC
GTCGATAGCC AGATCCGGGT CCTCATCTCC GACGAGGTGC TGCGGCAGGT CGTGACTCGG
TTCAAGCTCG ATCAGGACCC GGACTTCGTT CGTCCCGCCT CGCCGCTCGA GACGCTCAAG
AGCCGCCTCT CCTCGCTGAT CGTCACCGCC GGCGGCCCTG CCGACGACAC GCTCACCGCC
CTGCGCACGC TGCGCGAGCG CACCACCGCG CGCCGCCTGG AGCGCAGCTT CGTGGTCGAA
CTCGCCGTCT CCAGCGAGGA ACGCCGGAAA TCCGCCGAGC TCGCCCAGGC CATCGCCGAA
ACCTACCTCA CCACCGTCTC GCAGGCGCAG GCGCAGGTCA CCCGCAAGGC CGGCGAGGCG
GTGTCGAGCC GGCTCGGCGA GTTGCAGGAC GACCTCCGGC AGGCCGAGGA CAAGGCGCAG
AAGTTCCGCG CCGCCAACAA CCTCGTCGGC ACCCGCGGCC AGCTCGTCAG CGAGCAGGCG
CTGACCCAGC TCAACCAGCA GCTCGGCGCG GCGCGTGCCC GGGCCGGCGA GCTGCGCGGG
CGGCTCGCCC AAATCGAGGC GGTCGCCAAC GGGCGGGCCG ACCTCAACTC GGTGACCGAA
ATCGTCCAGT CCACGACGGT CGCGCAATTG CGCGCCCAGC TCGCCCAGAT CGAGGCAGCC
AGGGCCGACA CCCTGTCCAA CCTCGGGCCC CGTCACCCCA CCCTGCGCAC CGGCGAGTTG
CAGGTGCAGA CCCTGCGCAA CGACATCAAT GCCGAGATCC GCCGCATCGC CGCGGCCACC
CGCAACGATT ACCGGTCGGC ATTGTCCAAC GAGGCCTCGC TCGCCGCCAC CCTGGAGAGC
CGCAAGAAGG AGGCTCTGTC CGTCGACAAG AGCTTCGTGC GCCTGCGCGA ACTGGAGCGG
CAGGTCGAAG CGAGCCGTGC GGTCTACGAG GCCTTCCTCG TCCGCGCCCG CGAGCTTCAG
GAGCAGCAGC GCCTCGACAC CTCGACCTCA CGCGTCATCT CGCCCGCCTC ACTGCCGGAG
CGCCGGCTCG GCCCGCCGAT CCCGGCCATC TTCGCCGCGG CGCTGGCGGC CGGGCTCGGC
CTCGGCACCG CGCTCGCCCT CCTCGCCGTG CCGGCCGCGG GGCGGATCGG TTCGCGCCGC
CGGTTTCAGC AGCTCGCGGG GCTCCCCGTG GTCGCCGCCC TGCCGGCCAA GGTGCCGACC
AGGACGCGGA GCAAGGCAGG CAGCGAATCC CTGCGCGCCG ACACCGCCTA CGACGTGGCC
GTGGCCCGTC TCGGCAGCCG TCTGCAGCGC GATTTCGGGG CCACGCGGCC GACGGTGGTC
CTCGTCACCT CGGCGGACGA CCGGAGCGGC AAGTCGGAGC TGGCGCGCAG CCTCGCCGCC
TCGGCCGCGC TCGACGGCCA GCGGGTGCTG CTCGTCGATG CCGACCCGGA GGCGATGATC
TCGCGCGATC TCCGGAGCCA GGCCAAGCGC GGCGCCGCCG AGGTGCTGCG GACGCATTCG
GGGCTCGGTG ACGCGCTGGT CGAGGGGCCG ACCGGGGTCA AGATCCTGCC CTTCGACGAC
GCGGCCCTGC GCCTCGGCAC CGCGGCCTAT ACCGGGGCGA TCCTGACGGC GGCTTCTGCC
TTCGACACGG TCTTCGTCGA TATCGGGCTG ATCGGCACCG ACATCGCCGC CGAGCGCCTC
GCCCAGGACC AGCGCTTCCC GGCCCTGCTG CTGACGGCCA GCGCCGCCCG CAGCGGCACC
GCCCGGCTGC GGCGGGCGCT CGACGCCCTC GGCCGCGACC CGCGGGTGCA GCTCGTCATG
ACCGACGCCG AGGCCGAGGG GTGA
 
Protein sequence
MTMIERMPSR FFVGAEPGKP DVTPEPWFLD PREIGRALRA RWALVLAPAV LLLVAAVAWL 
ALVPPLYAAV TQILIDPRGI QVVKDGVTPS DQASDASLFL VDSQIRVLIS DEVLRQVVTR
FKLDQDPDFV RPASPLETLK SRLSSLIVTA GGPADDTLTA LRTLRERTTA RRLERSFVVE
LAVSSEERRK SAELAQAIAE TYLTTVSQAQ AQVTRKAGEA VSSRLGELQD DLRQAEDKAQ
KFRAANNLVG TRGQLVSEQA LTQLNQQLGA ARARAGELRG RLAQIEAVAN GRADLNSVTE
IVQSTTVAQL RAQLAQIEAA RADTLSNLGP RHPTLRTGEL QVQTLRNDIN AEIRRIAAAT
RNDYRSALSN EASLAATLES RKKEALSVDK SFVRLRELER QVEASRAVYE AFLVRARELQ
EQQRLDTSTS RVISPASLPE RRLGPPIPAI FAAALAAGLG LGTALALLAV PAAGRIGSRR
RFQQLAGLPV VAALPAKVPT RTRSKAGSES LRADTAYDVA VARLGSRLQR DFGATRPTVV
LVTSADDRSG KSELARSLAA SAALDGQRVL LVDADPEAMI SRDLRSQAKR GAAEVLRTHS
GLGDALVEGP TGVKILPFDD AALRLGTAAY TGAILTAASA FDTVFVDIGL IGTDIAAERL
AQDQRFPALL LTASAARSGT ARLRRALDAL GRDPRVQLVM TDAEAEG