Gene Mchl_4247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_4247 
Symbol 
ID7117481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp4472138 
End bp4473457 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content51% 
IMG OID643526945 
Productlipopolysaccharide biosynthesis protein 
Protein accessionYP_002422951 
Protein GI218532135 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.602565 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0655667 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGATC TTATTATAGC CACTATGTGG CGGAAAAAAC TTAGCGTAGG GCTTGCCATA 
CTCTTCCCAT TATGTGCGGC GGCGGCAGTC CAGCTATACT ATCCGAGGAC ATACGAAGCA
ACGGCATTGC TTCTACTCGC TGACCAACGC CTGGGAAACC AAGCTGCAAA TGAAGGTGCT
GAAGCTTCAG TTCAAAGAAG AGAGCAAATC ATCAACTCAC AGGTTTACAT TGCGGAAAGC
TTTGGCGTTT TGCATCGTGC CATTCAGAAA TACGGAACAA TGCGTCTCCT AGAGTCATGG
GGACAACATG CGGAGCAGGG CGCGACCGGG TGCATGTTCC CCGGCTGTTG GGCGAGCATT
GCCAGTCCGA TTGAGGAAAA AAGTTCTGAG GACAGCATGT CCGACGTAGA ACGGAAGCTG
TACCTGCGAG TTCGGGGTGC ACTCAAAATT CAAGTCGAGC GTGGCACTGA CCTCATGCGC
TTGTCCTTCA GGCACAGGTC GCCGCAGGTT GCTGCAGATT TTCTGAACGA TATTGTTATC
ACTTTTGTAG AGAGGAATGT GGAGCTGTAC GGCAACTCTG GCGTAGCGTC CTTTTTCGAA
GATGAAAGAG CCCGCTATGA GCAAGCGTTG CGCAATGCGC AGAAAGATCT CGATGAATTT
GCTCGCCAGC ACAAAGTTTA CTCTGTCAGG GAGCAGCGCA ATTTGGCCCT CCATAACCGT
AGCACACTCC TGGCTGAGAT AGCAAAAACT ACTGCATCAG TAGCGGAAAA GGAGCTTGTC
GCGAAATCAT TTGCTCGGCA GTTGCTTGAG TTTAAGCCGC TAAGTAGTAA TGCTTCGCTT
CAACGGCTCG CTAGATCAGC GGGCGGCGCT GCGAACGGGC TTCGAGGTCT TTCCTCTCAA
CAGTTCGTTC AAGAGCCTGC GGATCCTCCG TTGCTGCTCG TGCGGGTATT CCAGGATACT
GTGCAAAGCT TGGTAAAGAC AAATGCGGAA ATAGATGGCA GCCGTGCTTT AGGTCTTCAG
CAGAGCGACG AGTTGAAAAA AATCGAACGA GAACTTAGCG GTTTGTCGAT GATTGAGCCT
GAGTTCGACA GGCTTGAGCG CGAGATCGCC GAGAAGCAGC GAAATTTAGA TCTATATTCG
AGGCGTGCAA CAGAGCAGAG GCTCGAGGCT GATTTTAGGG AAAGACGCTT CTTCAACATG
CGGGTCGCGC AGGCTGCGGT GGTACCTCTC AACCCCGTCT TTCCTTTGCC ACACCTCGTC
TTCCCGGCTG CACTTTTGGT GTCTCTTACT TTAGTGGCAG CAATCGTCTT TTTTGGCTGA
 
Protein sequence
MNDLIIATMW RKKLSVGLAI LFPLCAAAAV QLYYPRTYEA TALLLLADQR LGNQAANEGA 
EASVQRREQI INSQVYIAES FGVLHRAIQK YGTMRLLESW GQHAEQGATG CMFPGCWASI
ASPIEEKSSE DSMSDVERKL YLRVRGALKI QVERGTDLMR LSFRHRSPQV AADFLNDIVI
TFVERNVELY GNSGVASFFE DERARYEQAL RNAQKDLDEF ARQHKVYSVR EQRNLALHNR
STLLAEIAKT TASVAEKELV AKSFARQLLE FKPLSSNASL QRLARSAGGA ANGLRGLSSQ
QFVQEPADPP LLLVRVFQDT VQSLVKTNAE IDGSRALGLQ QSDELKKIER ELSGLSMIEP
EFDRLEREIA EKQRNLDLYS RRATEQRLEA DFRERRFFNM RVAQAAVVPL NPVFPLPHLV
FPAALLVSLT LVAAIVFFG