Gene Mpop_0053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpop_0053 
Symbol 
ID6310233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium populi BJ001 
KingdomBacteria 
Replicon accessionNC_010725 
Strand
Start bp56876 
End bp59161 
Gene Length2286 bp 
Protein Length761 aa 
Translation table11 
GC content70% 
IMG OID642648766 
Productaldehyde oxidase and xanthine dehydrogenase molybdopterin binding 
Protein accessionYP_001922784 
Protein GI188579339 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0431346 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCAGG CCGGCGTCGC GAAGGACGGC ATCATCGGGC ATCCGCATGT CCGGATCGAG 
GGCCGTGCCA AGGTCACGGG AGCCGCGCTC TACGCATCCG ATCGCCGGGG TGAAGGCGGG
TCTGAGCCTG ATGCGGCCCA TGCCGCGCTC GTCACCAGCA CGATCGCACG CGGCCGGATC
CTCGGCTTCG ATCTCCCCAC GGCGGCGCAG GCGCCGGATC TCCTGGCGAT CTTCACGCAT
CGTGATTTCG GCGGCGAAAT CGCGTCGGTC CCGCACCTGA TGGCGGGCGG CTACGCCAAT
TCCTCGCATC TCCCACTCGG GTCTGACCGC GTCGCCTATG CGGGTCAGAT CGTGGCCCTC
GTCGTCGGGC GGACGCCGGA GGCGGCGGCT CAGGCGGCCG CACTGGTCCG CGTGCGCTAC
GCACCGGAGC CCGCTGCCGC GAGCTTCGAC GATCCCGGCG CCGAGACCGT GCGGCTCGCC
GATCTCAAGG CTCATCACGA GGATATCGCT GTCGGCGATA CCGAGATCGG CTTTAGGGAG
GCCCCGGCAC GGATCGAGGC GCGTTACACG ACGCCGATCC AGCACCACAA TCCGATCGAG
CTGTTCACCA CCCGGTGCGC CTGGGACGGC GGCCGCCTCA CCGTGCACGA GCCGAGCCGC
TACATCGGCG CGGTTCGTCA CGGGCTCGCA GCACAGCTCG GCCTTGACCC GGCCCAGATC
CGGGTCGTCT CCGGATTGAT CGGCGGGCAT TTCGGTTCGA AGTTCGCGCT CTCGCAGCAC
ACCGCCCCCG TGGCGCTCGC CGCCAAGCGT CTCGGCTGTC CGGTCTCGCT GGTGCCGACG
CGGCGGCAAT GCTTCACCAT CGCCAATTAC CGTCCCGAGA CGCGCCACGA CATCCGACTC
GCCGCCGACC GCGACGGACG GTTCACGAGC CTCGTGCACG AGGCTGAGGT TATCGCCTCA
CGCTTCGATC CCTTCGCGAT GGAGGGCGCG GACGTGACGG CAAGCCTTTA TGCTTGCCCG
ACGATCCGCA CGGAGGAGCG GGCGGTGCGG GTGGATCGCA ACACGCCCGG TCCGATGCGG
GCGCCGCCCG AGGTGCCGTA CCTGTTCGCG TTGGAGAGCG CGGTCGATGA GCTGGCGTGG
CGGCTCGGCC TCGATCCGAT CGAACTGCGC CGCCGCAACG ACACCGCCCG CGATCCGGTC
TCGGGCAAGC CGTTCTCCAG CCGGCCTCTG ATGGCCTGCT TCGATGCCGG GGCCGCTGCC
TTCGACTGGA AGCGACGTGC GCCGCAACCC GGGACGATGC GGGAGGGCGC ATGGCGCGTC
GGCCTCGGCT GCGCCGCATC GGTCCGGCCC GTGAAGATCG CCGCGGCGAC CCTGCGGCTG
CGGCTTTTCG CGGATGGTTC GGCGGAGATC GGATGCGCCC ATCACGAGAT CGGCAACGGC
ATCACCACCC TGTTGGCGAT GGGGGCGGCC GAACGGCTCG GCCTGCCGGT GGAGCGGGTG
CGGGTGCAAC TCGGCGACAC CGATCTGCCT GCGGCCGGCC TTTCGGGGGG ATCGAGTACG
ACGACGAGCC TGATGAACGC CCTCGCCCTC GCCTGCGAGC AGGTGCGGGC GCGGTTGGCG
CGAACGGCCG TCGCGCCGGA CCGTCCGTTC GCCGGGAGGG ATCCGGCCGG CTTTCGTCTG
GCCGACGGCC GGCTCACCGC ATCCGACGGG CGCGGGCTTT CCCTTGCCGA TGCGGTCCGC
TTCATCGACC CGATCCGCGT CGAGACGCTC GCCGAGTTCG TCCCGCCGGG CAGCCCGCCG
GACGCGCTGG AAACCCTGCG CGCCGGTCGT ATCGGCCTGA CGACCGGGGG CGATGCGCTG
AAATGGGCGT TCGGGGCGCA GTTCGCCGAG GTGCATATCC ATGCCGAGAC CGGTGAGATC
CGGGTGCCGC GGCTCACAGG CGCCTTCGCG GCCGGGCGCA TCCTCAATCC GCTGACCGCG
CGCAGCCAGC TTGTCGGCGG CATGATCTGG GGCCTGGGTT CGGCCCTGCT GGAGGAAACC
GTCCTCGACG GCCCCGCCTA CCGCAATCCC GACCTCGCCG AGTATCTCGT GCCGACCGCC
ATGGATGTCG GGGCGATCGA GGCCATCCTC GTGCCCGATC CCGACGAGAC CGTGAATCCG
CTCGGGATCA AGGGGCTGGG CGAACTCGGC ATCATCGGCG TCAACGCGGC GATCGCGAAT
GCGGTCTACC ATGCCGTTGG CCGGCGCATC CGCGAGCTTC CGATCCGCAT CGACGATATG
ATTTAG
 
Protein sequence
MDQAGVAKDG IIGHPHVRIE GRAKVTGAAL YASDRRGEGG SEPDAAHAAL VTSTIARGRI 
LGFDLPTAAQ APDLLAIFTH RDFGGEIASV PHLMAGGYAN SSHLPLGSDR VAYAGQIVAL
VVGRTPEAAA QAAALVRVRY APEPAAASFD DPGAETVRLA DLKAHHEDIA VGDTEIGFRE
APARIEARYT TPIQHHNPIE LFTTRCAWDG GRLTVHEPSR YIGAVRHGLA AQLGLDPAQI
RVVSGLIGGH FGSKFALSQH TAPVALAAKR LGCPVSLVPT RRQCFTIANY RPETRHDIRL
AADRDGRFTS LVHEAEVIAS RFDPFAMEGA DVTASLYACP TIRTEERAVR VDRNTPGPMR
APPEVPYLFA LESAVDELAW RLGLDPIELR RRNDTARDPV SGKPFSSRPL MACFDAGAAA
FDWKRRAPQP GTMREGAWRV GLGCAASVRP VKIAAATLRL RLFADGSAEI GCAHHEIGNG
ITTLLAMGAA ERLGLPVERV RVQLGDTDLP AAGLSGGSST TTSLMNALAL ACEQVRARLA
RTAVAPDRPF AGRDPAGFRL ADGRLTASDG RGLSLADAVR FIDPIRVETL AEFVPPGSPP
DALETLRAGR IGLTTGGDAL KWAFGAQFAE VHIHAETGEI RVPRLTGAFA AGRILNPLTA
RSQLVGGMIW GLGSALLEET VLDGPAYRNP DLAEYLVPTA MDVGAIEAIL VPDPDETVNP
LGIKGLGELG IIGVNAAIAN AVYHAVGRRI RELPIRIDDM I