Gene Mext_1728 details

Gene Information

Locus tag: Mext_1728
Symbol:
ID: 5832501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 1949559
End bp: 1952339
Gene Length: 2781 bp
Protein Length: 926 aa
Translation table: 11
GC content: 72%
IMG OID: 641367527
Product: aldehyde oxidase and xanthine dehydrogenase molybdopterin binding
Protein accession: YP_001639198
Protein GI: 163851155
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
        [COG2080] Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs
TIGRFAM ID:
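
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 1949559
end_bp = 1952339

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2781
print(protein_length)  # 926
```

Under these assumptions the computed values agree with the listed Gene Length (2781 bp) and Protein Length (926 aa).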


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCTCA CCGTCAACGG GCAGGTTCAG GAGGCTGCAG CCCGGCCCGG CCAGTGCCTG 
CGCACGCTGC TGCGGGATCT CGGCTGGTTC GGGGTCAAGA AGGGCTGCGA TGCGGGCGAT
TGCGGCGCCT GCACCGTCCA TCTCGACGGC GAGCCGGTCC ATTCCTGCCT GCTGCCCGCC
TTCCAGGCGG AGGGGCGCGG CGTCACCACG ATCGAGGGGC TCGCCGGCCC CTGCGCGCCG
GGCGACGGGC CACCTGAACA TCTCCACCCG ATGCAGGACG CGTTCTGCGC CGCGCAGGGT
TTCCAGTGCG GCTTCTGCAC GCCAGGCATG ATCATGACGG CGGCCGCCCT CGACCAGGGC
CAGCGGCAGG ATCTCGGCAC GGCGCTCAAG GGCAATCTCT GCCGCTGCAC CGGCTACCGG
GCGATCCGCG ACGCCGTGGC CGGGATCGCC CATGCCGACG CTTCGCAGAC GGCGGAGGGC
AACCCGGTCG GCCGCAGCCT ACCGGCCCCG GCGAGCCGCG CCCTCGTCTC CGGCCGGGCG
GCCTACACCT TCGACACCGC CGTGCCGGGT CTGCTGCACC TGAAGGTGCT GCGTGCGCCT
CACGCCCATG CCCGGATCCG CAGCATCGAC CGGGCGGCGG CGCTGGCCAT GCCCGGCGTG
GTCGCGGTGC TCACCCACGA GGACGCGCCG CGCCGCCGCT TCTCGACCGG GCGGCACGAG
AACCCTTTCG ATGACGCCGC CGACACCGGC GTGCTCGATT CCGTGATCCG CTTCCACGGC
CAGCGCGTCG CGGCGGTGGT CGCCGAGAGC GCCGCGGCGG CGACCATGGG CGTGCAAGCG
CTGCGGGTGG AATACGACGT GCGGCCCGCC GTGTTCGATC CGGAGGCGGC CCTGCACCCG
GACGCGCCCC TCGTCCACGA TCCGGCCGAC CGCGACCCGC CGCGGCCCGG CGACGATGCG
CCACCGCTCC TCGCGCACCC GAACCTCGCC GCCGAGGCGC ACGGGGCGAT CGGCAATGTC
GAGGCGGGCT TCGGCCAGGC CGACCGGATC CACGAGGCGG AATACGTCTC GCAGCGGGTG
CAGCACGTGC ATCTCGAAAC GCACGGCGCG CTCGGCTGGC TCGATGCGGA GGGCCGGCTG
ACCCTACGCT CCTCGACGCA GGTGCCCTTC CTGACCCGCG ACGCCCTGTG CCGCCTGTTC
GATCTCGACC GGGACCGGGT GCGCGTGCTC TGCGGCCGGG TCGGCGGCGG CTTCGGCGGC
AAGCAGGAGA TGCTGACCGA GGATCTCGTG GCGCTCGCCG TGCTGCGGAC CGGCCGCCCG
GTCTCCTACG AGATGACCCG CGAGGAGAAC TTTTGCGCGG CGACGACGCG CCATCCGATG
CGGGTGCGGG TGAAGATCGG CGCGCGGGCC GACGGGCGCC TCACCGCGCT TTCGCTGAAA
GTGCTCTCGA ACACCGGCGC CTACGGCAAC CATGCCGGCG GCGTGCTGCA CCACGGCTGC
AACGAGAGCA TCGCCGCCTA TGCCTGCCCG AACAAGCGGG TGGAGGGCTA CGCCGTCTAC
ACTCATACCC TGCCGGCCGG GGCCTTTCGC GGGTACGGCC TGAGCCAGAC CATCTTCGCC
GTGGAATCGG CGCTGGACGA ACTCGCCCGC GATCTCGGGA TCGACCCGTT CGCGATGCGC
CGCCTCAACG CGGTGCGGCC CGGCGACCCG ATGGTGTCGA CGAGCCTGGA GCCGCACGAC
GTCGTCTACG GCTCCTACGG GCTCGACCAA TGCCTCGACC GCACCCAGGC CGCGCTGGCG
GACGGCAGCG GCGAGGCGGC GCCGGGGCCG GACTGGCGCG TCGGCGAGGG CATGGCGATG
GCGATGATCG ACACGATCCC ACCGCGCGGC CACGTCGCCC ATGCCCGCAT CCGCCTCGAA
CCGGACGGGA CCTATGCCCT CGCGGTCGGC ACCGCCGAGT TCGGCAACGG CACCAGCACC
GTGCACGGGC AGATCGCCGC CGAGGTGCTC GGCACCACGC CCGAACGCAT CCGCCTGATC
CAGGCCGACA CCGATGCTGT CCGGCACGAT ACCGGCGCCT ATGGCAGCAC GGGCACCGTG
GTCGCCGGAC AGGCGAATTT CCGCGCGGCC ACGGCGCTCG CCGATCAGCT TCGCGCCGCC
GCCGCCGAGC GGGCTGGCGT CGCCCCGGCC GATTGCCGCC TGACTCGCGA CGGCGTCGCG
ACGCCCGCCG GGCTCGTCTC CCTGATCACG CTTGCGCAAG GGGCCATCTT CGAAGCCGAG
GGCGAGGCCG ACGGGACGCC GCGCTCGGTC GCCTTCAACG TCCAGGCCTT TCGCGTGGCC
GTCCATCCGC AGACCGGCGA GATCCGCATC CTCAAGAGCA TTCAGGCGGC GGATGCCGGC
CGGGTCATCA ACCCGATGCA GTGCCGCGGG CAGATCGAGG GCGGGGTGGC GCAGGCGCTC
GGCGCGGCCC TGCACGAGGA CTATCGCTTC GACGAAACGG GCGCGGTCGT CACGCGGACC
TTGCGCAACT ACCACATCCC GGCGATGGCC GACGTGCCGG TGACCGAGGT TCTGTTCGCC
GACACCCACG ACACGGTCGG TCCACTGGGC GCGAAATCGA TGAGCGAGGC GCCCTACAAC
CCCGTCGCCG CCGCCCTGGG CAACGCGATC CGCGACGCCA CCGGCGTACG GCTGACCGCG
ACGCCGTTCG CCCCCGACCG GATCTTTCGA CAGGTCATGG CGGCGCAAGA GAACGAGGAA
CAGGAGACGG CGGACGCATG A
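
The sequence is laid out in space-separated blocks of 10 bases, so the whitespace has to be stripped before any computation. As a minimal sketch, the hypothetical helper below (`gc_percent` is not part of the record) computes a GC percentage from text in this layout. How the database derives the listed "GC content" value is not stated here, so exact agreement with that figure is an assumption.

```python
def gc_percent(seq_text: str) -> float:
    """Return the percentage of G and C bases in a block-formatted sequence."""
    # Remove the spaces and line breaks used by the 10-base block layout.
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Small illustrative input, not taken from the gene: 4 of 8 bases are G or C.
print(gc_percent("GGCC AATT"))  # 50.0
```

Passing the full gene sequence text to `gc_percent` gives a value that can be compared with the GC content field.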
 
Protein sequence
MRLTVNGQVQ EAAARPGQCL RTLLRDLGWF GVKKGCDAGD CGACTVHLDG EPVHSCLLPA 
FQAEGRGVTT IEGLAGPCAP GDGPPEHLHP MQDAFCAAQG FQCGFCTPGM IMTAAALDQG
QRQDLGTALK GNLCRCTGYR AIRDAVAGIA HADASQTAEG NPVGRSLPAP ASRALVSGRA
AYTFDTAVPG LLHLKVLRAP HAHARIRSID RAAALAMPGV VAVLTHEDAP RRRFSTGRHE
NPFDDAADTG VLDSVIRFHG QRVAAVVAES AAAATMGVQA LRVEYDVRPA VFDPEAALHP
DAPLVHDPAD RDPPRPGDDA PPLLAHPNLA AEAHGAIGNV EAGFGQADRI HEAEYVSQRV
QHVHLETHGA LGWLDAEGRL TLRSSTQVPF LTRDALCRLF DLDRDRVRVL CGRVGGGFGG
KQEMLTEDLV ALAVLRTGRP VSYEMTREEN FCAATTRHPM RVRVKIGARA DGRLTALSLK
VLSNTGAYGN HAGGVLHHGC NESIAAYACP NKRVEGYAVY THTLPAGAFR GYGLSQTIFA
VESALDELAR DLGIDPFAMR RLNAVRPGDP MVSTSLEPHD VVYGSYGLDQ CLDRTQAALA
DGSGEAAPGP DWRVGEGMAM AMIDTIPPRG HVAHARIRLE PDGTYALAVG TAEFGNGTST
VHGQIAAEVL GTTPERIRLI QADTDAVRHD TGAYGSTGTV VAGQANFRAA TALADQLRAA
AAERAGVAPA DCRLTRDGVA TPAGLVSLIT LAQGAIFEAE GEADGTPRSV AFNVQAFRVA
VHPQTGEIRI LKSIQAADAG RVINPMQCRG QIEGGVAQAL GAALHEDYRF DETGAVVTRT
LRNYHIPAMA DVPVTEVLFA DTHDTVGPLG AKSMSEAPYN PVAAALGNAI RDATGVRLTA
TPFAPDRIFR QVMAAQENEE QETADA
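
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is a minimal, dependency-free illustration: the names `CODON_TABLE` and `translate` are hypothetical, and it relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which this sketch does not handle; this gene starts with ATG).

```python
from itertools import product

# Amino acid assignments in TCAG codon order; identical for the standard
# code and translation table 11. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    dna = "".join(seq_text.split()).upper()  # strip the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGGCTCA CCGTCAACGG GCAGGTTCAG"))  # MRLTVNGQVQ
```

The output matches the first block of the protein sequence; the final line of the gene sequence translates to QETADA followed by a TGA stop codon.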