Gene Information |
Locus tag | Mext_1728 |
Symbol | |
ID | 5832501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 1949559 |
End bp | 1952339 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641367527 |
Product | aldehyde oxidase and xanthine dehydrogenase molybdopterin binding |
Protein accession | YP_001639198 |
Protein GI | 163851155 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs; [COG2080] Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs |
TIGRFAM ID | |
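The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The record does not state either convention. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1949559
END_BP = 1952339
GENE_LENGTH_BP = 2781
PROTEIN_LENGTH_AA = 926


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming the gene length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length (bp):", span_length(START_BP, END_BP))
    print("coding codons (aa):", coding_codons(GENE_LENGTH_BP))
```

Under these assumptions, the computed span matches the listed gene length, and the codon count matches the listed protein length.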
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCTCA CCGTCAACGG GCAGGTTCAG GAGGCTGCAG CCCGGCCCGG CCAGTGCCTG CGCACGCTGC TGCGGGATCT CGGCTGGTTC GGGGTCAAGA AGGGCTGCGA TGCGGGCGAT TGCGGCGCCT GCACCGTCCA TCTCGACGGC GAGCCGGTCC ATTCCTGCCT GCTGCCCGCC TTCCAGGCGG AGGGGCGCGG CGTCACCACG ATCGAGGGGC TCGCCGGCCC CTGCGCGCCG GGCGACGGGC CACCTGAACA TCTCCACCCG ATGCAGGACG CGTTCTGCGC CGCGCAGGGT TTCCAGTGCG GCTTCTGCAC GCCAGGCATG ATCATGACGG CGGCCGCCCT CGACCAGGGC CAGCGGCAGG ATCTCGGCAC GGCGCTCAAG GGCAATCTCT GCCGCTGCAC CGGCTACCGG GCGATCCGCG ACGCCGTGGC CGGGATCGCC CATGCCGACG CTTCGCAGAC GGCGGAGGGC AACCCGGTCG GCCGCAGCCT ACCGGCCCCG GCGAGCCGCG CCCTCGTCTC CGGCCGGGCG GCCTACACCT TCGACACCGC CGTGCCGGGT CTGCTGCACC TGAAGGTGCT GCGTGCGCCT CACGCCCATG CCCGGATCCG CAGCATCGAC CGGGCGGCGG CGCTGGCCAT GCCCGGCGTG GTCGCGGTGC TCACCCACGA GGACGCGCCG CGCCGCCGCT TCTCGACCGG GCGGCACGAG AACCCTTTCG ATGACGCCGC CGACACCGGC GTGCTCGATT CCGTGATCCG CTTCCACGGC CAGCGCGTCG CGGCGGTGGT CGCCGAGAGC GCCGCGGCGG CGACCATGGG CGTGCAAGCG CTGCGGGTGG AATACGACGT GCGGCCCGCC GTGTTCGATC CGGAGGCGGC CCTGCACCCG GACGCGCCCC TCGTCCACGA TCCGGCCGAC CGCGACCCGC CGCGGCCCGG CGACGATGCG CCACCGCTCC TCGCGCACCC GAACCTCGCC GCCGAGGCGC ACGGGGCGAT CGGCAATGTC GAGGCGGGCT TCGGCCAGGC CGACCGGATC CACGAGGCGG AATACGTCTC GCAGCGGGTG CAGCACGTGC ATCTCGAAAC GCACGGCGCG CTCGGCTGGC TCGATGCGGA GGGCCGGCTG ACCCTACGCT CCTCGACGCA GGTGCCCTTC CTGACCCGCG ACGCCCTGTG CCGCCTGTTC GATCTCGACC GGGACCGGGT GCGCGTGCTC TGCGGCCGGG TCGGCGGCGG CTTCGGCGGC AAGCAGGAGA TGCTGACCGA GGATCTCGTG GCGCTCGCCG TGCTGCGGAC CGGCCGCCCG GTCTCCTACG AGATGACCCG CGAGGAGAAC TTTTGCGCGG CGACGACGCG CCATCCGATG CGGGTGCGGG TGAAGATCGG CGCGCGGGCC GACGGGCGCC TCACCGCGCT TTCGCTGAAA GTGCTCTCGA ACACCGGCGC CTACGGCAAC CATGCCGGCG GCGTGCTGCA CCACGGCTGC AACGAGAGCA TCGCCGCCTA TGCCTGCCCG AACAAGCGGG TGGAGGGCTA CGCCGTCTAC ACTCATACCC TGCCGGCCGG GGCCTTTCGC GGGTACGGCC TGAGCCAGAC CATCTTCGCC GTGGAATCGG CGCTGGACGA ACTCGCCCGC GATCTCGGGA TCGACCCGTT CGCGATGCGC CGCCTCAACG CGGTGCGGCC CGGCGACCCG ATGGTGTCGA CGAGCCTGGA GCCGCACGAC GTCGTCTACG GCTCCTACGG GCTCGACCAA TGCCTCGACC GCACCCAGGC CGCGCTGGCG 
GACGGCAGCG GCGAGGCGGC GCCGGGGCCG GACTGGCGCG TCGGCGAGGG CATGGCGATG GCGATGATCG ACACGATCCC ACCGCGCGGC CACGTCGCCC ATGCCCGCAT CCGCCTCGAA CCGGACGGGA CCTATGCCCT CGCGGTCGGC ACCGCCGAGT TCGGCAACGG CACCAGCACC GTGCACGGGC AGATCGCCGC CGAGGTGCTC GGCACCACGC CCGAACGCAT CCGCCTGATC CAGGCCGACA CCGATGCTGT CCGGCACGAT ACCGGCGCCT ATGGCAGCAC GGGCACCGTG GTCGCCGGAC AGGCGAATTT CCGCGCGGCC ACGGCGCTCG CCGATCAGCT TCGCGCCGCC GCCGCCGAGC GGGCTGGCGT CGCCCCGGCC GATTGCCGCC TGACTCGCGA CGGCGTCGCG ACGCCCGCCG GGCTCGTCTC CCTGATCACG CTTGCGCAAG GGGCCATCTT CGAAGCCGAG GGCGAGGCCG ACGGGACGCC GCGCTCGGTC GCCTTCAACG TCCAGGCCTT TCGCGTGGCC GTCCATCCGC AGACCGGCGA GATCCGCATC CTCAAGAGCA TTCAGGCGGC GGATGCCGGC CGGGTCATCA ACCCGATGCA GTGCCGCGGG CAGATCGAGG GCGGGGTGGC GCAGGCGCTC GGCGCGGCCC TGCACGAGGA CTATCGCTTC GACGAAACGG GCGCGGTCGT CACGCGGACC TTGCGCAACT ACCACATCCC GGCGATGGCC GACGTGCCGG TGACCGAGGT TCTGTTCGCC GACACCCACG ACACGGTCGG TCCACTGGGC GCGAAATCGA TGAGCGAGGC GCCCTACAAC CCCGTCGCCG CCGCCCTGGG CAACGCGATC CGCGACGCCA CCGGCGTACG GCTGACCGCG ACGCCGTTCG CCCCCGACCG GATCTTTCGA CAGGTCATGG CGGCGCAAGA GAACGAGGAA CAGGAGACGG CGGACGCATG A
|
Protein sequence | MRLTVNGQVQ EAAARPGQCL RTLLRDLGWF GVKKGCDAGD CGACTVHLDG EPVHSCLLPA FQAEGRGVTT IEGLAGPCAP GDGPPEHLHP MQDAFCAAQG FQCGFCTPGM IMTAAALDQG QRQDLGTALK GNLCRCTGYR AIRDAVAGIA HADASQTAEG NPVGRSLPAP ASRALVSGRA AYTFDTAVPG LLHLKVLRAP HAHARIRSID RAAALAMPGV VAVLTHEDAP RRRFSTGRHE NPFDDAADTG VLDSVIRFHG QRVAAVVAES AAAATMGVQA LRVEYDVRPA VFDPEAALHP DAPLVHDPAD RDPPRPGDDA PPLLAHPNLA AEAHGAIGNV EAGFGQADRI HEAEYVSQRV QHVHLETHGA LGWLDAEGRL TLRSSTQVPF LTRDALCRLF DLDRDRVRVL CGRVGGGFGG KQEMLTEDLV ALAVLRTGRP VSYEMTREEN FCAATTRHPM RVRVKIGARA DGRLTALSLK VLSNTGAYGN HAGGVLHHGC NESIAAYACP NKRVEGYAVY THTLPAGAFR GYGLSQTIFA VESALDELAR DLGIDPFAMR RLNAVRPGDP MVSTSLEPHD VVYGSYGLDQ CLDRTQAALA DGSGEAAPGP DWRVGEGMAM AMIDTIPPRG HVAHARIRLE PDGTYALAVG TAEFGNGTST VHGQIAAEVL GTTPERIRLI QADTDAVRHD TGAYGSTGTV VAGQANFRAA TALADQLRAA AAERAGVAPA DCRLTRDGVA TPAGLVSLIT LAQGAIFEAE GEADGTPRSV AFNVQAFRVA VHPQTGEIRI LKSIQAADAG RVINPMQCRG QIEGGVAQAL GAALHEDYRF DETGAVVTRT LRNYHIPAMA DVPVTEVLFA DTHDTVGPLG AKSMSEAPYN PVAAALGNAI RDATGVRLTA TPFAPDRIFR QVMAAQENEE QETADA
|
| |
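The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch for stripping that layout before analysis, plus two basic sanity checks, follows. The helper names are hypothetical, and the start-codon set lists only commonly used bacterial start codons, which is an assumption and not the full translation table 11 definition. The short strings in the usage section are toy inputs, not the full gene.

```python
# Commonly used bacterial start codons (assumption: not the complete
# set permitted by translation table 11).
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize(seq_text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already normalized sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, start codon first, stop codon last."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    # Toy input in the same ten-character block layout as the record.
    first_blocks = normalize("ATGAGGCTCA CCGTCAACGG")
    print(first_blocks, len(first_blocks))
    print(gc_percent("ATGC"))
    print(looks_like_cds("ATGGCATGA"))
```

To apply this to the record, pass the full text of the Gene sequence field through `normalize` first; the normalized length can then be compared with the listed gene length, and `gc_percent` with the listed GC content.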