Gene Information |
Locus tag | Mlg_1564 |
Symbol | |
ID | 4270586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1788275 |
End bp | 1791358 |
Gene Length | 3084 bp |
Protein Length | 1027 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638126321 |
Product | carbon monoxide dehydrogenase, large subunit apoprotein |
Protein accession | YP_742401 |
Protein GI | 114320718 |
COG category | [C] Energy production and conversion; [S] Function unknown |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs; [COG3427] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02416] carbon-monoxide dehydrogenase, large subunit |
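The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single untranslated stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # The final codon is the stop codon and contributes no amino acid.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values copied from the record: Start bp, End bp.
gene_len, protein_len = cds_lengths(1788275, 1791358)
print(gene_len, protein_len)
```

With the values in this record, the result should match the listed Gene Length (3084 bp) and Protein Length (1027 aa).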
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.151361 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACCC CCGCAGAGGA ACTGGACCGC AACGAAAAGC TGGGCGGCAT CGGCTGTTCC CGCAAGCGCA AGGAGGACCC GCGCTTCATC CAGGGTAAGG GCCATTACGT GGACGACATT CAACTGCCGG GCATGGTGTT CGGCGACTTC GTCCGCAGCC CCCACGCCCA CGCCCGCATC AAGGCCATCC ACAAGGACAA GGCCCTGGCC CACCCCGGTG TCCACGCCGT GCTCACCGCC GAGGACCTGG CCCCGCTGAA CCTGCATTGG ATGCCGACCC TGGCCGGCGA CAAACAGATG GTGCTGGCCG ACGGCAAGGT CTGCTTCCAG AACCAGGAAG TGGCCATGGT CATCGCCGAC GACCGCTACA TCGCCGCCGA TGCCCTGGAG CTGGTGGAGG TGGAATACGA GCCGCTGGAG CCGCTGGTGG ACCCGCACCG GGCCATGGAC GATGAGGCCC CGGTGATCCG CGAGGACCTG GCGGGCCAGA GCGAGGGTGC GCACAGCAAG CGCGTGCACC ACAACCACAT CTTCACCTGG GACGTGGGCG ACAAGGCCGC CACCGACAAG CTCTTCGACG AGGCCGAGGT CACCGTCAGC GAGAAGATGC TCTACCAGCG GGTGCACCCC TGCCCACTGG AGACCTGCGG CTGTGTCGCC GATTTCGACA AGGTGAAGGG CGAGCTGACC GTCAACCTCA CCTCCCAGGC GCCCCACGTG GTGCGCACCG TCTTTTCCAT GCTCTCCGGC ATTCCCGAGA GCAAGGTGCA CATCAACGCC CCGGACATCG GCGGCGGTTT CGGCAACAAG GTGGGCGTCT ACCCCGGCTA CGTGGTGGCG ACCGTGGCCT CCATCGTGCT CGGCCGGCCG GTGAAGTGGA TCGAGGACCG CATCGAGAAC CTCTCCACCA CCGCCTTCGC CCGCGACTAC CACATGACCG GCGAACTGGC CGCCACCCGG GACGGCAAGA TCCTGGGCCT GCGCGCCCAC GTGCTCGCCG ATCACGGCGC CTTCGACGCC TGCGCCGACC CCAGCAAGTG GCCCGCCGGC TTTTTCAACA TCTGCACCGG CAGCTATGAC ATCAAGACCG CTTACGCCCG GGTGGACGGG GTTTACACCA ACAAGGCCCC GGGCGGGGTG GCCTACCGCT GCTCCTTCCG GGTCACCGAG GCCTGTTACC TGATCGAGCG CATGATCGAC GTGCTGGCCC AGAAGCTCGA CATGGACAAG GCGGAGATCC GGTTCAAAAA CTTCATCCAG CCCGAGCAGT TCCCCTACCC CTCGGCGCTC GGCTGGGAGT ACGACAGCGG TGACTACCCG CGCGCCCTGC AGCAGGTGCT GGACGCCTGC GACTATCCGG CCCTGCGGCG TGAGCAGAAG GAGCGGCGCG AGCGCGGCGA GATCATGGGC ATCGGCCTGT GCACCTTTAC CGAGATCGTC GGCGCCGGCC CGGGCCGCAA GTGCGATATC CTCGGCGTGG GCATGTTCGA CAGCGCCGAG ATCCGGGTCC ACCCCACCGG CAGCGTGATC GCCCGCATGG GCACCAAGAC CCAGGGTCAG GGCCACGAGA CCACCTACGC CCAGATTATC GCCACCGAGC TGGGCCTGAA CTCCGAGGAC ATCCAGATCG AGGAGGGCAA CACCGACACC GCCCCCTACG GCCTGGGCAC CTACGGTTCG CGCAGCACGC CGGTGGGCGG CGCCGCTACC GCCCGCGCCG CGCGCAAGAT CCGCGACAAG GCGAGAAAGA TCGCCGCCCA CCTGATGGAG GTCAGCGACG AGGACCTGGA GTGGACCGGC 
GAGGGCTTCC GCGTCAAGGG CGTGCCCGAC CAGACCAAAG GCATACAGGA GATCGCCTGG GCCGCCTACA ACAACACCCC GGAGGGGATG GAGCCGGGGC TGGAGGCGGT GGAGTACTAC GATCCGCCCA ACATGACCTA TCCCTTCGGC GCCTACCTCT GCGTGGTGGA CATCGACCGC TACACCGGCG AGACCCGGGT CCGGCGCTTC TACGCCCTGG ACGACTGTGG CACCCGCATC AACCCGATGG TGATCGAGGG CCAGGTCCAC GGCGGACTCA CCGAGGCCTA CGGCGTCGCC CTGGGCCAGG AACTGCCCTA CGACGGCGCC GGCAACATCC AGGGGGCCTC GCTGATGGAT TACTTCCTGC CCACCATGGT GGAGAGCCCG CACTGGGAGA CCGACCATAC CGTCACCCCC TCGCCCCACC ACCCCATCGG GGCCAAGGGC GTGGGCGAGT CCTCCCACGT GGGGGGCATC CCCTGCATCT CCAACGCCGT CAATGACGCT CTGTCCCCGT TCGGCGTCAC CCACGTGGAC ATGCCCCACA ACGCCTACCG CGTCTGGCAG ACACTGCACG CGTTGAAGCT GGACCGCCAC CCGGAGGCCG ACACCGTCGC ACCCTTCCAG CCGAAGGCCC GCCGGCCCAA GGCCGCGGCG ACGGAACGGC CGGCGGAGGC GCCCGCCGGG AAAGCGGCCG GCGCCAAGGG CATGGAGGTT CGGCTGGAGC GGGACTACGG CCTGGATGTC CCCGCCGACC CGGCCTGGAC GCTGATGCAG GACATCCGCG AGGTGGCCGC CTGCATGCCC GGTGCCTCCA TCGTCGAGCA GACCGGCGAG CGCACCTATC TGGGCGAGAT GCGCCTCAAG GTGGGCCCGA TCACCTCGGC CTTCAAGGGC GATATCGAGG TGCTGGACCT GGACCCGACA CGCCAGACCC TGCGCCTGCG CGGTGAGGGC GGCGACACCA AGGGCAGCTC CAGTGCCCGC ATGACGCTGC AGGCGCGCAT CGTCCCGGAG ACGGAGGCAC AGTGCCGGCT GGAGGGGGTC TGCACCATCG AGCTGACCGG AAAGCTGGCC AGTTTCGGTG GGCGGATGCT GGAGAACATC TCCGACCGGT TGCTCTCCCA GTTCGTCGCC AACTTCGAGA ACCGGGTGGC CGCCGGCGGC GAGGGCAGCA AGGCGGAGGC CGCCCGCGAG CGGGTGGCCA GCGGGCCCAA GGAGCTGAGC GCCCTGGCCC TGCTCTGGCA GATGATCAAG AGCTGGTTCG GGGGCCGGCG CTGA
|
Protein sequence | MATPAEELDR NEKLGGIGCS RKRKEDPRFI QGKGHYVDDI QLPGMVFGDF VRSPHAHARI KAIHKDKALA HPGVHAVLTA EDLAPLNLHW MPTLAGDKQM VLADGKVCFQ NQEVAMVIAD DRYIAADALE LVEVEYEPLE PLVDPHRAMD DEAPVIREDL AGQSEGAHSK RVHHNHIFTW DVGDKAATDK LFDEAEVTVS EKMLYQRVHP CPLETCGCVA DFDKVKGELT VNLTSQAPHV VRTVFSMLSG IPESKVHINA PDIGGGFGNK VGVYPGYVVA TVASIVLGRP VKWIEDRIEN LSTTAFARDY HMTGELAATR DGKILGLRAH VLADHGAFDA CADPSKWPAG FFNICTGSYD IKTAYARVDG VYTNKAPGGV AYRCSFRVTE ACYLIERMID VLAQKLDMDK AEIRFKNFIQ PEQFPYPSAL GWEYDSGDYP RALQQVLDAC DYPALRREQK ERRERGEIMG IGLCTFTEIV GAGPGRKCDI LGVGMFDSAE IRVHPTGSVI ARMGTKTQGQ GHETTYAQII ATELGLNSED IQIEEGNTDT APYGLGTYGS RSTPVGGAAT ARAARKIRDK ARKIAAHLME VSDEDLEWTG EGFRVKGVPD QTKGIQEIAW AAYNNTPEGM EPGLEAVEYY DPPNMTYPFG AYLCVVDIDR YTGETRVRRF YALDDCGTRI NPMVIEGQVH GGLTEAYGVA LGQELPYDGA GNIQGASLMD YFLPTMVESP HWETDHTVTP SPHHPIGAKG VGESSHVGGI PCISNAVNDA LSPFGVTHVD MPHNAYRVWQ TLHALKLDRH PEADTVAPFQ PKARRPKAAA TERPAEAPAG KAAGAKGMEV RLERDYGLDV PADPAWTLMQ DIREVAACMP GASIVEQTGE RTYLGEMRLK VGPITSAFKG DIEVLDLDPT RQTLRLRGEG GDTKGSSSAR MTLQARIVPE TEAQCRLEGV CTIELTGKLA SFGGRMLENI SDRLLSQFVA NFENRVAAGG EGSKAEAARE RVASGPKELS ALALLWQMIK SWFGGRR
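The sequences above are displayed in space-separated blocks of ten residues. Before any downstream use, that grouping has to be stripped. The following is a minimal sketch of doing so and of computing a GC percentage for comparison with the GC content field; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first three blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGGCGACCC CCGCAGAGGA ACTGGACCGC"
print(len(clean_sequence(first_blocks)))
print(gc_percent(first_blocks))
```

Passing the complete gene sequence to `gc_percent` would give a figure to compare against the GC content reported in the record; a short prefix such as the one above is not expected to reproduce that value.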