Gene Mlg_1564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1564 
Symbol 
ID4270586 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1788275 
End bp1791358 
Gene Length3084 bp 
Protein Length1027 aa 
Translation table11 
GC content68% 
IMG OID638126321 
Productcarbon monoxide dehydrogenase, large subunit apoprotein 
Protein accessionYP_742401 
Protein GI114320718 
COG category[C] Energy production and conversion
[S] Function unknown 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
[COG3427] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02416] carbon-monoxide dehydrogenase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.151361 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACCC CCGCAGAGGA ACTGGACCGC AACGAAAAGC TGGGCGGCAT CGGCTGTTCC 
CGCAAGCGCA AGGAGGACCC GCGCTTCATC CAGGGTAAGG GCCATTACGT GGACGACATT
CAACTGCCGG GCATGGTGTT CGGCGACTTC GTCCGCAGCC CCCACGCCCA CGCCCGCATC
AAGGCCATCC ACAAGGACAA GGCCCTGGCC CACCCCGGTG TCCACGCCGT GCTCACCGCC
GAGGACCTGG CCCCGCTGAA CCTGCATTGG ATGCCGACCC TGGCCGGCGA CAAACAGATG
GTGCTGGCCG ACGGCAAGGT CTGCTTCCAG AACCAGGAAG TGGCCATGGT CATCGCCGAC
GACCGCTACA TCGCCGCCGA TGCCCTGGAG CTGGTGGAGG TGGAATACGA GCCGCTGGAG
CCGCTGGTGG ACCCGCACCG GGCCATGGAC GATGAGGCCC CGGTGATCCG CGAGGACCTG
GCGGGCCAGA GCGAGGGTGC GCACAGCAAG CGCGTGCACC ACAACCACAT CTTCACCTGG
GACGTGGGCG ACAAGGCCGC CACCGACAAG CTCTTCGACG AGGCCGAGGT CACCGTCAGC
GAGAAGATGC TCTACCAGCG GGTGCACCCC TGCCCACTGG AGACCTGCGG CTGTGTCGCC
GATTTCGACA AGGTGAAGGG CGAGCTGACC GTCAACCTCA CCTCCCAGGC GCCCCACGTG
GTGCGCACCG TCTTTTCCAT GCTCTCCGGC ATTCCCGAGA GCAAGGTGCA CATCAACGCC
CCGGACATCG GCGGCGGTTT CGGCAACAAG GTGGGCGTCT ACCCCGGCTA CGTGGTGGCG
ACCGTGGCCT CCATCGTGCT CGGCCGGCCG GTGAAGTGGA TCGAGGACCG CATCGAGAAC
CTCTCCACCA CCGCCTTCGC CCGCGACTAC CACATGACCG GCGAACTGGC CGCCACCCGG
GACGGCAAGA TCCTGGGCCT GCGCGCCCAC GTGCTCGCCG ATCACGGCGC CTTCGACGCC
TGCGCCGACC CCAGCAAGTG GCCCGCCGGC TTTTTCAACA TCTGCACCGG CAGCTATGAC
ATCAAGACCG CTTACGCCCG GGTGGACGGG GTTTACACCA ACAAGGCCCC GGGCGGGGTG
GCCTACCGCT GCTCCTTCCG GGTCACCGAG GCCTGTTACC TGATCGAGCG CATGATCGAC
GTGCTGGCCC AGAAGCTCGA CATGGACAAG GCGGAGATCC GGTTCAAAAA CTTCATCCAG
CCCGAGCAGT TCCCCTACCC CTCGGCGCTC GGCTGGGAGT ACGACAGCGG TGACTACCCG
CGCGCCCTGC AGCAGGTGCT GGACGCCTGC GACTATCCGG CCCTGCGGCG TGAGCAGAAG
GAGCGGCGCG AGCGCGGCGA GATCATGGGC ATCGGCCTGT GCACCTTTAC CGAGATCGTC
GGCGCCGGCC CGGGCCGCAA GTGCGATATC CTCGGCGTGG GCATGTTCGA CAGCGCCGAG
ATCCGGGTCC ACCCCACCGG CAGCGTGATC GCCCGCATGG GCACCAAGAC CCAGGGTCAG
GGCCACGAGA CCACCTACGC CCAGATTATC GCCACCGAGC TGGGCCTGAA CTCCGAGGAC
ATCCAGATCG AGGAGGGCAA CACCGACACC GCCCCCTACG GCCTGGGCAC CTACGGTTCG
CGCAGCACGC CGGTGGGCGG CGCCGCTACC GCCCGCGCCG CGCGCAAGAT CCGCGACAAG
GCGAGAAAGA TCGCCGCCCA CCTGATGGAG GTCAGCGACG AGGACCTGGA GTGGACCGGC
GAGGGCTTCC GCGTCAAGGG CGTGCCCGAC CAGACCAAAG GCATACAGGA GATCGCCTGG
GCCGCCTACA ACAACACCCC GGAGGGGATG GAGCCGGGGC TGGAGGCGGT GGAGTACTAC
GATCCGCCCA ACATGACCTA TCCCTTCGGC GCCTACCTCT GCGTGGTGGA CATCGACCGC
TACACCGGCG AGACCCGGGT CCGGCGCTTC TACGCCCTGG ACGACTGTGG CACCCGCATC
AACCCGATGG TGATCGAGGG CCAGGTCCAC GGCGGACTCA CCGAGGCCTA CGGCGTCGCC
CTGGGCCAGG AACTGCCCTA CGACGGCGCC GGCAACATCC AGGGGGCCTC GCTGATGGAT
TACTTCCTGC CCACCATGGT GGAGAGCCCG CACTGGGAGA CCGACCATAC CGTCACCCCC
TCGCCCCACC ACCCCATCGG GGCCAAGGGC GTGGGCGAGT CCTCCCACGT GGGGGGCATC
CCCTGCATCT CCAACGCCGT CAATGACGCT CTGTCCCCGT TCGGCGTCAC CCACGTGGAC
ATGCCCCACA ACGCCTACCG CGTCTGGCAG ACACTGCACG CGTTGAAGCT GGACCGCCAC
CCGGAGGCCG ACACCGTCGC ACCCTTCCAG CCGAAGGCCC GCCGGCCCAA GGCCGCGGCG
ACGGAACGGC CGGCGGAGGC GCCCGCCGGG AAAGCGGCCG GCGCCAAGGG CATGGAGGTT
CGGCTGGAGC GGGACTACGG CCTGGATGTC CCCGCCGACC CGGCCTGGAC GCTGATGCAG
GACATCCGCG AGGTGGCCGC CTGCATGCCC GGTGCCTCCA TCGTCGAGCA GACCGGCGAG
CGCACCTATC TGGGCGAGAT GCGCCTCAAG GTGGGCCCGA TCACCTCGGC CTTCAAGGGC
GATATCGAGG TGCTGGACCT GGACCCGACA CGCCAGACCC TGCGCCTGCG CGGTGAGGGC
GGCGACACCA AGGGCAGCTC CAGTGCCCGC ATGACGCTGC AGGCGCGCAT CGTCCCGGAG
ACGGAGGCAC AGTGCCGGCT GGAGGGGGTC TGCACCATCG AGCTGACCGG AAAGCTGGCC
AGTTTCGGTG GGCGGATGCT GGAGAACATC TCCGACCGGT TGCTCTCCCA GTTCGTCGCC
AACTTCGAGA ACCGGGTGGC CGCCGGCGGC GAGGGCAGCA AGGCGGAGGC CGCCCGCGAG
CGGGTGGCCA GCGGGCCCAA GGAGCTGAGC GCCCTGGCCC TGCTCTGGCA GATGATCAAG
AGCTGGTTCG GGGGCCGGCG CTGA
 
Protein sequence
MATPAEELDR NEKLGGIGCS RKRKEDPRFI QGKGHYVDDI QLPGMVFGDF VRSPHAHARI 
KAIHKDKALA HPGVHAVLTA EDLAPLNLHW MPTLAGDKQM VLADGKVCFQ NQEVAMVIAD
DRYIAADALE LVEVEYEPLE PLVDPHRAMD DEAPVIREDL AGQSEGAHSK RVHHNHIFTW
DVGDKAATDK LFDEAEVTVS EKMLYQRVHP CPLETCGCVA DFDKVKGELT VNLTSQAPHV
VRTVFSMLSG IPESKVHINA PDIGGGFGNK VGVYPGYVVA TVASIVLGRP VKWIEDRIEN
LSTTAFARDY HMTGELAATR DGKILGLRAH VLADHGAFDA CADPSKWPAG FFNICTGSYD
IKTAYARVDG VYTNKAPGGV AYRCSFRVTE ACYLIERMID VLAQKLDMDK AEIRFKNFIQ
PEQFPYPSAL GWEYDSGDYP RALQQVLDAC DYPALRREQK ERRERGEIMG IGLCTFTEIV
GAGPGRKCDI LGVGMFDSAE IRVHPTGSVI ARMGTKTQGQ GHETTYAQII ATELGLNSED
IQIEEGNTDT APYGLGTYGS RSTPVGGAAT ARAARKIRDK ARKIAAHLME VSDEDLEWTG
EGFRVKGVPD QTKGIQEIAW AAYNNTPEGM EPGLEAVEYY DPPNMTYPFG AYLCVVDIDR
YTGETRVRRF YALDDCGTRI NPMVIEGQVH GGLTEAYGVA LGQELPYDGA GNIQGASLMD
YFLPTMVESP HWETDHTVTP SPHHPIGAKG VGESSHVGGI PCISNAVNDA LSPFGVTHVD
MPHNAYRVWQ TLHALKLDRH PEADTVAPFQ PKARRPKAAA TERPAEAPAG KAAGAKGMEV
RLERDYGLDV PADPAWTLMQ DIREVAACMP GASIVEQTGE RTYLGEMRLK VGPITSAFKG
DIEVLDLDPT RQTLRLRGEG GDTKGSSSAR MTLQARIVPE TEAQCRLEGV CTIELTGKLA
SFGGRMLENI SDRLLSQFVA NFENRVAAGG EGSKAEAARE RVASGPKELS ALALLWQMIK
SWFGGRR