Gene Mlg_2044 details

Gene Information

Locus tag: Mlg_2044
Symbol:
ID: 4270178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2315215
End bp: 2316216
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 65%
IMG OID: 638126800
Product: cytochrome d ubiquinol oxidase, subunit II
Protein accession: YP_742876
Protein GI: 114321193
COG category: [C] Energy production and conversion
COG ID: [COG1294] Cytochrome bd-type quinol oxidase, subunit 2
TIGRFAM ID: [TIGR00203] cytochrome d oxidase, subunit II (cydB)
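
The start and end coordinates, the gene length, and the protein length listed above are related by simple arithmetic. The sketch below shows that relationship, assuming the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon; the function name `expected_lengths` is hypothetical and not part of any database API.

```python
def expected_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the stop codon (both are assumptions, not stated in the record).
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode an amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = expected_lengths(2315215, 2316216)
print(gene_len, protein_len)  # 1002 333
```

With the coordinates from this record, the result agrees with the listed 1002 bp and 333 aa.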


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATTGA TTGATCTGAC TCTGGTCTGG ATTGCCATTA TCGGGCTGGG CGTGTTCATG 
TACGTCCTGC TCGACGGCTT CGACCTGGGC GTGGGGATTC TCTATCCCTT CGCCCCCGGC
GAGGAGGAGC GTGACCTGAT GATGGACTCG GTGGCCCCGG TGTGGGACGG CAACGAGACC
TGGCTGGTGC TGGGTGGTGC CGGCCTGCTG GGCGCCTTCC CGCTGGTCTA CTCGGTCTTT
CTGCCGGCGC TCTACATCGG TGTCTTCCTG CTTCTGGCGG GCTTGATCTT CCGCGGGGTG
GCGTTCGAGT TCCGGGCCAA GGCGAACCGT TCCCGGCCCT TCTGGAACTG GGCCTTCACC
CTGGGTTCCG GGGTGGCCTC CTTCGCCCAG GGCGCCGTGG TCGGGGCCTA CATCCAGGGT
TTCGAGACCG AAGGCTTCAC CTATGTGGGC GGTCCGCTGG ACTGGCTGAC GCCGTTCGTC
GTGATGACCG GTCTCGGCGT GGTGGCCGGC TACGCGCTGT TGGGCGCCAC CTGGCTGATC
ATGAAGACCG AGGGGGAGCT GCAGGACTGG GCACGCCGGG TCGCCAGCCG TACGCTCCTG
GCCGTGTTGG CCTTCTTCGT CATCGTCAGC CTGTGGACCC CGTTGGCCCA TGCCGAGGTC
ATGAGCCGGT GGCTGGATAA CCTGAACTGG TTGTGGCTCT TCCCGGTGGT GACGCTGCTG
GTCAGCTACT GGTTGTGGCG GGCCTTGCAG CAGCGCCAGG AGGGGACGCC CTTCGTTGCC
GCCATGGCCC TGTTTGCCAC CTTCTACGTG GGTCTGCTGA TCAGCATGTG GCCCTACGCG
GTGCCGCCGC ACCACACCTT CTGGGATGCC TCGTCCAGCC CGGATGCCCA GCTCTTCCTG
CTGGTGGGCA TGCTCCTCTT GCTGCCCGTG GTCATGGGGT ATACCGCCTG GACCTACTGG
GTCTTCCGCG GCAAGGTGCG CCCTGGCGAG GGTTATCACT GA
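
The record reports a GC content of 65% for this gene. A minimal sketch of how such a figure could be recomputed from the blocked sequence above is shown here; the helper name `gc_fraction` is hypothetical, and the function only assumes that the sequence is pasted as text with arbitrary spaces and line breaks.

```python
def gc_fraction(dna):
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (the 10-base blocks and line breaks used in the listing)
    is removed before counting.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Small illustration on a made-up 8-base fragment, not the gene itself.
print(gc_fraction("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence as a single string, then multiplying by 100 and rounding, gives a percentage that can be compared with the reported value.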
 
Protein sequence
MELIDLTLVW IAIIGLGVFM YVLLDGFDLG VGILYPFAPG EEERDLMMDS VAPVWDGNET 
WLVLGGAGLL GAFPLVYSVF LPALYIGVFL LLAGLIFRGV AFEFRAKANR SRPFWNWAFT
LGSGVASFAQ GAVVGAYIQG FETEGFTYVG GPLDWLTPFV VMTGLGVVAG YALLGATWLI
MKTEGELQDW ARRVASRTLL AVLAFFVIVS LWTPLAHAEV MSRWLDNLNW LWLFPVVTLL
VSYWLWRALQ QRQEGTPFVA AMALFATFYV GLLISMWPYA VPPHHTFWDA SSSPDAQLFL
LVGMLLLLPV VMGYTAWTYW VFRGKVRPGE GYH
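
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below illustrates this under the assumption that translation table 11 assigns the same amino acids to internal codons as the standard genetic code (it differs in the permitted start codons); `translate_cds` and the table-building constants are hypothetical names used for illustration only.

```python
# Standard codon assignments, enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored. Alternative start codons (e.g. GTG, TTG) are
    NOT converted to M here; this gene starts with ATG, so that
    simplification does not matter for this record.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases of the gene sequence from this record.
first_row = "ATGGAATTGA TTGATCTGAC TCTGGTCTGG ATTGCCATTA TCGGGCTGGG CGTGTTCATG"
print(translate_cds(first_row))  # MELIDLTLVWIAIIGLGVFM
```

The output matches the first 20 residues of the protein sequence above; the final codons of the gene, `GGT TAT CAC TGA`, translate to `GYH` followed by a stop.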