Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1023 |
Symbol | |
ID | 4270053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1164448 |
End bp | 1166115 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 638125775 |
Product | NADH/ubiquinone/plastoquinone (complex I) |
Protein accession | YP_741866 |
Protein GI | 114320183 |
COG category | [C] Energy production and conversion [P] Inorganic ion transport and metabolism |
COG ID | [COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.057602 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGTGA TTCTGGTCCT GCTCTGGCCG CTGCTGCTGG CCGCCGCTCT GTTCACGGCG TGGCGGCCGA GCGCGTGGCG GTGGGCCCCA TCGGCGCCGG TGCCGGCACT GCTGGCCGCC TTCTGGCCCG GCGGCCACTG GGAGGCCCCG TGGTTGTTGC TGGGCCTGCG CCTGGGGGTG GACCCGCTCG GCGCGCCGTT GCTGCTGCTG GCCGCGGTAG TGTGGCTGCT GGCCGCTCTC GCCGCGCGCC GCGCCATGGC CGGAGATGCC CATCGGGACC GCTTTCTGGC GCTGTTCCTG CTCACCATGG CCGGTAACCT GGGGGTGTTC ATCGCCCTGG ACGCGGCAGG CTTTTACCTG GCCTATGCGT TGATGACCTT CGCCGCCTAC GGCTTGGTGA TCCACGACGG CCGGGCGGTC TCGCGCCGGG CCGGGCGGGT CTATCTGGTG TTGGCGGTGC TGGGTGAGGG GCTGCTGGTG GCGGGCCTGC TGCTGATGGC CGCTGAGTTC GGCAACCCCG ATCTGCGCGG GCTGGGCCCG ATGCTGGCCG GGGCCGCCCA CAAGGACTGG CTGGTGGTGC TGTTCCTGCT CGGCTTCGGG ATCAAGATGG GTATGGTGCC GTTGCACCTC TGGCTGCCGT TGGCCCATCC CTGCGCGCCG GTCCCTGCCA GCGCCGTGCT CTCCGGGGTG ATCGTCAAGG CCGGGCTGAT GGGTTGGCTG CGGTTTCTGC CGCTTGGCCA GGGGGTGACC GACGCCGTCG CCCCGTGGGT GATGGCCATC GGCCTGTTTT CGGCCTTTTT CGCCGTGGTG GTGGGTCTGG CCCAGGAACG GGCCAAGACC GTGCTGGCCT ATTCCACGGT CAGCCAGATG GGGCTGCTCA CCCTGCTGAT CGGTTTGGGA TTGGCGCTGC CGGAGATCCA GTGGACCCTG GCCATGGGGG CGGTCGTGCT GATGGCGTTG CACCATGGAC TGGCCAAGGG GGCGCTGTTC CTGGCCATGG GGGCGGACGC GCGGGCGCGG TTCTGGCTGA TGCTTTGGCC GGCCGCCGCC CTGGCCGGAC TGCCGTTGAC CGCCGGGGCC CTGGGCAAGA AGGCGCTCAA GGATGCCGTC GATATGGCGC CCGGGGTCTG GGCGGAGCTG TTGCTGCCGC TGCTGGCGCT GAGCTCGCTG GCCACGACGC TGCTCATGCT GCGCCTGCTC TGGCTGGTCC GACCGCCGGG TGGTCCCTGG GGGGCGGAGC ACAACGGCGC CGTGGGCGAG CCGGCGCGCG GCCGGGCCCC GGTGCTTGGC CTGGTACTGA TCGGGGTGTT TCTGCCCTGG TTGTGGGCGG GGTGGCAGCT TCCCCAGGCG GCGGTGGCGG CCCTCGGGGT GACGGCACTC TGGGACAGCC TCTGGCCGGC ACTGCTGGGG CTGGGGGCGG GGGGCGCCTG GTGGTATTGG GCGCCGCGCC GCTGGCAGGC CGTGCGCTTG CCGGAAGGGG ACCTGGCCGC CCTGTTGCCC GCCGCGCCTT CGTTGCCCCA GCCGCCCGCC TGGCAACCGT TACTATCGGA CAGCGGTGGC GGCAGCCGGC TGGTGTCCCG GATGGCGTCC GGGTTCAGTC AATTGGCGGT GGCCGGGGCG CTGTTCCTGG CCCTGGTGCT CCTGATGCTG CTGCTCCTGC TGCGCTGA
|
Protein sequence | MLVILVLLWP LLLAAALFTA WRPSAWRWAP SAPVPALLAA FWPGGHWEAP WLLLGLRLGV DPLGAPLLLL AAVVWLLAAL AARRAMAGDA HRDRFLALFL LTMAGNLGVF IALDAAGFYL AYALMTFAAY GLVIHDGRAV SRRAGRVYLV LAVLGEGLLV AGLLLMAAEF GNPDLRGLGP MLAGAAHKDW LVVLFLLGFG IKMGMVPLHL WLPLAHPCAP VPASAVLSGV IVKAGLMGWL RFLPLGQGVT DAVAPWVMAI GLFSAFFAVV VGLAQERAKT VLAYSTVSQM GLLTLLIGLG LALPEIQWTL AMGAVVLMAL HHGLAKGALF LAMGADARAR FWLMLWPAAA LAGLPLTAGA LGKKALKDAV DMAPGVWAEL LLPLLALSSL ATTLLMLRLL WLVRPPGGPW GAEHNGAVGE PARGRAPVLG LVLIGVFLPW LWAGWQLPQA AVAALGVTAL WDSLWPALLG LGAGGAWWYW APRRWQAVRL PEGDLAALLP AAPSLPQPPA WQPLLSDSGG GSRLVSRMAS GFSQLAVAGA LFLALVLLML LLLLR
|
| |