Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0294 |
Symbol | |
ID | 4269324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 337041 |
End bp | 338621 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638125020 |
Product | cytochrome-c oxidase |
Protein accession | YP_741139 |
Protein GI | 114319456 |
COG category | [C] Energy production and conversion |
COG ID | [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 |
TIGRFAM ID | [TIGR02891] cytochrome c oxidase, subunit I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00647712 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTATCA CGGTCTCCGA ACTGGACTAC GACAACCACA AGCCCCAGGG GTTTCTTGAC CGCTGGGTCC TGACCACCAG TCACAAGGAC ATCGGCACCC TCTACCTGGT CTTCTCTCTG ACCATGTTCT TTGTCGGTGG GGCCATGGCC ATGGTCATCC GCGCCGAGCT GTTTCAGCCC GGCCTGAGCC TGGTGGACCC GCACTTTTTC AACCAGATGA CCACCATGCA CGCCCTGGTG ATGATCTTCG GGGCGGTCAT GCCGGCCTTC GTGGGCTTGG CCAACTGGTT GATCCCGATG CAGATCGGGG CGCCGGACAT GGCGCTGCCG CGGATGAACA ACTGGAGCTT CTGGCTGCTG CCGTTCGCCT TCCTGATGTT GCTGGCAACC CTGATCATGC CCGGCGGCGG CCCCGCTTCG GGGTGGACGC TCTACCCGCC CTTATCGCTG CAGACGGGCA TGGCCCTGCC CTTCCTGATC TTCGCCATCC ACGTGGCCGG GCTGTCCTCC ATCATGGGCG CCATAAACAT CATCGTCACC ATCCTCAACC TGCGGGCGCC GGGGATGACG CTGATGAAGA TGCCCCTGTT CGTCTGGACT TGGCTGATCA CCGCCTTCCT GCTGATCGCG GTGATGCCGG TGCTGGCCGG TGCGGTGACC ATGCTGCTCA CCGACGCCTA CTTCGGCACC AGCTTCTTCT CGGCCGCCGG CGGCGGTGAC CCGGTAATGT ACCAGCACAT CTTCTGGTTC TTCGGGCACC CCGAGGTCTA CATCATGATC CTGCCGGCCT TCGGCATCGT TTCGGCGATC ATCCCGGCCT TCGCCCGCAA GCCGCTGTTC GGCTATGCCT CCATGGTCTA CGCGACCGCC GCCATCGCCT TCCTGTCGTT CATCGTCTGG GCGCACCACA TGTTCACGGT GGGAATGCCC ACGGCCGGTG TGCTGTTCTT CATGTACGCC ACCATGCTGA TCGCGGTGCC CACCGGGGTG AAGGTGTTCA ACTGGGTGGC CACCATGTGG CGGGGCGCCA TGACCTTCGA GACCCCCATG CTGTTCAGCA TCGCCTTCGT GGTGCTGTTC ACCATCGGCG GGCTCTCCGG GATCATGCTG GCCATCGCCC CGGCGGACTT CCAGTACCAC GATACCTACT TTGTGGTGGC GCACTTCCAC TACGTGCTGG TCACCGGCTC GGTGTTTGCC ATCTTCGCCG CCGTCTACTA CTGGATTCCC AAGTGGACGG GGGTGATGTA CGACGAGTTC CTGGGCAAGG TGCACTTCTG GCTGTCGGTG GTCTCGGTGA ACGTGCTCTT CTTCCCGCAG CACTTCCTGG GGCTGGCCGG CATGCCGCGG CGCACCGCCG ATTACGCCCT GCAGTTCGCC GAGTGGAACA TGATCTCCTC CATCGGGGGC TTCGCCTTCG GCGTCAGCCA ACTGCTGTTC CTGTATATCG TCATCAAGTG CATGCGCGGC CAGGGTGAGC CGGCCCCGGC CCGCTCCTGG GAAGGGGCCG AGGGGCTGGA GTGGGAGCTG CCCTCGCCGG CCCCGCACCA CACGTTCTCC AAGCCGCCGG TGATCAAATG A
|
Protein sequence | MSITVSELDY DNHKPQGFLD RWVLTTSHKD IGTLYLVFSL TMFFVGGAMA MVIRAELFQP GLSLVDPHFF NQMTTMHALV MIFGAVMPAF VGLANWLIPM QIGAPDMALP RMNNWSFWLL PFAFLMLLAT LIMPGGGPAS GWTLYPPLSL QTGMALPFLI FAIHVAGLSS IMGAINIIVT ILNLRAPGMT LMKMPLFVWT WLITAFLLIA VMPVLAGAVT MLLTDAYFGT SFFSAAGGGD PVMYQHIFWF FGHPEVYIMI LPAFGIVSAI IPAFARKPLF GYASMVYATA AIAFLSFIVW AHHMFTVGMP TAGVLFFMYA TMLIAVPTGV KVFNWVATMW RGAMTFETPM LFSIAFVVLF TIGGLSGIML AIAPADFQYH DTYFVVAHFH YVLVTGSVFA IFAAVYYWIP KWTGVMYDEF LGKVHFWLSV VSVNVLFFPQ HFLGLAGMPR RTADYALQFA EWNMISSIGG FAFGVSQLLF LYIVIKCMRG QGEPAPARSW EGAEGLEWEL PSPAPHHTFS KPPVIK
|
| |