Gene Information |
Locus tag | Mlg_1372 |
Symbol | |
ID | 4268135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1571137 |
End bp | 1572573 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638126128 |
Product | oxygen-independent coproporphyrinogen III oxidase |
Protein accession | YP_742211 |
Protein GI | 114320528 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR00538] oxygen-independent coproporphyrinogen III oxidase |
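
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1571137, 1572573)
print(length_bp, protein_length(length_bp))  # prints "1437 478"
```

Under these assumptions the result agrees with the Gene Length (1437 bp) and Protein Length (478 aa) fields.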
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.292242 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGTGC CACACATCCC CAATCTCGGC ATTGTGCGAT TTGTCATGGA CCAAACACTG CAATTCGACC CGGCCATCCT GGCCAAGTAT GACGTCAGCG GCCCCCGGTA CACCTCTTAC CCCACGGCTC CGCAGTTCCA CGAGGGCTTC GACGACAAGG CCTATGCCGA GGTGGCCGCG CTCAGCAACG AGGATCCCAT CCCGCGGCCG CTTTCGCTGT ACGTGCACGT CCCCTTCTGC GACACCGTCT GCTTCTACTG TGCGTGCAAC AAGATCATCA CCGGTAACTA CTCCCGCGCC GGTACCTACC TGGACTACCT CGAGAAAGAG GTTGCCCTGC AGAGCCAGCT GTTCGACGCC GACCGGCGGG TGGAGCAACT GCACTTCGGG GGCGGCACCC CGACCTACCT CAGTGACGAG GATCTCATCC GGGTAATGGA CATGCTCTCC CGCTATTTCA CCCTCGAGCG GGGGCCGCAA CGGGAGTTCT CTATCGAGAT CGACCCGCGG GCCGTGCGGG AGACCACGAT CGAGCTGCTG GCCAAGCTGG GCTTCAACCG CATGAGCGTG GGGGTGCAGG ACTTCGATCC CGCGGTGCAA AAGGCCGTGA ACCGGATCCA GCCCTACGAG ACCACCGCCC GGGTGATCGC AAAGGCGCGT AGCTGCGGAT TCCGTTCCAC CAACCTCGAC CTGATTTATG GGCTGCCGCT GCAGAGCGTA GAGACCTACT CCCGGACCCT GGATCAGACC CTGGAACTGC GCCCCGAGCG CCTGGCGGTC TACAATTACG CCCATCTACC CCACCTGTTC AAAGTCCAGC GCCAGATCCG CGAGGAGCAG CTCCCGGGGC CCGACGAAAA GCTGGCCATC CTGGAGCTGA CCATCCGCCG GCTAACCGAG GCCGGCTATG TCTACATCGG CATGGACCAC TTCGCCCTGC CGGAGGACGA GCTCGCCCAG GCCCAGCGCG CCGGGACGCT GCACCGCAAT TTCCAGGGCT ACTCTACCCG TGCGGAGTGC GACTTGGTGG GCCTGGGGGT CACCTCCATT GGCAAGGTGG GCGAGAGCTA CAGTCAGAAC CTGCGTGATA TGGAGGCCTA CTACGCCCGC CTGGATGAAG GCCGGTTGCC GGTCTTCCGG GGCGTGGAGC TCAGCGCCGA CGATCAACTG CGCCGCGACG TGATCACTGA GATCATGTGC CACTCCCGGG TGGACTTCCG GGAGATGGAG GAGCGTTACC AGATCCACTT CCGGGACTAC TTCCGCGACG CCCTGGAACG GCTGCAGGGG ATGGAGTCGG ATGGTTTGGT CCGAATCGAG AGCGACCGGT TACAGGTCCT CCCCCGAGGG CGTTTGCTGC TGCGCCACGT GGCCATGGCC TTCGACGCCT ACCTGAATGC CGACAACGGC AAAACCCGCT ACAGTAAGGT CATATAG
|
Protein sequence | MPVPHIPNLG IVRFVMDQTL QFDPAILAKY DVSGPRYTSY PTAPQFHEGF DDKAYAEVAA LSNEDPIPRP LSLYVHVPFC DTVCFYCACN KIITGNYSRA GTYLDYLEKE VALQSQLFDA DRRVEQLHFG GGTPTYLSDE DLIRVMDMLS RYFTLERGPQ REFSIEIDPR AVRETTIELL AKLGFNRMSV GVQDFDPAVQ KAVNRIQPYE TTARVIAKAR SCGFRSTNLD LIYGLPLQSV ETYSRTLDQT LELRPERLAV YNYAHLPHLF KVQRQIREEQ LPGPDEKLAI LELTIRRLTE AGYVYIGMDH FALPEDELAQ AQRAGTLHRN FQGYSTRAEC DLVGLGVTSI GKVGESYSQN LRDMEAYYAR LDEGRLPVFR GVELSADDQL RRDVITEIMC HSRVDFREME ERYQIHFRDY FRDALERLQG MESDGLVRIE SDRLQVLPRG RLLLRHVAMA FDAYLNADNG KTRYSKVI
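
Both sequences above are displayed in space-separated blocks of ten characters. As a rough sketch of how such a string might be normalized before further analysis, the helper below joins the blocks and computes a GC fraction; `clean_sequence` and `gc_fraction` are hypothetical names, and the example uses only the first two blocks of the gene sequence, so its result is not expected to match the 63% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated blocks into one uppercase sequence string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence shown above.
prefix = clean_sequence("GTGCCCGTGC CACACATCCC")
print(len(prefix), gc_fraction(prefix))  # prints "20 0.7"
```

Passing the full gene sequence to `clean_sequence` should give a string whose length equals the Gene Length field, which is a quick way to check that nothing was lost in copying.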