Gene Information |
Locus tag | Mlg_2667 |
Symbol | |
ID | 4268800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 3019405 |
End bp | 3020712 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638127426 |
Product | HemY domain-containing protein |
Protein accession | YP_743497 |
Protein GI | 114321814 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG3071] Uncharacterized enzyme of heme biosynthesis |
TIGRFAM ID | [TIGR00540] hemY protein |
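The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions about how the record is built, not statements from it. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3019405
end_bp = 3020712
gene_length_bp = 1308
protein_length_aa = 435

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp. Assumption: the final codon
# is a stop codon and contributes no amino acid.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, codons, remainder, expected_protein_aa)
# prints: 1308 436 0 435
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.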
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.000000205656 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAGCCT TGTTGATCGC CTTGGTCGTG CTCCTGGCCG CGGTGGCCGC CGCCGCCTGG CTGCAGCCCC ACACCGGTTA TTTGGTGCTG TCGGTGGCGG GCTGGCGCAT TGAGACCAGT CTTATCTTCG CAGTGCTGGC CGTGGCTGTG GTGCTGGTGG TCCTGCAGGT GCTCTGGGTG CTCCTGGACC GCACCGTGGG CCTGCCCAAG GTGTTGAGCC ATTGGAGCAC CCGGCGGCGC AAGGAGAAGG CCCGCGGGGA GATGGCCAAG GGGCTGCTGG CGCTCGCCGA GGGCCGCTAC CGGCGCGGGG AGGATCTGCT GCTCAAGCAC GTCGAGCGTA GCGACTACCC GCTCATCAAC TACCTCGGGG CCGCCCTGTG CGCCCAGCGC CGCCATGCCA CCGAGACCCG GGACAGCTAC CTGGCCTTGG CCGAGCAGAC GGCCCGGGGC TCGGGGTCAG CAGTGAATCT GCTGCAGGCC CAGCTCTACA TGGAGTCCGG CCAGTGGGAA CAGGCCCTGG CCAGCCTCAC CTCGGCCTAC GAGCGCAACC CCAACCACCA CCGCACCCTG GAGATGATGC GCGATTGCTG TGTGGCGCTG GAGGACTGGG AGCGGCTCGG GCGGCTGCTC AAACCGCTGC GCAAACAGGG CATCATCGGC TCCGAGGAGG CGGAGGAGTA CGCCCGCTAT GTGGCGCGTG ACAAGATCCG GCGCGCGGCC CGGATCGGGC TGGGTGACCT GGAGTCGGCT TGGTCGCGCT TGCCGCGGGC CCAGCGCAAC GACGACGACC TGGTGCTGAC CCACGCCGAG GCCCTGCTGG AGCTGGACGA GATCGATCGC GCCGCCGTGG TCCTGAAGGG GCGCATCGAC GAGACCTGGG ACGAACGTCT GATCATGCGT TTCGGTGCGC TCGAGCAGAT CGACCCCGAG TGGCAACTGC AGCAACTCAA GCGCTGGCTG CGCCAGCAGC CGGACAACGC CGCGTTGCTT TACGTCGCCG GCCGGGTGGC GCTCCGCTTG CACGACTGGG ATCTGGCCCG GGAGTACCTG GAGCAGGCAC TCGCCCGCCG GGCCCGCCCG GAGGTGTACA TGGCCCTGGG TGCCCTGCTG GAGTTTCAGG AGCGACCGGA CGACGCCCGT GAACTCTATC GCAAGGCGCT GGGCATGGTC AGTGAGAGCA CGAGCAGCGA CGACCTTCCG GAGCTGCCGG TGCCGAGCAC CCTGGAGGGG GAGGTGCGCC CCGAGGACGA GATCGCCGTG CGTCAGACCG GGGAGGAGGA TCCGGTGGCG CTGCGCTCCA GCACCTGA
|
Protein sequence | MKALLIALVV LLAAVAAAAW LQPHTGYLVL SVAGWRIETS LIFAVLAVAV VLVVLQVLWV LLDRTVGLPK VLSHWSTRRR KEKARGEMAK GLLALAEGRY RRGEDLLLKH VERSDYPLIN YLGAALCAQR RHATETRDSY LALAEQTARG SGSAVNLLQA QLYMESGQWE QALASLTSAY ERNPNHHRTL EMMRDCCVAL EDWERLGRLL KPLRKQGIIG SEEAEEYARY VARDKIRRAA RIGLGDLESA WSRLPRAQRN DDDLVLTHAE ALLELDEIDR AAVVLKGRID ETWDERLIMR FGALEQIDPE WQLQQLKRWL RQQPDNAALL YVAGRVALRL HDWDLAREYL EQALARRARP EVYMALGALL EFQERPDDAR ELYRKALGMV SESTSSDDLP ELPVPSTLEG EVRPEDEIAV RQTGEEDPVA LRSST
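The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, with a GC percentage calculation and a stop-codon check. The helper names (`clean_sequence`, `gc_percent`, `ends_with_stop`) are hypothetical, and the stop codon set assumes the standard stops used by translation table 11 (TAA, TAG, TGA). Only the first two blocks and the last three bases of the gene sequence are used here as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Assumption: stop codons for translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ends_with_stop(seq: str) -> bool:
    """Check whether the last three bases form a stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence, as printed in the record.
prefix = clean_sequence("ATGAAAGCCT TGTTGATCGC")
print(len(prefix), prefix.startswith("ATG"), gc_percent(prefix))

# Last three bases of the gene sequence, as printed in the record.
print(ends_with_stop("TGA"))
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` gives a value that can be compared with the GC content field in the table.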