Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2454 |
Symbol | |
ID | 4270195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2789078 |
End bp | 2790343 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638127212 |
Product | Rieske (2Fe-2S) domain-containing protein |
Protein accession | YP_743284 |
Protein GI | 114321601 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000486629 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.341461 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGCCG ATCCCAAGGT CATCAACGAC AAGCTGGCCA AGCGCATGTA CCAGCCCTAC CTGGAAGCCG AGTGGGGTTT CATCAATCAC TGGTACCCGG CCCTGTTCAC CCACGAACTG GAGGAGGGTG ATACCAAGGG CATCCAGATC TGTGGTGTGC CCATCGTTCT GCGTCGTTCC AAGGGCAAGG TCTACGCCCT GAAGGACCAG TGCATCCACC GTGGCGTGAA GCTTTCGGCC AAACCCATGT GCCTTACCGA TGACACCATC ACCTGCTGGT ACCACGGCTT CACCTTCGAC CTGGCCTCCG GCAAGCTGGT CTCCATCGTG GCCGCGCCCG ATGACGAGAT CATCGGCACC ACCGGCGTTC AGACGTTTGC CGTCGAGGAG CACAGCGGCA TGATCTTCGT GTTCGTCTGC GACGAGGACT GGGACGAGGA CGTGCCCCCG CTGGCCGCGG ACCTGCCGCT GCGTTATCCG GAGAACAACG AGCGTTTCCC GCACCCCTAC TGGCCCGATA CCCCCAGCGT GCTGGACGAG CACTCCGTTG CCCTCGGTAT CCACCGCAAG GGGTACGCCA ACTGGCGACT GGCGGCCGAG AACGGCTTTG ATCCGGGCCA CCTGCTGATC CACAAGGACA ACGCCATCGT GCACGCCCGT GACTGGGCGC TGCCGTTGGG GGTGAAACCG GTCACCGACC AGGCCATCGC GCTGATCGAG GACGACAACG GCCCCAAGGG CTTCCTGAAC CGGTACTACA CGGACCACTA CGAACCGATC CTGGAGAACG AAAAGCTGGG TGTGAAGGCG CAGGGCACCG TGCCCCGCTA CTTCCGCACC TCCATGTACC TGCCCGGCGT GCTCATGGTG GAGAACTGGC CGGAGGACCA TGTGGTGCAG TACGAGTGGT ACGTGCCCAT TACCGACGAC ACCTATGAGT ACTGGGAGGT GCTGGTCAAG CACTGCAAGG ACGAGCAGGA GCGCAAGGAC TTCGAGTACC GCTTTGAGAA CCTCTACAAG CCCATGTGCC TGCACGGCTT CAACGACTGC GACCTGTTCG CCCGCGACGC CATGCAAAAC TTCTACGCCG ATGGCACCGG CTGGAACGAG GAGCAGCTGG CCGACATGGA CGCCTCGGTG GTGACCTGGC GCAAGATCGC CTCCCGTCAC AACCGCGGTC TGGCCCGCAA GCCCAAGGGC GTGCCGGGCG TGCTCAAGGA CCAGAGCTAC CGGTTTGCCG AGGCCTCTGA AGGCGCCTTC GAGTAA
|
Protein sequence | MAADPKVIND KLAKRMYQPY LEAEWGFINH WYPALFTHEL EEGDTKGIQI CGVPIVLRRS KGKVYALKDQ CIHRGVKLSA KPMCLTDDTI TCWYHGFTFD LASGKLVSIV AAPDDEIIGT TGVQTFAVEE HSGMIFVFVC DEDWDEDVPP LAADLPLRYP ENNERFPHPY WPDTPSVLDE HSVALGIHRK GYANWRLAAE NGFDPGHLLI HKDNAIVHAR DWALPLGVKP VTDQAIALIE DDNGPKGFLN RYYTDHYEPI LENEKLGVKA QGTVPRYFRT SMYLPGVLMV ENWPEDHVVQ YEWYVPITDD TYEYWEVLVK HCKDEQERKD FEYRFENLYK PMCLHGFNDC DLFARDAMQN FYADGTGWNE EQLADMDASV VTWRKIASRH NRGLARKPKG VPGVLKDQSY RFAEASEGAF E
|
| |