Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2668 |
Symbol | |
ID | 4268801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 3020731 |
End bp | 3021942 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 638127427 |
Product | protein of unknown function DUF513, hemX |
Protein accession | YP_743498 |
Protein GI | 114321815 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2959] Uncharacterized enzyme of heme biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0000000148689 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGGAAA ATAAACCCGA GCGGGAAGAG GAGAAGCCCC GGTCTGAGAA GACCGGCGGC GAGGACCCCC AGGGCCAGGA GCTGACGGCC TCTGCGGCCG CGCCCGAGCC GGGCAAGGGG GGCGGCGGTA CGCCGCCGGC CGGCGGGGAC GGCGACGGTA CCGACAAGGA TCGCCAGGGC GGGCCCTGGA AACAGGTCGT GGCGTTGCTG GTCGTGGTCC TGGTGCTGGG TGCCGCGGCC ACCTGGTGGC TGACCGGCGA GATGCGCGAG TTGCGCGCCG AGCAGGCGCG TATGGTCAGT GCCGACCGGC TGGATGAACG CAGCGACGCC CTCGAGCGGC AACTGGCACG GCTCGAGGAT CGCGTCACCG ATACCGGCGA GCGCGCCGCC TCCGCCCGTG AGCGGGCCGA CGAGGCCGGC GATGCCCTGG GGACCCTGCG CGAACAGCTG GACGAGTTGC GCGCCCGCCA GGGCGGTTTC GAGGAGGGCC TGGAGCGACT GGGTGCGCGC GCCGAGGCCA ACCGGGAGAA CTGGATCCGT TCCGAGGCGG CCTACCTGGC CACCGTGGCC GTCCACCGGA TGCGCTTCCA CCGCGACCCC AAGACCGCAC TTGGCGCCCT GCAGGCAGCA GACAAGCTGA TGGCCGACAT CGGTGCCAGC GAGAGCGTGC CGGCTCGCGT TGCCCTCAAC GAGGCGGTCA CCCAGGTGCT GGAGTGGGCC CCGCCCGAGG TGGGCCGGCT GGCCGCCACC CTGGCCGACC TGGAAGGCCG GGTGGATGGG CTGCCCATGC CGGCGGAGCG GGCCACCGGC GGCATCGATC TGCCGCGCAT GGCCGCGGAC GAGGGCGACC CGGTCTGGCT GGCGCGGCTG AAGGACGCCA CCGGCCGGGT CCAGGCTGGA TTGGGTGAGC TGGTGGTGGT GCAGCGCGAG GAGGCCGCGC CGCCCCTGGT GGCACCGGAT CAGCGCTACT TCCTGCGCGA GAACCTCAAG CTGCGCCTGG AGGCGGCCCG ACTGGCCGCA CTGCAGGGCG ATCAGGACCT GTGGGAGGAC AGCCTGCAGC GGGCCCACGA CTGGGTCCTG GCCCACTTCG ATACCAGCGA TCTGGATGTG GAGGCGGTGG CGGACACGCT GGCGCGCCTG CGCCGTCAGG ACATCGACCC GGAGCTGCCG GATATCGCCA CCACCCTGGA ACCGGTCAAG CCGTTCCTGT AA
|
Protein sequence | MQENKPEREE EKPRSEKTGG EDPQGQELTA SAAAPEPGKG GGGTPPAGGD GDGTDKDRQG GPWKQVVALL VVVLVLGAAA TWWLTGEMRE LRAEQARMVS ADRLDERSDA LERQLARLED RVTDTGERAA SARERADEAG DALGTLREQL DELRARQGGF EEGLERLGAR AEANRENWIR SEAAYLATVA VHRMRFHRDP KTALGALQAA DKLMADIGAS ESVPARVALN EAVTQVLEWA PPEVGRLAAT LADLEGRVDG LPMPAERATG GIDLPRMAAD EGDPVWLARL KDATGRVQAG LGELVVVQRE EAAPPLVAPD QRYFLRENLK LRLEAARLAA LQGDQDLWED SLQRAHDWVL AHFDTSDLDV EAVADTLARL RRQDIDPELP DIATTLEPVK PFL
|
| |