Gene Information |
Locus tag | Mlg_2109 |
Symbol | |
ID | 4270087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2393692 |
End bp | 2394519 |
Gene Length | 828 bp |
Protein Length | 275 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638126865 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_742941 |
Protein GI | 114321258 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.144161 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTGC ACCAACGCAC CCTGTTCATC ACCGGCGCCA GCCGGGGGGT GGGCCTGGCC ATTGCCCTGC GCGCCGCCCG GGACGGGGCA AATATCGCCA TTGCGGCCAA GACCGACCGG CCCCACCCGA GGTTGCCCGG CACCATCTAC ACCGCTGCTG AGGCCATCGA ACAGGCCGGC GGCCGTGCCC TGCCCCTGAC GGTGGACATC CGCGATGAGG ACCAGGTGGC GGCGGCGGTG GAGCGCACCG CTGATCACTT TGGCGGTATC GACATCCTGG TCAACAACGC CAGCGCCATC CGCCTGAGCG GCACGCTGGA CACGGAGATA AAGCACTTCG ACCTGATGCA CCAGGTGAAT GCCCGCGGCA CCTTCCTCTG CGCCCGGGCC TGCCTGCCCC ACCTGCTGCG GGCGGACAAC CCCCACGTGC TCACCCTGTC ACCGCCGCTG AATCTCAAGC CGGAGCACTT CGGCCCCCAC CTGGCCTACA GCCTGGCCAA GTTCGGGATG AGCCTCTGCA CGCTGGGGCT GGCCGAGGAG TTCCGCGACC GCGGGGTTGC CTTCAACTCG CTCTGGCCGC GCACGCTGCT CGATACCGCG GCGGTGCGCA ACCTGCTGGG TGGCGAGGGG GTGGCGCGCC GCGGTCGCCG CCCGGAGATC GTGGCCGACG CCGCCCACGT CGTGCTCACC CGCGCCGCGC GGGGACAGAC CGGGCAGTTC CTGATCGACG AGGCGGTGTT GCGCAAGGCC GGGGTCACCG ACTTCCGCCG CTACCAGGTC GATCCGGACC TGGCGGAATC CGCGTTGATG GACGATCTGT TTCTCTGA
|
Protein sequence | MSLHQRTLFI TGASRGVGLA IALRAARDGA NIAIAAKTDR PHPRLPGTIY TAAEAIEQAG GRALPLTVDI RDEDQVAAAV ERTADHFGGI DILVNNASAI RLSGTLDTEI KHFDLMHQVN ARGTFLCARA CLPHLLRADN PHVLTLSPPL NLKPEHFGPH LAYSLAKFGM SLCTLGLAEE FRDRGVAFNS LWPRTLLDTA AVRNLLGGEG VARRGRRPEI VADAAHVVLT RAARGQTGQF LIDEAVLRKA GVTDFRRYQV DPDLAESALM DDLFL
|
| |
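The length and composition fields in this record can be cross-checked against the coordinates and the sequence. The following is a minimal sketch, not the database's own method. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names (`gene_length_bp`, `protein_length_aa`, `gc_fraction`) are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Start bp and End bp fields of the record.
    length = gene_length_bp(2393692, 2394519)
    print(length, protein_length_aa(length))

    # Only the first two 10-base blocks of the gene sequence are used here;
    # paste the full Gene sequence field to compare with the GC content field.
    print(gc_fraction("ATGAGCCTGC ACCAACGCAC"))
```

Under these assumptions the coordinates give 828 bp and 275 aa, which agrees with the Gene Length and Protein Length fields. Passing the full Gene sequence field to `gc_fraction` gives a value that can be compared with the rounded GC content field.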