Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1401 |
Symbol | |
ID | 4270623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1606305 |
End bp | 1607087 |
Gene Length | 783 bp |
Protein Length | 260 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638126157 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_742240 |
Protein GI | 114320557 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.278227 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.338656 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAGGG AATCAATGGA ATGGAGCCTG GTCACGGGCG GCGGCACCGG TCTGGGCCGG GCGCTGGCGC TGGCGCTGGC CGCCCGGGGC CAGCGAGTCA TCATCACCGG GCGCCGTCCC GGGCCGCTGG AGGCGACCGC GACCGCCGCG CCTGACGGGG CCGTGGTCCC GGTGCCGGCC GACGTGGCGC AGGAGGAGGG CCGCCAGCGG ATCGGCCAGG CCGTCGACCA GCACGCGCCG AATGGCTTGC GTTTCCTTAT TCACAATGCC GGCACCATCG ACCCCATCGG GCCGTTGACC GAGCTCGATC CCGAGCAGTG GCGGAGGAGC CAGGCGATCA ACGTAGAGGG GCCGCTGTTT CTCACCCGGT CCCTGTTGTC CCGCCTGCCC GCCGGCAGCC GGGTGCTGCA CATCTCCTCC GGTGCCGCCC ACAGCCCGAT CCCCGGCTGG GGGGCCTACT GCACCGCCAA GGCCGCCCTG CACATGCTCT ACCAGTGTCT GGACGGCGAG CTGCGTGACC GGGGCATCCG TGTCGGCAGC CTCCGCCCGG GGGTGGTGGA CACCCCGATG CAGGCACACA TCCGTGCCAG TTCCGAGGCG GCGTTCCCGC TGGTGGAGCG ATTCCGTGCC CTGCACCGAG AGGGGGAGTT GCGGTCGGCC GAGGAGGTGG CCGACTTCGC CTGCTGGGTG CTCAGTGCCA CCGACGACGA CCGCTACGCA TCGTCGGAGT GGAATATCTC CGACCCCGAG CAGACCCGTG ACTGGCAGCG CAGCCCCGGG TAA
|
Protein sequence | MSRESMEWSL VTGGGTGLGR ALALALAARG QRVIITGRRP GPLEATATAA PDGAVVPVPA DVAQEEGRQR IGQAVDQHAP NGLRFLIHNA GTIDPIGPLT ELDPEQWRRS QAINVEGPLF LTRSLLSRLP AGSRVLHISS GAAHSPIPGW GAYCTAKAAL HMLYQCLDGE LRDRGIRVGS LRPGVVDTPM QAHIRASSEA AFPLVERFRA LHREGELRSA EEVADFACWV LSATDDDRYA SSEWNISDPE QTRDWQRSPG
|
| |