Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0417 |
Symbol | |
ID | 4269456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 466704 |
End bp | 467834 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638125147 |
Product | glucose sorbosone dehydrogenase |
Protein accession | YP_741261 |
Protein GI | 114319578 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000311026 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.00936904 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTGCAGAA GCAGCACCCT CATCCTGATC GGCCTCGGGT TCACCGCCCT GACCGCCCCC ACGCTGGCAC CGGCCACCGA CGTTGAGAAC AGCCAGCACC ACCGCTTCGA GATCGAACGC CTGGGCCAGG GCTTCAGCCA CCCCTGGGGG CTCGCCTTCC TGCCCGACGG CGACCTGCTG GTCACCGAGC GCCCGGGACG GCTGCAGCGC GTCGACGCCG GGACCGGTGA GCGCCGGCGT ATCGAGGGCA CCCCGGACGT CGCCGCCACC GGCCAGGGCG GTATGCTCGA CATCGCCCTG CACCCGGACT TCGACACCAA CCGCTACGTC TACCTCACCT ACTCCGCCTA CGGCCGCGGC GGCATGACCA CCCACCTGGG CCGCGGTGTG CTGGATGGCG ACACCCTGCG TGACTTCGAG CTGCTGTACG CGGCCACCCC CTACTCCGGC GGCGGCCGCC ACTTCGGCTC GCGGATCGTC TTTGACGACG ACGGCTATCT CTTTATGACC ATGGGCGACC GCGGCCGGCG CGAGCGCGCA CAGCAGTTGG ACAACCACCA CGGCAAACTG CTGCGGCTGC ACGACGACGG CGGCATCCCC GCGGACAACC CCTTCGTGGA TGACGAGGGC GCCGAGCCCG CCATCTACAG CTACGGCCAC CGCAACGCCC AGGGCATGAC CCTGCACCCG GAGACCCGGG TGCTCTGGCT GCACGAACAC GGCCCGCGCG GCGGTGACGA GATCAACCTG CCGCGCCCGG GCCTCAACTT CGGCTGGCCG GAGGCCACCT TCGGTACCGA GTACCACGGC CCGGAGATCG CCCCCGACCC GCCGGTGGCA GGCATGGAAC CCCCCATCCA CCACTGGACG CCCTCCATCG CCCCCTCCGG CATGGCCTTC TACTACGCCG ACGCCTTCCC GGAGTGGCAG GGTGATCTGT TCGTCGGGGC CCTGGCCCAC CGCCACCTGG AACGGTTGCG CCTGGACGGC ACCGACGTGG TGGAGCAGGA GCGCCTGCTG CAAGGACTCG GCTGGCGCAT CCGCGACGTG CGGGTCGGTC CCAAGGGCCA TCTCTACGTC CTCCCGGACC GCAGTAGCAC GCCCCTCTTG CGGCTCCGCC CCGCCGACTG A
|
Protein sequence | MCRSSTLILI GLGFTALTAP TLAPATDVEN SQHHRFEIER LGQGFSHPWG LAFLPDGDLL VTERPGRLQR VDAGTGERRR IEGTPDVAAT GQGGMLDIAL HPDFDTNRYV YLTYSAYGRG GMTTHLGRGV LDGDTLRDFE LLYAATPYSG GGRHFGSRIV FDDDGYLFMT MGDRGRRERA QQLDNHHGKL LRLHDDGGIP ADNPFVDDEG AEPAIYSYGH RNAQGMTLHP ETRVLWLHEH GPRGGDEINL PRPGLNFGWP EATFGTEYHG PEIAPDPPVA GMEPPIHHWT PSIAPSGMAF YYADAFPEWQ GDLFVGALAH RHLERLRLDG TDVVEQERLL QGLGWRIRDV RVGPKGHLYV LPDRSSTPLL RLRPAD
|
| |