Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2737 |
Symbol | |
ID | 4270991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 3105865 |
End bp | 3107112 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638127499 |
Product | glucose sorbosone dehydrogenase |
Protein accession | YP_743567 |
Protein GI | 114321884 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.417309 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCAGC CTGGTTCATT CCTTACGACC CCAGCCCGCT GCGTAGTCGG GGCAGCGGTC CCAGCGGTCT TGCTGGTGAC CAGCCTGTCG GCGCTGGCGG AGGTCAGTCG CAGTGACCAG GTGGTGGCCG CCGGGGTCGA GAGCGAGCAG GCCCGCTTTG ATGTCGTCCA GGTGCTGGAG GGGTTGGAGC ACCCCTGGGC GGTGGCCTGG CTGCCCGATG GCCGCAAGCT CGTCACCGAA CGCCCCGGCC GCCTCTGGCT GGTGGATGGC GATGACATCA CCCGCGTGGG TAATACCCCC CGGGTGAACC CCGAGGAGCG CGACGGTCTG GCCCTGGAGG CGAGCTGGCA GGGCGGCCTG CTCGATCTGG CGGTCCACCC CGATTACGAG GACAACGGCT GGATCTACAT GACCTATTCC AGCCCGGGGG ATCCGGACGC GGTCATCGGC GACGCCGAGT TTGGCAGCGG CACCGCCCTG GCCCGGGCGC GGTTGAGCGA CGACGGCAGC CAGCTCACCG ACCTGGAGAC GCTGTACGTG CAGATGCCCC GCACCGCCCC GGGCCGGCAC TACGGCTCGC GGATCGTCTT TCCGGGCGAC GGCACCGTGA TCTTCTCCAT CGGCGACAGC GGCCTGCGCG CCCCCTCGCA GGATCTGACC GATCCGGCCG GGTCCATGAT CCGCCTCAAC GAGGATGGTG GCGCCGCCGA GGACAACCCG CTGGTGGGCA TGGCGCCGGG CAACCTGCGG CCGGAGATCT ACTCCTTCGG TCACCGCAAC AACCAGGGCC TGGCCATCCA CCCGGAGACC GGTGAGATCT GGACCAGCGA GCACGGGCCC CGGGGCGGCG ACATGATCCA CCGGATCGAG CCCGGCAACA ACTACGGCTG GCCCCAAGTG GCCTACGGCA CCGAGTACTC CACCGACGAG CAGGTCGGCA TTGGCCGGTC CGCCCCCGGC GTGACCCCGG CGGTCCACTA CTGGGACTAC TCTATGGCCC CCTCAGGGCT CGCCTTCTAC AGCGGTGATG AGGTGCCGGG CTGGCAGGGC GATCTGTTTG CCGGGTCGCT GGCCGAGGAG CGCTTGCACC GGCTGGTGCT GGAGGGCGAC CGCGTGGTCC ACGAGGAGCT CCTGCTCGAC GGCACCCTCG GGCGCATCCG GGATGTGCGG CAGGGGCCGG ACGGCCGGCT CTACCTGCTC ACGGATGAGG AGTCGGGGGG GCTCTATCGG TTGGAGCCCG CCCACTGA
|
Protein sequence | MQQPGSFLTT PARCVVGAAV PAVLLVTSLS ALAEVSRSDQ VVAAGVESEQ ARFDVVQVLE GLEHPWAVAW LPDGRKLVTE RPGRLWLVDG DDITRVGNTP RVNPEERDGL ALEASWQGGL LDLAVHPDYE DNGWIYMTYS SPGDPDAVIG DAEFGSGTAL ARARLSDDGS QLTDLETLYV QMPRTAPGRH YGSRIVFPGD GTVIFSIGDS GLRAPSQDLT DPAGSMIRLN EDGGAAEDNP LVGMAPGNLR PEIYSFGHRN NQGLAIHPET GEIWTSEHGP RGGDMIHRIE PGNNYGWPQV AYGTEYSTDE QVGIGRSAPG VTPAVHYWDY SMAPSGLAFY SGDEVPGWQG DLFAGSLAEE RLHRLVLEGD RVVHEELLLD GTLGRIRDVR QGPDGRLYLL TDEESGGLYR LEPAH
|
| |