Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0113 |
Symbol | |
ID | 4268200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 124956 |
End bp | 126005 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638124839 |
Product | hypothetical protein |
Protein accession | YP_740960 |
Protein GI | 114319277 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGCCAAA TGTTGGACCA CGACTTGGAC TGGCTGACGG CAACGGCTCA TCCCTCGCAC GGCGAGGTGG GGCAGTTCGT CTCCGGCGAT CATTATCTGC CGTTCTATGC TCACGATGGT GGTTTGGGTT TCTTACTGGC CGGCTTCGCT TTAGGGCGCA TTCCGATCCG ACGATACGTC ATCAGCGGCA ATTTGCCGTA TTTGTCGAAG CCAGAGGCGG CAGAGGTGCT TTGCCTCCTG CGGCGGGAGC TTGGTAGCCG AGGCGTGCTA TTCCTAATAG GCGTCGTTGA AGGTGAGCCC CTCTGGCAGG CGTTGCGAGA GCCAATAGTG CGAAGGCACT ACCGCGTGTT GCAAAGTGGT GGGTTGGAAA CGCGTCGGCT GGCGTGCTTT CCAGAGGGTT TTGATGCCTA CCTGGCAAGT CTGCCCAGAA GAAGGCGTCA GGATTTGCGA CGGAGCGAGC GGCGTTTTTT ACGGCGATTT CCCGGCTGGT CTCTTGGGGT GTATACCGAG CCCGAGCTGT TGGGCCGTTT CCTAGGTCAG GTTGAGCCGG TTTCTTTGAG GACGTACCAA AGCCGATTAC ACGGTCTGGG GATAACTGAA ACCGGTTGGA TTGCCAGGAA AGTGCGGGCC GGTGCCCGCC TGGGCAAGGC TCGTTGTTAC GCCCTCTTCG TCGATGGCCA GCCAGTTGCC TGGCGTATCG GGTTTGTTCA TGGTGGCATC TACTACAGCC ACCACATCGG CTATGACCCG AGGCTGGCAC AGTGGCACCC CGGCATTGTA CTCCAGTTGA AAGTGATCGA GGATCTTTGT CAGCTCACGC CGGCCGTCAG CGAACTCGAC ATGTTGTATG GGGATAACCC CGTCAAAGAA AAGCTCAGCA ACGCTTGCCG TCGGGAGGGG AAATTTTACC TGTTCCCGAA TAACCTCAAG GGGAACCTAA GCTTTGCGGC ATTGAGAGGC TTCAATGCCT TTTCGGACGG CGTTTCAGGC CTTGCAGACT GGCTCCGTAT CAAGGAGCGT CTGCGGCGTC GCCTGCGCCG CGCGGTTTGA
|
Protein sequence | MGQMLDHDLD WLTATAHPSH GEVGQFVSGD HYLPFYAHDG GLGFLLAGFA LGRIPIRRYV ISGNLPYLSK PEAAEVLCLL RRELGSRGVL FLIGVVEGEP LWQALREPIV RRHYRVLQSG GLETRRLACF PEGFDAYLAS LPRRRRQDLR RSERRFLRRF PGWSLGVYTE PELLGRFLGQ VEPVSLRTYQ SRLHGLGITE TGWIARKVRA GARLGKARCY ALFVDGQPVA WRIGFVHGGI YYSHHIGYDP RLAQWHPGIV LQLKVIEDLC QLTPAVSELD MLYGDNPVKE KLSNACRREG KFYLFPNNLK GNLSFAALRG FNAFSDGVSG LADWLRIKER LRRRLRRAV
|
| |