Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1094 |
Symbol | |
ID | 4269801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1276114 |
End bp | 1277427 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 638125846 |
Product | hypothetical protein |
Protein accession | YP_741936 |
Protein GI | 114320253 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.095311 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGTGGC TGGCCCTCCA CCTGCCCGAC CAGGCCCTGC CCGAGTCGCC CTGCACGCCC CCGGCAACGG GGGCGTGCAG GGCGACCGCC GATCAGGGGG CCGACGACCC CCTCGAAAAC CTGGCGGCCT GGGCCTACCA GTACAGCAGC CGGGTCTGCC CCTGGCCGCC GGCCACGCTG GTGCTGGAGA TTGGTGCCAG CCTGAACCTA TTCGGCGGCC TCCAGGCCCT GCTGTCCAGG ATTGACACCG GACTGCGCCG GCTGGACCGG ACGGCCCGGC GGGCGGTGGC GCCCACGCCC CGGGGGGCCT GTTGGCTGGC GCGCTGCGGG CAACAGCGGA TCCTCCAGAG CCCCGAGGCC CTGCGCCAGG CCCTGGATCC GCTGCCCCTC GCGGTGCTGG ACCTGGAACC GCGCCAGCAT CAGGCCCTGC ACGGTCTGGG CCTGCGCCGC CTGGGCGACT GTCTGGCCCT GCCCCGCCGG GAACTGGCCC GTCGTCTGGG TCCGGCGCTC CACCAGCAAC TGGACCAGGC CCTGGGCCAC CGCCCCGAAC CGCTGCCCGA ATGGCGGCCG CCGGCCCGCT ACCGGGGCCG CCGGGAACTG GTGCGGGAGA CGGACAACCT CACCCCTCTG CTGCCCCTGC TGGAACGGCT GCTGCACGAA CTGCAGGGCC TGCTGCGCGG CCTGGACGCC GGTGTCCCCC GCTTTGAGCT GGTGCTGGAG CACCTCCACC GCCCCGCCAG TCGGCTGACC GTCGGCCTGA CGGAGCCGGA CCGCGACCCG GAGCGTCTGC TGCGGGTGGC TGGCGAACGC CTGGCCCGGG AGCCGCTGGC CGCCCCGGTA CAGGCGATCA CACTGCTGGC CGAGGACATC CAACCCCTGC GGCCCGAGCC GGAGGCCCTG CCCGGCACTC GCGCCGCCCA CGATCATCAC CCCATGCGGG TGCTGCTGGA GCGACTGACC GCACGCCTGG GCGAGACCCG CGCCCACGGG CTGGCGGTGC ATCCGGAGCA CCGCCCGGAA CGGGCCTGGC GCCGGGTGCC ACCGGGCCAG GCCGGTGCCG CCGCCCCCCA GAAACCCCGC CCCACCTGGC TGTTGGAGCG GCCGCGCATC CTCGGCCAGC AGCAGGGCCA GCCGGTCTGC CGCGGCCCGC TGATCCTGGA GCGGGGGCCG GAGCGCATCG AGAGCGGCTG GTGGGACGGC GCCGACGTGG CCCGCGACTA CTACGTGGCC CGCGACCATG ACGGCGCCCG CCTGTGGATC TTCCGCGAGC GCCGTGGCCG CCGGCGCTGG TTCCTGCACG GCCTGTTCGG CTGA
|
Protein sequence | MLWLALHLPD QALPESPCTP PATGACRATA DQGADDPLEN LAAWAYQYSS RVCPWPPATL VLEIGASLNL FGGLQALLSR IDTGLRRLDR TARRAVAPTP RGACWLARCG QQRILQSPEA LRQALDPLPL AVLDLEPRQH QALHGLGLRR LGDCLALPRR ELARRLGPAL HQQLDQALGH RPEPLPEWRP PARYRGRREL VRETDNLTPL LPLLERLLHE LQGLLRGLDA GVPRFELVLE HLHRPASRLT VGLTEPDRDP ERLLRVAGER LAREPLAAPV QAITLLAEDI QPLRPEPEAL PGTRAAHDHH PMRVLLERLT ARLGETRAHG LAVHPEHRPE RAWRRVPPGQ AGAAAPQKPR PTWLLERPRI LGQQQGQPVC RGPLILERGP ERIESGWWDG ADVARDYYVA RDHDGARLWI FRERRGRRRW FLHGLFG
|
| |