Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2251 |
Symbol | |
ID | 4269192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2551573 |
End bp | 2552262 |
Gene Length | 690 bp |
Protein Length | 229 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638127008 |
Product | phosphoglycolate phosphatase |
Protein accession | YP_743083 |
Protein GI | 114321400 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGCTG ACCTGAGCAG GATTCGCGCG GTGCTCTACG ATCTGGACGG CACCCTGGTC GATAGCGCAC CGGACCTGGC GGTGGCGGTT AACCGCGTGC TGGCCGACCT GGGCCAGCAG CCCCGGGAAG AGAACGAGAT TCGCCGCTGG GTGGGCAACG GGGCCCGGCG CCTGATTATG CGGGCCCTGA CCGGCGAGCA TGAGGGCGAT CCCGGTGATG AGCACACGGA TCCGGCACTG GAGCAGTTCT TCGAGTACTA CGGTGAGCGG GTGGCCGAGC GCAGTCGCCT CTACCCCGGC GTGGCCGAGG GCATCGCCGG GGTGGCCGAG CTGGGTATCG CCCAGGCGGT GGTGACCAAC AAGCCGCGCC GGTTTGCCGA GCCGCTGTTG GAGACCCTGG GCATCCGCCG CTATATGGCG ACGGTGGTGG GCGGCGAGTG CGCCCCGGTG AAAAAGCCCG ATCCGGCCCC GCTGCGTCTG GCCCTGGAGC GGCTGGGGGT GGAGCCCGCA CAGGCGTTGA TGGTGGGCGA CTCGGCGGTG GACGTGGGCG CGGCCCGCAA CACCGGCATG AAGGTGATCT GCGTGCCCTA CGGTTATAAC GCCGGCAATG CCATCGAGGA CGCCTTCCCC GATGCCATGG TCAAGAGCCT GGCGGAAATC CCCGCCATGC TGCGCAGTCG GGCGGCCTGA
|
Protein sequence | MKADLSRIRA VLYDLDGTLV DSAPDLAVAV NRVLADLGQQ PREENEIRRW VGNGARRLIM RALTGEHEGD PGDEHTDPAL EQFFEYYGER VAERSRLYPG VAEGIAGVAE LGIAQAVVTN KPRRFAEPLL ETLGIRRYMA TVVGGECAPV KKPDPAPLRL ALERLGVEPA QALMVGDSAV DVGAARNTGM KVICVPYGYN AGNAIEDAFP DAMVKSLAEI PAMLRSRAA
|
| |