Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1990 |
Symbol | |
ID | 4270464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2260316 |
End bp | 2261020 |
Gene Length | 705 bp |
Protein Length | 234 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638126746 |
Product | HAD family hydrolase |
Protein accession | YP_742822 |
Protein GI | 114321139 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01428] 2-haloalkanoic acid dehalogenase, type II [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.169915 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.0795934 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAGT CGGGCGCGAT CCGGGCGGTC ACCTTCGATC TCGACTTCAC CCTCTGGGAC TTGGAACACG TCATTCAGCG TGCCGAGCAG CGCATGCAGC GCTTTCTGGC GGCGCGCTAT CCGCGGGTCA GCGAGCATTT CGACGAGGAG GCCATGCGCC GGCTGCGGTT GCGTATGGCG GAGGAGCACC CGGAGTTGCG GATCAATGTC AGCGCCATGC GCCGTGCGTC GCTGCGACGC ATCGCGCTGA CCTGCGGCTA TGGTGAGGAT ATGGTGGAGG CGGCCTTCCA TGTCTTCATG GAGGGGCGCC ACGAAGTGGT GCCATATGAC GACGTGGCCC CCACCCTGAC GGCATTGCGC CGTCACTACC GGATCGGAGC GCTGACCAAC GGTAATGCCG ACGTCAACCG CCTGGCCTTG GGCGAGTACT TTGACTTCAG TGTCTCAGCG GTGGAGGTCG GCGCGGCCAA GCCGAGCCGG ATCATCTTCG AGGCGGCCTG TCATCGGGCC GGCATCGCCC CCGGTGAGAT GGTCCATGTG GGTGATGAGG TGCACAGCGA CGTGCTGGGC GCGGTGCGCT TTGGGATGGG GGCCGTTTGG CTCAACCGCC GGGGCGAACC CTGGCCGGAG GACCTGGAGC GGCTGCCTCA TGTGGAATTG GCGGACCTGA GCCGGTTGCC GACGGTGCTG GCCCGCTGGC GTTGA
|
Protein sequence | MPESGAIRAV TFDLDFTLWD LEHVIQRAEQ RMQRFLAARY PRVSEHFDEE AMRRLRLRMA EEHPELRINV SAMRRASLRR IALTCGYGED MVEAAFHVFM EGRHEVVPYD DVAPTLTALR RHYRIGALTN GNADVNRLAL GEYFDFSVSA VEVGAAKPSR IIFEAACHRA GIAPGEMVHV GDEVHSDVLG AVRFGMGAVW LNRRGEPWPE DLERLPHVEL ADLSRLPTVL ARWR
|
| |