Gene Information |
Locus tag | Mlg_0126 |
Symbol | |
ID | 4269819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 142429 |
End bp | 143664 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638124850 |
Product | Orn/DAP/Arg decarboxylase 2 |
Protein accession | YP_740971 |
Protein GI | 114319288 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0019] Diaminopimelate decarboxylase |
TIGRFAM ID | [TIGR01048] diaminopimelate decarboxylase; [TIGR03099] pyridoxal-dependent decarboxylase, exosortase system type 1 associated |
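
The coordinate and length fields in the table above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Consistency check for the coordinate fields of this record.
# Assumption: Start bp / End bp are 1-based, inclusive coordinates.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(142429, 143664)
residues = protein_length_aa(length)
print(length, residues)  # 1236 411
```

Both values agree with the Gene Length (1236 bp) and Protein Length (411 aa) rows.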
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATGGAC AACGCCCTTG CCATGCGCCC ATGGACCAGT TCGAGGTTGC CGACGGCATG CTGCAGGTGG GTGGCCGCCC GCTCTCCGAT CTGGTGCAGC AGGCCGGCGG TACGCCCTTC TACGCCTACG ACCGTCGCCT GATGACCCGG CGTGTCGCGG CGTTGCGCCA GACCCTGCCA GAGGGGATTG CGCTTCACTA CGCGATCAAG GCCAACCCCA TGCCTGCCGT GGTCCAGCAT ATGGCCGGCC AGGTTGACGG TCTCGATGTG GCCTCGGCGG GGGAGTTGCG GGTGGCGTTG GATGCCGGGG TGTCCGCGGC GGATATCAGC TTCGCCGGCC CCGGTAAGAC GGACGATGAA CTGGCCGCGG CCGTAGCCGC GGGTATCACC ATCAACCTGG AGTCGGCCAC CGAGCTGGAG CGGCTGGCCG CCATCGCGGA GAAGATCGGT CACCGACCCC ATGTGGCGGT GCGGGTGAAT CCCGACTTCG AGCTCAAGAC CTCCGGGATG AAGATGAGCG GCGGGGCCAA GCCGTTCGGC GTGGATGCGG AGGCGGTGCC GGCGTTGCTC AAACGCATTG CGGCGCTGGA TGTGCATTTT CGCGGTTTTC ACATCTTTTC CGGGTCACAG AATCTGCGGG CCGAGGCCCT GGTGGAGGCC CAGGGCCTGA CCCTGGACCT GGCGCTCCGG CTGGCGGACG ATGCGCCCGG CCCGGTGGAG ATGCTGAATA TCGGCGGTGG CTTTGGCATC CCCTATTTCC CGGGCGACCG CCGGCTGGAT CTGGCGCCGG TCAGTGAGCA GCTTGAAGCG CGGCTCCCGG CGGTGCGGGA GCGCCTCCCG GGGGTGGAGA TCATCCTGGA GCTGGGGCGT TACCTGGTCG GCGAGGCCGG CATCTACGTG GCGAAGGTAG TGGACCGCAA GGTCTCCCGC GGCCGCACAT TCCTGGTCAC CAACGGGGGG TTGCATCACC ATCTGGCGGC ATCCGGCAAC TTCGGCCAGG TCATCCGCAA GAACTACCCG GTCGCCGTCG GCAATCGCAT GGATGCCAGC GCCAGCGAGA CAGTGGATGT GGTGGGGCCG TTGTGCACGC CGTTGGATAT CCTCGCCCAG CAGGTGGACC TGCCGCCGGC GCAGCCCGGT GACTGGATCG TGGTGTACCA GTCCGGTGCC TACGGTTACA CCGCGAGCCC GACGCGGTTC CTGGGGCATC CGGAACCGCT GGAGTTGCTC GTTTGA
|
Protein sequence | MNGQRPCHAP MDQFEVADGM LQVGGRPLSD LVQQAGGTPF YAYDRRLMTR RVAALRQTLP EGIALHYAIK ANPMPAVVQH MAGQVDGLDV ASAGELRVAL DAGVSAADIS FAGPGKTDDE LAAAVAAGIT INLESATELE RLAAIAEKIG HRPHVAVRVN PDFELKTSGM KMSGGAKPFG VDAEAVPALL KRIAALDVHF RGFHIFSGSQ NLRAEALVEA QGLTLDLALR LADDAPGPVE MLNIGGGFGI PYFPGDRRLD LAPVSEQLEA RLPAVRERLP GVEIILELGR YLVGEAGIYV AKVVDRKVSR GRTFLVTNGG LHHHLAASGN FGQVIRKNYP VAVGNRMDAS ASETVDVVGP LCTPLDILAQ QVDLPPAQPG DWIVVYQSGA YGYTASPTRF LGHPEPLELL V
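
The protein sequence follows from the gene sequence under translation table 11, where an alternative start codon such as GTG is read as methionine at the first position. The following is a minimal sketch of that relationship, not the pipeline that produced this record. It assumes the codon assignments of table 11 match the standard code and that the start codon set is TTG, CTG, ATT, ATC, ATA, ATG, GTG, as listed for NCBI table 11. The names `translate_table11` and `gc_fraction` are hypothetical.

```python
# Sketch: translate a bacterial CDS (translation table 11) and compute GC content.
# Assumption: table 11 shares the standard codon assignments and differs only
# in which codons may act as start codons.

BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES), AMINO))
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate_table11(seq: str) -> str:
    """Translate up to the first stop codon; a start codon at position 1 gives M."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First six codons of the gene sequence above.
print(translate_table11("GTGAATGGAC AACGCCCT"))  # MNGQRP
```

Applied to the full gene sequence, `translate_table11` can be compared against the Protein sequence row, and `gc_fraction` against the GC content row.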