Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1444 |
Symbol | |
ID | 4269254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1648646 |
End bp | 1649530 |
Gene Length | 885 bp |
Protein Length | 294 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638126200 |
Product | Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase |
Protein accession | YP_742283 |
Protein GI | 114320600 |
COG category | [R] General function prediction only |
COG ID | [COG0388] Predicted amidohydrolase |
TIGRFAM ID | [TIGR03381] N-carbamoylputrescine amidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.0920298 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCCTT TGCGTGTCGC CCTGGTCCAG CAACGCTGCG GCCCGGACCC CGATGACAAC CTCCACCGGA CGCTGACCGC CATCGCCGAA GCCGCCGGCC GGGGGGCCGG CCTGGTCCTG CTCCAGGAGC TGCATCGGGG GCGCTACTTC TGCCAGCAGG AGGATCCGGC CTGTTTCGAC CAGGCGGAGC CCGTACCCGG TCCCACCACG GACGCCCTCG GCACGGCAGC ACGCGAGCAC GGCGTGGTGG TCGTCGGCTC GGTGTTCGAG CGGCGCGCCC CGGGGCTCTA CCACAACACG GCCGTGGTGC TCGATGCGGA CGGCAGCCTC GCCGGGCGCT ACAGGAAGAT GCATATCCCG GATGATCCGG GCTACTACGA GAAGTTCTAT TTTACCCCGG GCGACTTGGG CTTCGAGCCC GTCCAGACCC GTGTCGGCCG GCTGGGGGTG CTGGTGTGCT GGGACCAATG GTTCCCCGAA GCGGCCCGTC TGATGGCGCT GGCCGGGGCG GAGGTGCTGC TCTACCCTAC CGCCATAGGC TGGACGCCCG ATGACCGCCC CGATGAACAG GCGCGTCAGC GGGAGGCTTG GATGCTGGCG CAACGGGGCC ACGCGGTGAG CAACGGCTTG CCGGTACTGG CCTGTAACCG CACCGGCGAG GAACCGGACC CGGAGCACCC GGACCAAGGC ATCCGTTTCT GGGGCGGCAG CTTCGTCTGC GGCCCGCAGG GCGAGATCCT GGCCCAGGCG GCCACCGATG AGGAGTGCGT CCTGACTGTG GACGTTGACC TGCAGGCGGT GGAGCAGGTG CGCCGTATCT GGCCCTTCCT GCGCGACCGG CGTATCGATG CCTACAGCGA TTTACTTAAA CGTTTCAGGG ACTAA
|
Protein sequence | MAPLRVALVQ QRCGPDPDDN LHRTLTAIAE AAGRGAGLVL LQELHRGRYF CQQEDPACFD QAEPVPGPTT DALGTAAREH GVVVVGSVFE RRAPGLYHNT AVVLDADGSL AGRYRKMHIP DDPGYYEKFY FTPGDLGFEP VQTRVGRLGV LVCWDQWFPE AARLMALAGA EVLLYPTAIG WTPDDRPDEQ ARQREAWMLA QRGHAVSNGL PVLACNRTGE EPDPEHPDQG IRFWGGSFVC GPQGEILAQA ATDEECVLTV DVDLQAVEQV RRIWPFLRDR RIDAYSDLLK RFRD
|
| |