Gene Information |
Locus tag | Mlg_1735 |
Symbol | |
ID | 4270842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1984489 |
End bp | 1985379 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638126493 |
Product | endonuclease IV |
Protein accession | YP_742571 |
Protein GI | 114320888 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0648] Endonuclease IV |
TIGRFAM ID | [TIGR00587] apurinic endonuclease (APN1) |
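
The coordinate and length fields above are consistent with one another, which a short check can show. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 1984489
END_BP = 1985379


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints "891 296"
```

Both results match the Gene Length and Protein Length rows of the record.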
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.652839 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCTAG GCGCCCATGT ATCGGCGGCC GGTGGCCCGC AGAACGCCCC CGAGCGGGGC CGGGAGATCG GCTGCGACTG CATTCAGATC TTCACTCGCA ACCAGCGGCA GTGGCGGGTC AAGCCCATCA GTGACGAGGA GGCCACCGCC TTCCGGGCCA ACCGCGAGGC CTGTGGGATC GGCGCGGTGA TGAGCCACGC CTCCTACCTG CTCAACCTGG GCACCACCGA TCCGGAGAAG TTCGCCAAGA CCTATGACGC CTTCGAGCAG GAGCTGCTCC GTTGCCACAA GCTGGGCGTG GAACTGCTGA ACTTCCACCC CGGGGCCCAC GTGGGCAAAG GGGTGGAGAC CGGCATCCAG CAGATTGCCC ACGCGCTCAA CGAGATCTGC CGCGCCCACC CGGACAAGAC CGACGTCTGC CTGGTGCTAG AGAATGTGGC CGGGCAGGGC ACCACCCTGG GTCGCAGTTT CGAGGAACTG CGCGCGATCC TGGACCTGCT CGAGAGCCCG GAGCGCTTCG GTGTTTGCGT GGACACCGCC CACGCCTTTG CCGCCGGCTA CCCGATTCAC ACCGAGGCGG GCTGGGAAAA GACCTGGAGC GAGTTCGACC GCATCCTCGG CCTGGATCGG CTGGTGGCGC TGCACCTGAA CGACTCCAAA GTGCCCTTCG ACTCGCGCAA GGACCGCCAT GCCCTGATCG GGCGGGGCGA GATCGGCGCC GAGGCCTTCC GCCGCGCGGT TACCGACCCG CGCACCCGCC ACCTGCCCAT GTTCCTGGAG ACCCCGGCCG GACCGGAGGG TTGGGCGCGT GAGATCGTAT GGCTGCGCCA GGCGGCGGCG GGCGAGGCCG CCCCGTTGCC GGAGATCGAG GATGCGGGGG TGAACCTGTA A
|
Protein sequence | MQLGAHVSAA GGPQNAPERG REIGCDCIQI FTRNQRQWRV KPISDEEATA FRANREACGI GAVMSHASYL LNLGTTDPEK FAKTYDAFEQ ELLRCHKLGV ELLNFHPGAH VGKGVETGIQ QIAHALNEIC RAHPDKTDVC LVLENVAGQG TTLGRSFEEL RAILDLLESP ERFGVCVDTA HAFAAGYPIH TEAGWEKTWS EFDRILGLDR LVALHLNDSK VPFDSRKDRH ALIGRGEIGA EAFRRAVTDP RTRHLPMFLE TPAGPEGWAR EIVWLRQAAA GEAAPLPEIE DAGVNL
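
The sequences above are printed in space-separated blocks of ten, so the spaces need to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction like the one reported in the GC content row. It assumes plain A/C/G/T input, the helper names are hypothetical, and only the first two blocks of the gene sequence are used to keep the example short.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-of-ten spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence shown above.
prefix = "ATGCAGCTAG GCGCCCATGT"
print(len(clean_sequence(prefix)), gc_fraction(prefix))
```

Passing the full gene sequence string to `gc_fraction` gives the value to compare against the reported GC content.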
|
| |