Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | AnaeK_4158 |
Symbol | |
ID | 6784173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaeromyxobacter sp. K |
Kingdom | Bacteria |
Replicon accession | NC_011145 |
Strand | + |
Start bp | 4692533 |
End bp | 4693612 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 642765625 |
Product | HNH nuclease |
Protein accession | YP_002136490 |
Protein GI | 197124539 |
COG category | [V] Defense mechanisms |
COG ID | [COG1403] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.416299 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACATCG CACTCGAGTT CACGAAGCGT CTCGTCTCTC TTCTCCGCTC GGAGCGGCAC GCCATGGCGG AGTTCCTGGT CGCCCTGGCG GAGTTCGACG AGCGCGGGCT GTGGCGGGCG CGCGGGCACA CGTCGCTCTT TTCCTTCCTG CACCGTGAGC TGAAGCTCTC GGCCGGGTCC GCGCAGCTTC GCAAGACAGC CGCCGAGCTC ATCCAGCGCT ATCCGGCGGT CGAGGGCGCG CTCCGGGAGG GCAAGCTCTG CCTCTCCTCC GTCTGTGAGC TGGCGAAGGT GGTGACCACC GAGAACTGCG CGGAGATCCT GCCGCGGTTC TTTGGGCTAT CGAGCCGCGA CGCCGCTGCG GTGGTTGCCT CCATCCGGCC GGTCGAGAAC CCTCCGCAGC GAGAGGTCGT CGTGCCGGTC CGGATCGCGG CCCCTCCTGC GGTTCCGGCG TCGTCATCGC GTGATGCCGC GGCGCCGCTA CCCGCACGGA GCGTTCTATT TCATGCGCAT GAAACGCCGC CTGCGTCCGC TGCCACCGAG CCGGCTACGC CGGCTCCGCG GGCGGCCTGC CTCGCGGTAC CGATCGCGAA GCCCAGCTCG GTGGACTGGC TCGCCTCCGA TCAGGCCCGG ATGCACCTCA CGGTCTCGAA GGCGTTCCTG AGGAAGCTCG ACGCGGCGCG GGACGCGCTT TCGCACGCCA TGCCCGGTGC CACCCGCGAG GACGTCCTCG AGGCGGCGCT GGATCAGCTC CTTGCCGAGC GCTCACGCCG CAAGCGCCTC ACGGCGAAGC CGCAGAAGAC GGTCCGCGCC TCGCAGCGGC GGGAGCACAT CCCTGCGCAG GTCCGGCGCG AGGTCTGGGA GCGCGACGGC GGGCGGTGCA CCTTCGCCCT CGCCTCGGGC GAGCCCTGCG GCTCCACGCA CCGGCTCGAG CTGGACCACA TCGTCCCGCT CGCGCGCGGC GGGCCCTCGA CGGCGGACAA CCTCCGCATC CGTTGCCGGG GCCACAACCT CGAGGAGGCG CGGCGGGTCC TCGGGGACGC GCTCGTGGAC GCCTATGCCA GCAGGCGGCG CTGGGGATGA
|
Protein sequence | MDIALEFTKR LVSLLRSERH AMAEFLVALA EFDERGLWRA RGHTSLFSFL HRELKLSAGS AQLRKTAAEL IQRYPAVEGA LREGKLCLSS VCELAKVVTT ENCAEILPRF FGLSSRDAAA VVASIRPVEN PPQREVVVPV RIAAPPAVPA SSSRDAAAPL PARSVLFHAH ETPPASAATE PATPAPRAAC LAVPIAKPSS VDWLASDQAR MHLTVSKAFL RKLDAARDAL SHAMPGATRE DVLEAALDQL LAERSRRKRL TAKPQKTVRA SQRREHIPAQ VRREVWERDG GRCTFALASG EPCGSTHRLE LDHIVPLARG GPSTADNLRI RCRGHNLEEA RRVLGDALVD AYASRRRWG
|
| |