Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Anae109_3813 |
Symbol | |
ID | 5375599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaeromyxobacter sp. Fw109-5 |
Kingdom | Bacteria |
Replicon accession | NC_009675 |
Strand | - |
Start bp | 4443274 |
End bp | 4444275 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640845338 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_001380976 |
Protein GI | 153006651 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.00879372 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGGATCC TCGCGATCGA GACCTCCTGC GACGAGACCG CGGCCGCCAT CGTCGAGGAC GGGCGGCGGG CGCTCGCGGA CGTGATCTCC ACGCAGATCG ACATCCACCG CCGGTGGGGC GGGGTGGTGC CCGAGCTCGC GAGCAGGAAC CACGTCGTCC AGGTGATGCC GGTGGTGGAC GAGGCGCTCT CCCGGGCCGG CGTCGGTCCG GACGGCCTCG ACGCCATCGC GGTCACGAGC GGCCCCGGCC TGGTGGGCGC GCTCCTCGTC GGCGTCCAGG CCGCCAAGGC GCTCGCGCTC GCGTGGCAGA AGCCGCTCGT GCGCGTCAAC CACCTCGAGG GCCACCTCGT GGCCGCCTTC CTCTCCGAGA CCGCCCCGGC CTTCCCGTAC CTCGGGCTCG TGGTCTCCGG TGGCCACACC TCGCTCTACG CGGCGCACGG GTTCGGCGAC TACCGGCTGC TCGGCCAGAC CCGCGACGAC GCCGCGGGCG AGGCGTTCGA CAAGGGCGCG AAGCTGCTCG GGCTGCCGTA CCCCGGAGGC GTGGCCATCG ACCGGCTGGC GAAGGAGGGC GACGCGCGCG CCATCCGCTT CCCGAAGGCG ATCGTGAAGG GCGCGGACCT CGACTTCAGC TTCTCCGGCC TGAAGACGGC GCTGCTCCAC CACGTGAAGA AGCACGGCCT GCCCGAGGGC AAGGGGCTCG CCGATCTGTG CGCGAGCTAC CAGGAGGCGA TCGTCCGCGC TCTGGTCGAG AAGGCGTTCC GCGCCGCGCG CCGCCTGCAG TACGACAGGC TCGTGCTCTC GGGCGGCGTC GCCGCGAACA GCCGGCTGCG CGGCGCGGTG GCCGAGCGCG CGCGGGAGTA CGAGGGGATG GAGGTGTTCC TGCCCGCCCC CAGGCTCTGC ACGGACAACG CGGCGATGAT CGCAGTCGCC GGCACGCACG CGTTCCTCCG CGGCGAGCGC GCGGGCGCCG ACCTCAACGC GGATCCGGCC TGGCGCCTGT GA
|
Protein sequence | MRILAIETSC DETAAAIVED GRRALADVIS TQIDIHRRWG GVVPELASRN HVVQVMPVVD EALSRAGVGP DGLDAIAVTS GPGLVGALLV GVQAAKALAL AWQKPLVRVN HLEGHLVAAF LSETAPAFPY LGLVVSGGHT SLYAAHGFGD YRLLGQTRDD AAGEAFDKGA KLLGLPYPGG VAIDRLAKEG DARAIRFPKA IVKGADLDFS FSGLKTALLH HVKKHGLPEG KGLADLCASY QEAIVRALVE KAFRAARRLQ YDRLVLSGGV AANSRLRGAV AERAREYEGM EVFLPAPRLC TDNAAMIAVA GTHAFLRGER AGADLNADPA WRL
|
| |