Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1326 |
Symbol | |
ID | 4070615 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1600656 |
End bp | 1601642 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637983335 |
Product | endonuclease IV |
Protein accession | YP_590402 |
Protein GI | 94968354 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0648] Endonuclease IV |
TIGRFAM ID | [TIGR00587] apurinic endonuclease (APN1) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCGCG CGGTTTCGAA AAAAGTAGAA ATAGACATCC TCAAACTCAT CCCACCGCCA AAAACTCCGC CCAAACGCAC TGCGGTTCGC ATCGGTATTC ACACTTCGAG TGCCGGTGGA GTCGAACTCG CCGCGGAACG CGCCTACCGT CTCGGCTGCT CCTGCCTCCA AATCTTCTCT TCCAGTCCAC GCCAATGGAA GCCCTTTGAA CTCGGCCGCT CGCAGTGCGA GACGATGTCG TCCATTCGCG CCAAGTATGA CCTCAACCCG CTCGTCATCC ACGCGAATTA CTTGATCAAT GTCGCCGGCG GCAATCCCGA GTTCCACCAG AAATCCATCG AAGCCTTCCG CGCCGAGGTC CAGCGCGGCA TCGATCTCTG CGCCGACTAC CTCGTTCTGC ATCCCGGCTC ATTCAAAGGC GCGACGCGCG AAGACGGCCT GCAACGCGCC GCGGAAGCCA TCGAAGCCGC AGTCGACGGT CTCGGCATCG AGAAGACGAA CCTGAAGATC ACCATCGAGA ACACCGCCGG TTCCGAATTC TCGTTAGGCG GTAGCTTCGA ACAAGTTGCG GAATTAATGG CGCGCCTGCG CAAGCACGTC CCCGTCGCCG CCTGCATCGA CACCTGCCAT ACCCACGTCG CCGGCTACGA CATCACCACG AAAAAGGGCT TCGAGAAAAC CCTGCAGCAA CTCGACGACA CCGTCGGCCT GAAGAATGTC GGAGTTTTCC ACTGTAACGA CGCCAAAGCC CCCCGCGGCT CCAAGCTCGA CCGCCACCAG CACATCGGCC AGGGAACCAT CGGCCTCGAA CCCTTCAAAT GGCTGTTGAA CGATCCACGG CTACAGCACC CCGCGTTCAT CGCCGAAACG CCTATTGACG AGCCGCTGGA TGACCTGAAG AATATCGACG CCCTGAAAAG CTGTGTGAAG AAATCAAAGC CTGCCATTCA CCACAGAGAC GCAGAGGCAC AGAGAACAAA GAAATAA
|
Protein sequence | MPRAVSKKVE IDILKLIPPP KTPPKRTAVR IGIHTSSAGG VELAAERAYR LGCSCLQIFS SSPRQWKPFE LGRSQCETMS SIRAKYDLNP LVIHANYLIN VAGGNPEFHQ KSIEAFRAEV QRGIDLCADY LVLHPGSFKG ATREDGLQRA AEAIEAAVDG LGIEKTNLKI TIENTAGSEF SLGGSFEQVA ELMARLRKHV PVAACIDTCH THVAGYDITT KKGFEKTLQQ LDDTVGLKNV GVFHCNDAKA PRGSKLDRHQ HIGQGTIGLE PFKWLLNDPR LQHPAFIAET PIDEPLDDLK NIDALKSCVK KSKPAIHHRD AEAQRTKK
|
| |