Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A2553 |
Symbol | |
ID | 6872682 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 2433185 |
End bp | 2434042 |
Gene Length | 858 bp |
Protein Length | 285 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642785628 |
Product | endonuclease IV |
Protein accession | YP_002216286 |
Protein GI | 198244000 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0648] Endonuclease IV |
TIGRFAM ID | [TIGR00587] apurinic endonuclease (APN1) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.764772 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.000695113 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAATACA TCGGAGCGCA CGTCAGCGCT GCTGGCGGTC TGGCTAACGC CCCTGCCCGC GCGGCTGAAA TTGGCGCAAC GGCCTTTGCG CTTTTCACAA AAAACCAGCG TCAGTGGCGT GCCGCCCCCC TTACTCCCCA GGTCATTGAT GACTTTAAAA TCGCCTGTGA AAAGTATCAT TTCTCGGCGG CGCAAATTCT TCCCCACGAT AGTTACCTGA TTAATCTGGG CCATCCGGTC AGTGAAGCGC TGGAAAAATC ACGCGATGCC TTTCTCGATG AAATGCAGCG CTGTGAACAA CTCGGCTTAA CCTTGCTTAA TTTTCATCCC GGTAGCCATC TGATGCAGAT TGCACAGGAG GATTGCCTGG CGCGGATCGC GGAATCCATC AATATTGCCC TCGCGCAGAC CGAGGGCGTT ACGGCGGTTA TCGAAAATAC AGCCGGTCAG GGCAGTAATC TGGGGTTTGA GTTTGAACAG TTAGCCGCCA TCATCGACGG CGTGGAAGAT AAGTCGCGCG TTGGCGTCTG TATCGATACC TGCCATGCCT TTGCCGCCGG ATACGATCTG CGTACGCCAG AGGCGTGCGA AAAAACATTC TCCGAATTCG GGAAAATTGT CGGATTTCAG TATTTGCGCG GAATGCACCT GAACGACGCC AAAAGCGCCT TCGGTAGCCG CGTTGACCGC CATCACAGTC TGGGTGAAGG CAATATCGGC CACGATGCGT TTCGTTGGAT TATGCAGGAT GGGCGTTTTG ACGGTATTCC GCTGATACTG GAGACCATCA ATCCTGATAT CTGGGCGGAA GAGATTGCGT GGTTAAAAGC CCAGCAAATT GCCGAAGCGA TGGCCTGA
|
Protein sequence | MKYIGAHVSA AGGLANAPAR AAEIGATAFA LFTKNQRQWR AAPLTPQVID DFKIACEKYH FSAAQILPHD SYLINLGHPV SEALEKSRDA FLDEMQRCEQ LGLTLLNFHP GSHLMQIAQE DCLARIAESI NIALAQTEGV TAVIENTAGQ GSNLGFEFEQ LAAIIDGVED KSRVGVCIDT CHAFAAGYDL RTPEACEKTF SEFGKIVGFQ YLRGMHLNDA KSAFGSRVDR HHSLGEGNIG HDAFRWIMQD GRFDGIPLIL ETINPDIWAE EIAWLKAQQI AEAMA
|
| |