Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B1065 |
Symbol | hpaI |
ID | 6796505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 1063163 |
End bp | 1063954 |
Gene Length | 792 bp |
Protein Length | 263 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 642775334 |
Product | 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase |
Protein accession | YP_002145975 |
Protein GI | 197249063 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3836] 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase |
TIGRFAM ID | [TIGR02311] 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATG CTTTCAAAGA CGCGTTAAAA GCGGGGCGCC CGCAAATCGG TTTGTGGCTG GGGCTTGCCA ACAGTTACAG CGCTGAACTG TTAGCGGGCG CCGGCTTCGA CTGGCTACTG ATCGACGGTG AACACGCGCC AAACAACGTG CAGACGGTGT TGACCCAGTT GCAGGCGATT GCGCCTTATC CCAGCCAGCC GGTGGTGCGT CCGTCATGGA ACGATCCGGT ACAGATTAAG CAACTGCTCG ACGTCGGCGC GCAAACGCTG CTGATACCGA TGGTGCAGAA TGCCGATGAA GCGCGAAACG CCGTAGCGGC TACGCGTTAT CCGCCTGCCG GTATTCGCGG CGTGGGCAGC GCGCTGGCGC GGGCATCGCG CTGGAATCGC ATTCCGGACT ATCTCCACCA GGCCAACGAC GCCATGTGCG TACTGGTGCA GATTGAAACG CGTGAGGCGA TGAGCAATCT GGCGTCAATT CTCGACGTGG ATGGCATTGA CGGCGTGTTT ATTGGTCCGG CGGACCTCAG CGCCGATATG GGCTTTGCCG GCAATCCGCA GCACCCGGAA GTGCAGGCGG CGATTGAGAA CGCCATCGTG CAGATACGCG CGGCGGGGAA AGCGCCGGGG ATTCTGATGG CCAATGAAGC ACTGGCGAAA CGTTATCTGG AACTGGGGGC GCTATTTGTC GCCGTCGGCG TTGACACCAC GCTGCTGGCG CGCGGCGCGG AGGTGCTGGC GGCGCGCTTT GGCGCAGAAA AAAAACTGTC CGGTGCGTCC GGCGTCTATT AA
|
Protein sequence | MKNAFKDALK AGRPQIGLWL GLANSYSAEL LAGAGFDWLL IDGEHAPNNV QTVLTQLQAI APYPSQPVVR PSWNDPVQIK QLLDVGAQTL LIPMVQNADE ARNAVAATRY PPAGIRGVGS ALARASRWNR IPDYLHQAND AMCVLVQIET REAMSNLASI LDVDGIDGVF IGPADLSADM GFAGNPQHPE VQAAIENAIV QIRAAGKAPG ILMANEALAK RYLELGALFV AVGVDTTLLA RGAEVLAARF GAEKKLSGAS GVY
|
| |