Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B1062 |
Symbol | hpaD |
ID | 6795107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 1060965 |
End bp | 1061816 |
Gene Length | 852 bp |
Protein Length | 283 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642775331 |
Product | 3,4-dihydroxyphenylacetate 2,3-dioxygenase |
Protein accession | YP_002145972 |
Protein GI | 197251582 |
COG category | [S] Function unknown |
COG ID | [COG3384] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02298] 3,4-dihydroxyphenylacetate 2,3-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCAAGT TAGCGTTAGC AGCAAAAATT ACCCACGTGC CGTCGATGTA TCTTTCTGAA CTACCAGGAA AAAATCACGG TTGTCGTCAG GCAGCCATTG ATGGGCATAT TGAAATTGGC AAGCGTTGCC GCGAAATGGG CGTTGACACC ATTATCGTAT TCGACACCCA CTGGCTGGTG AATAGCGCTT ACCACATTAA TTGTGCCGAC CATTTCCAGG GCGTCTATAC CAGCAACGAA TTGCCGCACT TTATTCGCGA CATGACCTAT GACTATGACG GTAATCCGGA GCTCGGCCAC CTGATCGCCG ACGAGGCAGT CAAACTGGGC GTGCGCGCTA AAGCGCATAA CATCCCGAGC CTGAAGCTGG AGTATGGCAC GCTGGTACCG ATGCGCTACA TGAACAGCGA CAAGCACTTC AAAGTGGTCT CCATTTCGGC GTTCTGCACT GTGCATGATT TTGCCGACAG TCGCAGACTG GGCGAAGCCA TTCTCAAGGC GATTGAGAAA TATGACGGTA CCGTAGCGGT ATTCGCCAGT GGTTCTCTGT CGCACCGTTT TATTGACGAC CAACGGGCGG AAGAGGGGAT GAACAGCTAC ACCCGCGAGT TCGATCATCA AATGGACGAG CGCGTGGTCA AGCTGTGGCG CGAAGGCAAA TTCAAGGAGT TTTGCACCAT GTTGCCGGAG TACGCCGACT ACTGCTACGG CGAAGGCAAC ATGCACGACA CGGTCATGCT GCTGGGAATG CTGGGTTGGG ACAAATACGA CGGCAAGGTG GAGTTCATCA CCGACCTGTT CGCCAGCTCC GGTACCGGCC AGGTAAACGC TGTTTTCCCG CTGCCTGCGT AA
|
Protein sequence | MGKLALAAKI THVPSMYLSE LPGKNHGCRQ AAIDGHIEIG KRCREMGVDT IIVFDTHWLV NSAYHINCAD HFQGVYTSNE LPHFIRDMTY DYDGNPELGH LIADEAVKLG VRAKAHNIPS LKLEYGTLVP MRYMNSDKHF KVVSISAFCT VHDFADSRRL GEAILKAIEK YDGTVAVFAS GSLSHRFIDD QRAEEGMNSY TREFDHQMDE RVVKLWREGK FKEFCTMLPE YADYCYGEGN MHDTVMLLGM LGWDKYDGKV EFITDLFASS GTGQVNAVFP LPA
|
| |