Gene Information |
Locus tag | SNSL254_A1203 |
Symbol | hpaA |
ID | 6484389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 1197760 |
End bp | 1198656 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642736605 |
Product | 4-hydroxyphenylacetate catabolism regulatory protein HpaA |
Protein accession | YP_002040363 |
Protein GI | 194445424 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | [TIGR02297] 4-hydroxyphenylacetate catabolism regulatory protein HpaA |
|
|
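The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention, so treat both as assumptions. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1197760
END_BP = 1198656
GENE_LENGTH_BP = 897
PROTEIN_LENGTH_AA = 298


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 897
    print(expected_protein_length(GENE_LENGTH_BP))  # 298
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields of the record.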
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 0.375104 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCCAAC GTGCGATCGC CAATATTGAT ATCAGCAAAG AGTATGACGA AAGCATGGGC AGTAACGATG TGCATTATCA GTCGTTTGCT CGTATGGCGG ATTTCTTTGG TCGTGATATG CAGGCGCATC GCCACGACCA GTTTTTTCAA ATGCACTTTC TTGATACCGG GCAGATTGAG CTACAGCTCG ACGATCATCG CTATTCGGTG CAGGCGCCGC TATTTGTGCT TACGCCGCCC TCGGTGCCGC ATGCTTTTAT TACCGAATCG GATAGCGATG GCCATGTTCT GACGGTACGC GAAGAGCTGG TTTGGCCGCT GCTGGAAGTG CTTTATCCCG GCACCAGAGA GGCCTTTGGC CTGCCGGGAA TCTGCCTGTC GCTGGCGGAT AAACCCAACG AGCTGGCGGC GCTCAAACAT TACTGGCAGC TAATTGAGCG GGAGTCCACG GAACAACTGG CTGGCTGCGA ACATACCCTG GTGCTACTGG CGCAGGCGGT ATTTACCTTG CTGTTGCGTA ATGCGAAGCT GGACGATCAC GCCGCAACCG GGATGCGCGG TGAACTGAAA CTTTTTCAGC GCTTTACCCT GTTAATTGAC AACCACTTCC ATCAGCACTG GACGGTGCCC GATTATGCCT GCGAGCTGCA TATTACCGAA TCTCGTTTGA CCGATATTTG CCGACGTTTT GCTAATCGCC CGCCTAAACG CCTGATTTTT GATCGGCAAT TACGCGAGGC GAAACGACTG CTGCTTTTTT CCGACAATGC TGTCAACGAG ATCGCCTGGC AATTAGGTTT TAAAGATCCG GCTTATTTCG CCCGTTTCTT TAATCGCCTT GCTGGCTGTT CTCCTTCGCA GTTTCGCCAA CGTGAAGTTC CCTCTTTTCT CAACTAA
|
Protein sequence | MCQRAIANID ISKEYDESMG SNDVHYQSFA RMADFFGRDM QAHRHDQFFQ MHFLDTGQIE LQLDDHRYSV QAPLFVLTPP SVPHAFITES DSDGHVLTVR EELVWPLLEV LYPGTREAFG LPGICLSLAD KPNELAALKH YWQLIEREST EQLAGCEHTL VLLAQAVFTL LLRNAKLDDH AATGMRGELK LFQRFTLLID NHFHQHWTVP DYACELHITE SRLTDICRRF ANRPPKRLIF DRQLREAKRL LLFSDNAVNE IAWQLGFKDP AYFARFFNRL AGCSPSQFRQ REVPSFLN
|
| |
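The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch, not the pipeline used to build this record. It relies on the fact that table 11 assigns amino acids to codons exactly as the standard code does (the two differ only in permitted start codons), and it assumes the sequence contains only A, C, G and T. The helper names `clean`, `translate` and `gc_percent` are hypothetical. The example uses only the first 30 bases of the gene sequence; the full sequence can be pasted in the same way, spaces included.

```python
# Codon table in the conventional TCAG ordering. The amino acid
# assignments are identical for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    first_30 = "ATGTGCCAAC GTGCGATCGC CAATATTGAT"
    print(translate(first_30))  # MCQRAIANID
```

The printed residues match the first block of the protein sequence in the record. Calling `gc_percent` on the full gene sequence gives a value that can be compared with the GC content field, which is reported rounded to a whole percent.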