Gene SeAg_B1067 details


Gene Information

Locus tag: SeAg_B1067
Symbol: hpaA
ID: 6795970
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 1065413
End bp: 1066309
Gene Length: 897 bp
Protein Length: 298 aa
Translation table: 11
GC content: 52%
IMG OID: 642775336
Product: 4-hydroxyphenylacetate catabolism regulatory protein HpaA
Protein accession: YP_002145977
Protein GI: 197249446
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID: [TIGR02297] 4-hydroxyphenylacetate catabolism regulatory protein HpaA


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGCCAAC GTGCGATCGC CAATATTGAT ATCAGCAAAG AGTATGACGA AAGCATGGGC 
AGTAACGATG TGCATTATCA GTCGTTTGCT CGTATGGCGG ATTTCTTTGG TCGTGATATG
CAGGCGCATC GCCACGACCA GTTTTTTCAA ATGCACTTTC TTGATACCGG GCAGATTGAG
CTACAGCTCG ACGATCATCG CTATTCGGTG CAGGCGCCGC TATTTGTGCT TACGCCGCCC
TCGGTGCCGC ATGCTTTTAT TACCGAATCG GATAGCGATG GCCATGTTCT GACGGTACGC
GAAGAGCTGG TTTGGCCGCT GCTGGAAGTG CTTTATCCCG GCACCAGAGA GGCCTTCGGC
CTGCCGGGAA TCTGTCTGTC GCTGGCGGAT AAACCCAACG AGCTGGCGGC GCTCAAACAT
TACTGGCAGC TAATTGAGCG GGAGTCCACG GAACAACTGG CTGGCTGCGA ACATACCCTG
GTGCTACTGG CGCAGGCGGT ATTTACCTTG CTGTTGCGTA ATGCGAAGCT GGACGATCAC
GCCGCAACCG GGATGCGCGG TGAACTGAAA CTTTTTCAGC GCTTTACCCT GTTAATTGAC
AACCACTTCC ATCAGCACTG GACGGTGCCC GATTATGCCT GCGAGCTGCA TATTACCGAA
TCTCGTTTGA CCGATATTTG CCGACGTTTT GCTAATCGCC CGCCTAAACG CCTGATTTTT
GATCGGCAAT TACGCGAGGC GAAACGACTG CTGCTTTTTT CCGACAATGC TGTCAACGAG
ATCGCCTGGC AATTAGGTTT TAAAGATCCG GCTTATTTCG CCCGTTTCTT TAATCGCCTT
GCTGGCTGTT CTCCTTCGCA GTTTCGCCAA CGTGAAGTTC CCTCTTTTCT CAACTAA
 
Protein sequence
MCQRAIANID ISKEYDESMG SNDVHYQSFA RMADFFGRDM QAHRHDQFFQ MHFLDTGQIE 
LQLDDHRYSV QAPLFVLTPP SVPHAFITES DSDGHVLTVR EELVWPLLEV LYPGTREAFG
LPGICLSLAD KPNELAALKH YWQLIEREST EQLAGCEHTL VLLAQAVFTL LLRNAKLDDH
AATGMRGELK LFQRFTLLID NHFHQHWTVP DYACELHITE SRLTDICRRF ANRPPKRLIF
DRQLREAKRL LLFSDNAVNE IAWQLGFKDP AYFARFFNRL AGCSPSQFRQ REVPSFLN
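
The lengths in this record can be cross-checked against the sequence blocks. The following is a minimal sketch, not part of the original record: the helper names `clean_sequence`, `gc_percent`, and `expected_protein_length` are hypothetical, and it assumes a complete CDS whose final codon is a stop codon, so the protein has one residue fewer than the number of codons.

```python
def clean_sequence(block: str) -> str:
    """Drop the spaces and line breaks used to format a sequence block."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# First line of the gene sequence block, copied from the record.
first_line = "ATGTGCCAAC GTGCGATCGC CAATATTGAT ATCAGCAAAG AGTATGACGA AAGCATGGGC"
fragment = clean_sequence(first_line)

print(len(fragment))                  # 60 bases: six groups of ten
print(expected_protein_length(897))   # 298, matching the listed protein length
```

Passing the full gene sequence block through `clean_sequence` and then `gc_percent` gives a value to compare with the GC content listed above; the coordinate span can be checked the same way, since End bp minus Start bp plus one is 1066309 - 1065413 + 1 = 897.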