Gene SeSA_A1172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A1172 
SymbolhpaA 
ID6517775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp1154095 
End bp1154991 
Gene Length897 bp 
Protein Length298 aa 
Translation table11 
GC content52% 
IMG OID642746297 
Product4-hydroxyphenylacetate catabolism regulatory protein HpaA 
Protein accessionYP_002114106 
Protein GI194737210 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID[TIGR02297] 4-hydroxyphenylacetate catabolism regulatory protein HpaA 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.856893 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCCAAC GTGCGATCGC CAATATTGAT ATCAGCAAAG AGTATGACGA AAGCATGGGC 
AGTAACGATG TGCATTATCA GTCGTTTGCT CGTATGGCGG ATTTCTTTGG TCGTGATATG
CAGGCGCATC GCCACGACCA GTTTTTTCAA ATGCACTTTC TTGATACCGG GCAGATTGAG
CTACAGCTCG ACGATCATCG CTATTCGGTG CAGGCGCCGC TATTTGTGCT TACGCCGCCC
TCGGTGCCGC ATGCTTTTAT TACCGAATCG GATAGCGATG GCCATGTTCT GACGGTACGC
GAAGAGCTGG TTTGGCCGCT GCTGGAAGTG CTTTATCCCG GCACCAGAGA GGCCTTTGGC
CTGCCGGGAA TCTGCCTGTC GCTGGCGGAT AAACCCAACG AGCTGGCGGC GCTCAAACAT
TACTGGCAGC TAATTGAGCG GGAGTCCACG GAACAACTGG CTGGCTGCGA ACATACCCTG
GTGCTACTGG CGCAGGCGGT ATTTACCTTG CTGTTGCGTA ATGCGAAGCT GGACGATCAC
GCCGCAACCG GGATGCGCGG TGAACTGAAA CTTTTTCAGC GCTTTACCCT GTTAATTGAC
AACCACTTCC ATCAGCACTG GACGGTGCCC GATTATGCCT GCGAGCTGCA TATTACCGAA
TCTCGTTTGA CCGATATTTG CCGACGTTTT GCTAATCGCC CGCCTAAACG CCTGATTTTT
GATCGGCAAT TACGCGAGGC GAAACGACTG CTGCTTTTTT CCGACAATGC TGTCAACGAG
ATCGCCTGGC AATTAGGTTT TAAAGATCCG GCTTATTTCG CCCGTTTCTT TAATCGCCTT
GCTGGCTGTT CTCCTTCGCA GTTTCGCCAA CGTGAAGTTC CCTCTTTTCT CAACTAA
 
Protein sequence
MCQRAIANID ISKEYDESMG SNDVHYQSFA RMADFFGRDM QAHRHDQFFQ MHFLDTGQIE 
LQLDDHRYSV QAPLFVLTPP SVPHAFITES DSDGHVLTVR EELVWPLLEV LYPGTREAFG
LPGICLSLAD KPNELAALKH YWQLIEREST EQLAGCEHTL VLLAQAVFTL LLRNAKLDDH
AATGMRGELK LFQRFTLLID NHFHQHWTVP DYACELHITE SRLTDICRRF ANRPPKRLIF
DRQLREAKRL LLFSDNAVNE IAWQLGFKDP AYFARFFNRL AGCSPSQFRQ REVPSFLN