Gene SeHA_C1218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C1218 
SymbolhpaA 
ID6491712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp1200149 
End bp1201045 
Gene Length897 bp 
Protein Length298 aa 
Translation table11 
GC content52% 
IMG OID642741457 
Product4-hydroxyphenylacetate catabolism regulatory protein HpaA 
Protein accessionYP_002045108 
Protein GI194448747 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID[TIGR02297] 4-hydroxyphenylacetate catabolism regulatory protein HpaA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.398586 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCCAAC GTGCGATCGC CAATATTGAT ATCAGCAAAG AGTATGACGA AAGCATGGGC 
AGTAACGATG TGCATTATCA GTCGTTTGCT CGTATGGCGG ATTTCTTTGG TCGTGATATG
CAGGCGCATC GCCACGACCA GTTTTTTCAA ATGCACTTTC TTGATACCGG GCAGATTGAG
CTACAGCTCG ACGATCATCG CTATTCGGTG CAGGCGCCGC TATTTGTGCT TACGCCGCCC
TCGGTGCCGC ATGCTTTTAT TACCGAATCG GATAGCGATG GCCATGTTCT GACGGTACGC
GAAGAGCTGG TTTGGCCGCT GCTGGAAGTG CTTTATCCCG GCACCAGAGA GGCCTTTGGC
CTGCCGGGAA TCTGCCTGTC GCTGGCGGAT AAACCCAACG AGCTGGCGGC GCTCAAACAT
TACTGGCAGC TAATTGAGCG GGAGTCCACG GAACAACTGG CTGGCTGCGA ACATACCCTG
GTGCTACTGG CGCAGGCGGT ATTTACCTTG CTGTTGCGTA ATGCGAAGCT GGACGATCAC
GCCGCAACCG GGATGCGCGG TGAACTGAAA CTTTTTCAGC GCTTTACCCT GTTAATTGAC
AACCACTTCC ATCAGCACTG GACGGTGCCC GATTATGCCT GCGAGCTGCA TATTACCGAA
TCTCGTTTGA CCGATATTTG CCGACGTTTT GCTAATCGCC CGCCTAAACG CCTGATTTTT
GATCGGCAAT TACGCGAGGC GAAACGACTG CTGCTTTTTT CCGACAATGC TGTCAACGAG
ATCGCCTGGC AATTAGGTTT TAAAGATCCG GCTTATTTCG CCCGTTTCTT TAATCGCCTT
GCTGGCTGTT CTCCTTCGCA GTTTCGCCAA CGTGAAGTTC CCTCTTTTCT CAACTAA
 
Protein sequence
MCQRAIANID ISKEYDESMG SNDVHYQSFA RMADFFGRDM QAHRHDQFFQ MHFLDTGQIE 
LQLDDHRYSV QAPLFVLTPP SVPHAFITES DSDGHVLTVR EELVWPLLEV LYPGTREAFG
LPGICLSLAD KPNELAALKH YWQLIEREST EQLAGCEHTL VLLAQAVFTL LLRNAKLDDH
AATGMRGELK LFQRFTLLID NHFHQHWTVP DYACELHITE SRLTDICRRF ANRPPKRLIF
DRQLREAKRL LLFSDNAVNE IAWQLGFKDP AYFARFFNRL AGCSPSQFRQ REVPSFLN