Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4577 |
Symbol | hpaA |
ID | 5595198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4586007 |
End bp | 4586897 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640923671 |
Product | 4-hydroxyphenylacetate catabolism regulatory protein HpaA |
Protein accession | YP_001461111 |
Protein GI | 157163793 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | [TIGR02297] 4-hydroxyphenylacetate catabolism regulatory protein HpaA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 73 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGTGACC GTCAGATTGC CAATATTGAT ATCAGCAAAG AGTACGATGA AAGCCTGGGC ACGGACGATG TGCATTATCA GTCCTTCGCC CGCATGGCGG CATTTTTTGG CCGCCATATG CTGCCACATC GCCACGAACA GTACTTTCAG ATGCATTTCC TCAATAGCGG ACAGATTGAG CTACAGCTTG ACGATCATCG CTACTCGGTG GAAGCGCCCC TGTTTGTCCT GACGCCGCCG TCAGTACCTC ATGCGTTTAT TACGGAGTCT GACGCCGACG GTCATGTGTT GACGGTACGG GAAGATCTGA TCTGGCCCCT GCTGGAAGTT CTTTATCCAG GCACTCGGGA AACCTTCGGC CTGCCGGGGA TTTGCCTGTC ACTGGCAGAT AAACCCGACG AACTGGCGGC GCTGGAACAC TATTGGCAAC TGATAGAGCG GGAATCGGTA GAACAACTGC CTGGACGGGA ACACACCCTG ACGTTACTGG CACAGGCAGT GTTCACCCTA CTGCTGCGTA ACGCAAAACT CGACGACCAT GCCGCCAGCG GAATGCGCGG AGAATTAAAA CTGTTCCAGC GTTTTCATAT GCTGATTGAA AGCCATTTTC ATCAGCACTG GACAGTACCG GATTACGCTA ACGAACTGCA TATCACCGAA TCACGCCTCA CGGACATCTG CCGCCGCTTT GCCAACCGTC CGCCAAAACG GTTGATTTTC GACAGGCAGC TGCGAGAAGC CAAGCGGCTG CTGCTGTTTT CTGATAACGC CGTGAACAAT ATTGCCTGGC AACTCGGTTT TAAGGATCCA GCTTATTTTG CGCGCTTTTT TAATCGCTTA GTCGGTTGCT CGCCCAGTGC TTATCGTGCC AAAAAAGTAC CTGTGACGTG A
|
Protein sequence | MCDRQIANID ISKEYDESLG TDDVHYQSFA RMAAFFGRHM LPHRHEQYFQ MHFLNSGQIE LQLDDHRYSV EAPLFVLTPP SVPHAFITES DADGHVLTVR EDLIWPLLEV LYPGTRETFG LPGICLSLAD KPDELAALEH YWQLIERESV EQLPGREHTL TLLAQAVFTL LLRNAKLDDH AASGMRGELK LFQRFHMLIE SHFHQHWTVP DYANELHITE SRLTDICRRF ANRPPKRLIF DRQLREAKRL LLFSDNAVNN IAWQLGFKDP AYFARFFNRL VGCSPSAYRA KKVPVT
|
| |