Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4578 |
Symbol | hpaX |
ID | 5595199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4586907 |
End bp | 4588283 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640923672 |
Product | 4-hydroxyphenylacetate permease |
Protein accession | YP_001461112 |
Protein GI | 157163794 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2223] Nitrate/nitrite transporter |
TIGRFAM ID | [TIGR02332] 4-hydroxyphenylacetate permease |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 77 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACA CCTCACCTGC CATACCGGAG AGTATCGATC CGGCGAATCA GCATAAAGCG CTGACTGCCG GACAACAGGC GGTTATTAAG AAGCTATTTC GCCGCCTGAT CGTCTTTCTG TTCGTGCTGT TTATCTTCTC GTTCCTTGAT CGCATCAACA TCGGCTTTGC CGGACTCACG ATGGGACGCG ACCTCGGTCT GAGCGCCACC ATGTTTGGCC TCGCTACCAC CCTGTTCTAC GCCGCTTATG TCATCTTCGG CATTCCCAGC AACATTATGC TGAGTATTGT CGGTGCGCGG CGCTGGATCG CCACCATCAT GGTGCTTTGG GGCATCGCCT CTACTGCCAC CATGTTTGCC ACTGGCCCCA CCAGCTTGTA CGTACTGCGT ATACTGGTTG GCATTACCGA AGCCGGCTTT CTGCCTGGCA TTCTGCTGTA TTTAACCTTC TGGTTTCCAG CCTATTTCCG CGCCCGTGCC AACGCCTTGT TTATGGTTGC AATGCCGGTA ACGACAGCGT TGGGATCGCT CGTTTCCGGC TACATTTTGT CGCTGGATGG CGTAATGGCA TTAAAAGGCT GGCAGTGGCT GTTTTTGCTG GAAGGCTTCC CGTCGGTATT ACTCGGCGTC ATGGTGTGGT TCTGGCTTGA TGACTCACCG GACAAAGCTA AGTGGCTGAC GAAAGAAGAC AAAAAATGCC TGCAAGAGAT GATGGATAAC GATCGTCTGA CGCTGGTTCA GCCAGAGGGA GCCATCAGCC ACCACGCCAT GCAACAACGC AGCATGTGGC GGGAGATCTT CACTCCGGTG GTGATGATGT ATACCCTGGC GTATTTCTGC CTGACCAACA CACTTAGTGC GATCAGCATC TGGACACCGC AGATCCTGCA AAGCTTTAAT CAGGGCAGCA GTAATATCAC CATCGGCCTG CTGGCCGCCG TACCGCAGAT TTGTACCATT CTCGGGATGA TCTACTGGAG CCGTCACTCA GATCGCCGCC AGGAACGAAG GCATCACACC GCCCTTCCTT ATTTGTTCGC TGCCGCGGGT TGGTTACTGG CTTCGGCAAC TGATCACAAC ATGATCCAGA TGCTGGGGAT CATTATGGCT TCGACCGGAT CATTCAGCGC AATGGCGATT TTCTGGACAA CACCTGATCA GTCCATCAGC CTGCGGGCAC GAGCGATCGG TATTGCGGTG ATCAACGCCA CTGGCAACAT TGGCTCAGCG TTAAGTCCGT TTATGATCGG CTGGTTGAAA GATCTGACCG GCAGCTTTAA CAGTGGATTG TGGTTTGTTG CCGCGCTGCT GGTGATTGGT GCGGGGATTA TCTGGGCAAT TCCAATGCAG TCCTCCCGTC CGCGAGCGAC CCCGTAA
|
Protein sequence | MSDTSPAIPE SIDPANQHKA LTAGQQAVIK KLFRRLIVFL FVLFIFSFLD RINIGFAGLT MGRDLGLSAT MFGLATTLFY AAYVIFGIPS NIMLSIVGAR RWIATIMVLW GIASTATMFA TGPTSLYVLR ILVGITEAGF LPGILLYLTF WFPAYFRARA NALFMVAMPV TTALGSLVSG YILSLDGVMA LKGWQWLFLL EGFPSVLLGV MVWFWLDDSP DKAKWLTKED KKCLQEMMDN DRLTLVQPEG AISHHAMQQR SMWREIFTPV VMMYTLAYFC LTNTLSAISI WTPQILQSFN QGSSNITIGL LAAVPQICTI LGMIYWSRHS DRRQERRHHT ALPYLFAAAG WLLASATDHN MIQMLGIIMA STGSFSAMAI FWTTPDQSIS LRARAIGIAV INATGNIGSA LSPFMIGWLK DLTGSFNSGL WFVAALLVIG AGIIWAIPMQ SSRPRATP
|
| |