Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_04190 |
Symbol | hpaX |
ID | 8115085 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 4500120 |
End bp | 4501496 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644850330 |
Product | hypothetical protein |
Protein accession | YP_003001903 |
Protein GI | 251787599 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2223] Nitrate/nitrite transporter |
TIGRFAM ID | [TIGR02332] 4-hydroxyphenylacetate permease |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.580835 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACA CCTCACCTGC CATACCGGAG AGTATCGATC CGGCGAATCA GCATAAAGCG CTGACTGCCG GACAACAGGC GGTTATTAAG AAGCTATTTC GCCGCCTGAT CGTCTTTCTG TTCGTGCTGT TTATCTTCTC GTTCCTTGAT CGCATCAACA TCGGCTTTGC CGGACTCACG ATGGGACGCG ACCTCGGTCT GAGCGCCACC ATGTTTGGCC TCGCTACCAC CCTGTTCTAC GCCGCTTATG TCATCTTCGG CATTCCCAGC AACATTATGC TGAGTATTGT CGGTGCACGG CGCTGGATCG CCACCATCAT GGTGCTCTGG GGCATCGCCT CTACTGCTAC CATGTTTGCC ACTGGCCCCA CCAGCTTATA CGTACTGCGT ATACTGGTTG GCATTACCGA AGCCGGCTTT CTGCCTGGCA TTCTGCTCTA TTTAACCTTC TGGTTTCCGG CCTACTTCCG CGCCCGTGCC AACGCCTTGT TTATGGTGGC AATGCCGGTA ACGACAGCGT TGGGATCGAT CGTTTCCGGC TACATTTTGT CGCTGGATGG CGTAATGGCA TTAAAAGGCT GGCAGTGGCT GTTTTTGCTG GAAGGCTTCC CGTCGGTATT ACTCGGCGTC ATGGTGTGGT TCTGGCTTGA TGACTCACCG GACAAAGCTA AGTGGCTGAC GAAAGAAGAC AAAAAATGCC TGCAAGAGAT GATGGATAAC GATCGTCTGA CGCTGGTTCA GCCAGAGGGA GCCATCAGCC ACCATGCCAT GCAACAACGC AGCATGTGGC GGGAGATCTT CACTCCGGTG GTGATGATGT ATACCCTGGC GTATTTCTGC CTGACCAACA CACTTAGTGC GATCAGCATC TGGACACCGC AGATCCTGCA AAGCTTTAAT CAGGGCAGCA GTAATATCAC CATCGGCCTG CTGGCCGCCG TACCGCAGAT TTGTACCATT CTCGGGATGA TCTACTGGAG CCGTCACTCA GATCGCCGCC AGGAACGAAG GCATCACACC GCCCTTCCTT ATTTGTTCGC TGCCGCTGGT TGGTTACTGG CTTCGGCAAC TGATCACAAC ATGATCCAGA TGCTGGGGAT CATTATGGCT TCGACCGGAT CATTCAGCGC AATGGCGATT TTCTGGACAA CACCGGATCA GTCCATCAGC CTGCGGGCAC GAGCGATCGG TATTGCGGTG ATCAACGCCA CTGGCAACAT TGGTTCAGCA TTAAGTCCGT TTATGATCGG CTGGTTGAAA GATCTGACCG GCAGCTTTAA CAGTGGATTG TGGTTTGTTG CCGCGCTGCT GGTGATTGGT GCAGGGATTA TCTGGGCAAT TCCAATGCAG TCCTCCCGTC CGCGAGCGAC CCCGTAA
|
Protein sequence | MSDTSPAIPE SIDPANQHKA LTAGQQAVIK KLFRRLIVFL FVLFIFSFLD RINIGFAGLT MGRDLGLSAT MFGLATTLFY AAYVIFGIPS NIMLSIVGAR RWIATIMVLW GIASTATMFA TGPTSLYVLR ILVGITEAGF LPGILLYLTF WFPAYFRARA NALFMVAMPV TTALGSIVSG YILSLDGVMA LKGWQWLFLL EGFPSVLLGV MVWFWLDDSP DKAKWLTKED KKCLQEMMDN DRLTLVQPEG AISHHAMQQR SMWREIFTPV VMMYTLAYFC LTNTLSAISI WTPQILQSFN QGSSNITIGL LAAVPQICTI LGMIYWSRHS DRRQERRHHT ALPYLFAAAG WLLASATDHN MIQMLGIIMA STGSFSAMAI FWTTPDQSIS LRARAIGIAV INATGNIGSA LSPFMIGWLK DLTGSFNSGL WFVAALLVIG AGIIWAIPMQ SSRPRATP
|
| |