Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_02394 |
Symbol | hcaE |
ID | 8115730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 2534047 |
End bp | 2535408 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644848596 |
Product | hypothetical protein |
Protein accession | YP_003000169 |
Protein GI | 251785865 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACAC CCTCAGATTT GAACATTTAC CAACTGATTG ACACCCAAAA CGGTCGGGTC ACTCCGCGTA TTTATACCGA CCCGGACATT TACCAACTGG AGCTTGAGCG TATTTTCGGT CGTTGCTGGC TATTTCTCGC CCACGAAAGC CAGATCCCAA AACCCGGTGA TTTCTTTAAC ACCTACATGG GAGAAGATGC GGTTGTCGTA GTGCGTCAGA AAGACGGCAG CATCAAGGCG TTTCTCAACC AATGCCGCCA CCGGGCCATG CGTGTGAGTT ATGCAGATTG CGGCAACACT CGCGCCTTTA CCTGCCCGTA TCACGGCTGG TCTTATGGCA TTAACGGCGA GTTGATCGAT GTACCGCTGG AACCTCGCGC CTATCCACAA GGGTTGTGTA AATCCCACTG GGGGCTAAAC GAAGTTCCTT GTGTGGAGAG TTATAAAGGG CTGATTTTTG GCAACTGGGA TACCAGCGCA CCGGGCCTGC GTGATTACCT GGGTGACATT GCCTGGTATC TGGATGGCAT GCTGGATCGT CGCGAAGGCG GCACCGAAAT TGTCGGCGGC GTACAAAAGT GGGTGATCAA CTGCAACTGG AAATTCCCGG CAGAGCAGTT CGCCAGTGAC CAGTATCATG CTCTGTTCAG CCATGCTTCT GCCGTTCAGG TATTAGGGGC GAAAGATGAT GGCAGCGATA AGCGCCTCGG TGATGGACAA ACCGCCCGCC CGGTGTGGGA AACCGCCAAA GATGCGCTGC AATTTGGTCA GGACGGTCAC GGTAGCGGTT TCTTCTTTAC TGAAAAACCG GATGCTAATG TCTGGGTCGA TGGCGCAGTT TCAAGCTATT ACCGCGAAAC CTATGCCGAA GCAGAACAAC GTTTAGGTGA AGTTCGCGCC CTGCGCCTGG CGGGTCATAA CAATATTTTC CCCACGCTTT CATGGCTCAA CGGCACTGCC ACGCTCCGCG TCTGGCATCC GCGCGGCCCT GATCAAGTTG AAGTGTGGGC GTTCTGTATT ACTGACAAAG CCGCCTCCGA TGAAGTTAAA GCCGCTTTTG AAAACAGCGC CACTCGTGCT TTTGGTCCTG CTGGTTTTCT CGAGCAGGAT GACTCGGAGA ACTGGTGTGA AATCCAGAAA TTGCTTAAAG GCCACCGCGC CCGCAACAGC AAACTGTGTC TGGAAATGGG GCTTGGTCAG GAAAAGCGTC GCGACGACGG CATTCCTGGC ATTACTAACT ATATTTTCTC AGAAACTGCC GCTCGCGGAA TGTACCAACG TTGGGCCGAT CTCCTGAGTA GCGAAAGCTG GCAAGAAGTG CTCGATAAAA CCGCCGCTTA CCAGCAGGAG GTGATGAAAT GA
|
Protein sequence | MTTPSDLNIY QLIDTQNGRV TPRIYTDPDI YQLELERIFG RCWLFLAHES QIPKPGDFFN TYMGEDAVVV VRQKDGSIKA FLNQCRHRAM RVSYADCGNT RAFTCPYHGW SYGINGELID VPLEPRAYPQ GLCKSHWGLN EVPCVESYKG LIFGNWDTSA PGLRDYLGDI AWYLDGMLDR REGGTEIVGG VQKWVINCNW KFPAEQFASD QYHALFSHAS AVQVLGAKDD GSDKRLGDGQ TARPVWETAK DALQFGQDGH GSGFFFTEKP DANVWVDGAV SSYYRETYAE AEQRLGEVRA LRLAGHNNIF PTLSWLNGTA TLRVWHPRGP DQVEVWAFCI TDKAASDEVK AAFENSATRA FGPAGFLEQD DSENWCEIQK LLKGHRARNS KLCLEMGLGQ EKRRDDGIPG ITNYIFSETA ARGMYQRWAD LLSSESWQEV LDKTAAYQQE VMK
|
| |