Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2691 |
Symbol | hcaE |
ID | 6145470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2764366 |
End bp | 2765727 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617562 |
Product | 3-phenylpropionate dioxygenase alpha subunit |
Protein accession | YP_001744727 |
Protein GI | 170680180 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.244955 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACAC CCTCAGATTT GAACATTTAC CAACTGATTG ACACCCAAAA CGGTCGGGTC ACTCCGCGTA TTTATACCGA CCCGGACATT TACCAACTGG AGCTTGAGCG CATTTTCGGT CGTTGCTGGC TATTTCTCGC CCACGAAAGC CAGATCCCAA AACCTGGTGA TTTCTTTAAC ACCTACATGG GGGAAGATGC AGTTGTCGTG GTGCGCCAGA AAGACGGCAG CATCAAGGCG TTTCTCAACC AGTGCCGCCA CCGGGCCATG CGTGTGAGTT ATGCAGATTG CGGCAACACT CGCGCCTTTA CCTGTCCGTA TCACGGCTGG TCTTATGGCA TTAACGGCGA GTTGATCGAT GTACCGCTGG AACCTCGCGC CTACCCACAA GGGTTGTGTA AATCCCACTG GGGACTAAAC GAAGTTCCTT GTGTGGAGAG TTATAAAGGG CTGATTTTTG GCAACTGGGA TACCAGCGCA CCGGGCCTGC GTGATTACCT TGGTGACATT GCCTGGTATC TGGATGGCAT GCTGGATCGT CGCGAAGGCG GCACCGAAAT TGTCGGCGGC GTACAAAAGT GGGTGATCAA CTGCAACTGG AAATTCCCGG CAGAGCAGTT CGCCAGTGAC CAGTATCATG CTCTGTTCAG CCATGCTTCT GCCGTTCAGG TATTAGGGGC GAAAGATGAT GGCAGCGATA AGCGTCTCGG TGATGGACAA ACTGCCCGCC CGGTGTGGGA AACCGCCAAA GATGCGCTGC AGTTTGGTCA GGACGGTCAC GGCAGCGGTT TCTTCTTTAC TGAAAAACCG GATGCTAATG TCTGGGTCGA TGGCGCAGTT TCCAGCTATT ACCGCGAAAC CTATGCCGAA GCAGAACAAC GTTTAGGTGA AGTTCGTGCC CTGCGCCTGG CGGGTCATAA CAATATTTTC CCCACGCTTT CATGGCTCAA CGGCACTGCC ACGCTCCGCG TCTGGCATCC GCGCGGCCCT GATCAAGTCG AAGTGTGGGC GTTCTGTATT ACTGACAAAG CTGCCTCCGA TGAAGTTAAA GCCGCTTTTG AAAACAGCGC CACTCGTGCT TTTGGTCCTG CTGGTTTTCT CGAGCAGGAT GACTCGGAGA ACTGGTGTGA AATCCAGAAA TTGCTTAAAG GCCACCGCGC CCGCAACAGC AAACTGTGTC TGGAAATGGG GCTTGGTCAG GAAAAGCACC GCGACGACGG CATTCCTGGC ATTACTAACT ATATCTTTTC AGAAACGGCC GCTCGTGGAA TGTACCAACG CTGGGCCGAT CTTCTGAGTA GCGAAAGCTG GCAGGAAGTG CTCGATAAAA CCGCCGCTTA CCAGCAGGAG GTGATGAAAT GA
|
Protein sequence | MTTPSDLNIY QLIDTQNGRV TPRIYTDPDI YQLELERIFG RCWLFLAHES QIPKPGDFFN TYMGEDAVVV VRQKDGSIKA FLNQCRHRAM RVSYADCGNT RAFTCPYHGW SYGINGELID VPLEPRAYPQ GLCKSHWGLN EVPCVESYKG LIFGNWDTSA PGLRDYLGDI AWYLDGMLDR REGGTEIVGG VQKWVINCNW KFPAEQFASD QYHALFSHAS AVQVLGAKDD GSDKRLGDGQ TARPVWETAK DALQFGQDGH GSGFFFTEKP DANVWVDGAV SSYYRETYAE AEQRLGEVRA LRLAGHNNIF PTLSWLNGTA TLRVWHPRGP DQVEVWAFCI TDKAASDEVK AAFENSATRA FGPAGFLEQD DSENWCEIQK LLKGHRARNS KLCLEMGLGQ EKHRDDGIPG ITNYIFSETA ARGMYQRWAD LLSSESWQEV LDKTAAYQQE VMK
|
| |