Gene ECD_02430 details


Gene Information

Locus tag: ECD_02430
Symbol: hcaE
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 2534837
End bp: 2536198
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: 3-phenylpropionate dioxygenase, large (alpha) subunit
Protein accession: ACT44250
Protein GI: 253978580
COG category:
COG ID:
TIGRFAM ID:
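
The coordinates and lengths listed above are consistent with each other, and the following minimal sketch shows the arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 2534837, 2536198
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # 1362 453
```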


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCACAC CCTCAGATTT GAACATTTAC CAACTGATTG ACACCCAAAA CGGTCGGGTC 
ACTCCGCGTA TTTATACCGA CCCGGACATT TACCAACTGG AGCTTGAGCG TATTTTCGGT
CGTTGCTGGC TATTTCTCGC CCACGAAAGC CAGATCCCAA AACCCGGTGA TTTCTTTAAC
ACCTACATGG GAGAAGATGC GGTTGTCGTA GTGCGTCAGA AAGACGGCAG CATCAAGGCG
TTTCTCAACC AATGCCGCCA CCGGGCCATG CGTGTGAGTT ATGCAGATTG CGGCAACACT
CGCGCCTTTA CCTGCCCGTA TCACGGCTGG TCTTATGGCA TTAACGGCGA GTTGATCGAT
GTACCGCTGG AACCTCGCGC CTATCCACAA GGGTTGTGTA AATCCCACTG GGGGCTAAAC
GAAGTTCCTT GTGTGGAGAG TTATAAAGGG CTGATTTTTG GCAACTGGGA TACCAGCGCA
CCGGGCCTGC GTGATTACCT GGGTGACATT GCCTGGTATC TGGATGGCAT GCTGGATCGT
CGCGAAGGCG GCACCGAAAT TGTCGGCGGC GTACAAAAGT GGGTGATCAA CTGCAACTGG
AAATTCCCGG CAGAGCAGTT CGCCAGTGAC CAGTATCATG CTCTGTTCAG CCATGCTTCT
GCCGTTCAGG TATTAGGGGC GAAAGATGAT GGCAGCGATA AGCGCCTCGG TGATGGACAA
ACCGCCCGCC CGGTGTGGGA AACCGCCAAA GATGCGCTGC AATTTGGTCA GGACGGTCAC
GGTAGCGGTT TCTTCTTTAC TGAAAAACCG GATGCTAATG TCTGGGTCGA TGGCGCAGTT
TCAAGCTATT ACCGCGAAAC CTATGCCGAA GCAGAACAAC GTTTAGGTGA AGTTCGCGCC
CTGCGCCTGG CGGGTCATAA CAATATTTTC CCCACGCTTT CATGGCTCAA CGGCACTGCC
ACGCTCCGCG TCTGGCATCC GCGCGGCCCT GATCAAGTTG AAGTGTGGGC GTTCTGTATT
ACTGACAAAG CCGCCTCCGA TGAAGTTAAA GCCGCTTTTG AAAACAGCGC CACTCGTGCT
TTTGGTCCTG CTGGTTTTCT CGAGCAGGAT GACTCGGAGA ACTGGTGTGA AATCCAGAAA
TTGCTTAAAG GCCACCGCGC CCGCAACAGC AAACTGTGTC TGGAAATGGG GCTTGGTCAG
GAAAAGCGTC GCGACGACGG CATTCCTGGC ATTACTAACT ATATTTTCTC AGAAACTGCC
GCTCGCGGAA TGTACCAACG TTGGGCCGAT CTCCTGAGTA GCGAAAGCTG GCAAGAAGTG
CTCGATAAAA CCGCCGCTTA CCAGCAGGAG GTGATGAAAT GA
 
Protein sequence
MTTPSDLNIY QLIDTQNGRV TPRIYTDPDI YQLELERIFG RCWLFLAHES QIPKPGDFFN 
TYMGEDAVVV VRQKDGSIKA FLNQCRHRAM RVSYADCGNT RAFTCPYHGW SYGINGELID
VPLEPRAYPQ GLCKSHWGLN EVPCVESYKG LIFGNWDTSA PGLRDYLGDI AWYLDGMLDR
REGGTEIVGG VQKWVINCNW KFPAEQFASD QYHALFSHAS AVQVLGAKDD GSDKRLGDGQ
TARPVWETAK DALQFGQDGH GSGFFFTEKP DANVWVDGAV SSYYRETYAE AEQRLGEVRA
LRLAGHNNIF PTLSWLNGTA TLRVWHPRGP DQVEVWAFCI TDKAASDEVK AAFENSATRA
FGPAGFLEQD DSENWCEIQK LLKGHRARNS KLCLEMGLGQ EKRRDDGIPG ITNYIFSETA
ARGMYQRWAD LLSSESWQEV LDKTAAYQQE VMK
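
The gene and protein sequences above can be related to each other with a short script. The sketch below is only an illustration: it strips the 10-base block spacing, translates codons with the standard codon assignments (which translation table 11 shares for sense and stop codons), and computes the GC fraction. The helper names `clean`, `translate`, and `gc_content` are hypothetical, and the special handling of alternative start codons in table 11 is not implemented, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Codon table in TCAG order; table 11 uses these same assignments
# for sense and stop codons (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGACCACAC CCTCAGATTT GAACATTTAC"))  # MTTPSDLNIY
```

Passing the full gene sequence to `translate` and `gc_content` should allow the listed protein sequence and GC content to be cross-checked against the nucleotide sequence.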