Gene EcSMS35_2691 details


Gene Information

Locus tag: EcSMS35_2691
Symbol: hcaE
ID: 6145470
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2764366
End bp: 2765727
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 53%
IMG OID: 641617562
Product: 3-phenylpropionate dioxygenase alpha subunit
Protein accession: YP_001744727
Protein GI: 170680180
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
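
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but it is consistent with the listed gene length) and that the CDS ends in a single stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2764366
END_BP = 2765727
GENE_LENGTH_BP = 1362
PROTEIN_LENGTH_AA = 453


def span_length(start: int, end: int) -> int:
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, ASSUMING one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should agree with the values listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```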


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.244955
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACAC CCTCAGATTT GAACATTTAC CAACTGATTG ACACCCAAAA CGGTCGGGTC 
ACTCCGCGTA TTTATACCGA CCCGGACATT TACCAACTGG AGCTTGAGCG CATTTTCGGT
CGTTGCTGGC TATTTCTCGC CCACGAAAGC CAGATCCCAA AACCTGGTGA TTTCTTTAAC
ACCTACATGG GGGAAGATGC AGTTGTCGTG GTGCGCCAGA AAGACGGCAG CATCAAGGCG
TTTCTCAACC AGTGCCGCCA CCGGGCCATG CGTGTGAGTT ATGCAGATTG CGGCAACACT
CGCGCCTTTA CCTGTCCGTA TCACGGCTGG TCTTATGGCA TTAACGGCGA GTTGATCGAT
GTACCGCTGG AACCTCGCGC CTACCCACAA GGGTTGTGTA AATCCCACTG GGGACTAAAC
GAAGTTCCTT GTGTGGAGAG TTATAAAGGG CTGATTTTTG GCAACTGGGA TACCAGCGCA
CCGGGCCTGC GTGATTACCT TGGTGACATT GCCTGGTATC TGGATGGCAT GCTGGATCGT
CGCGAAGGCG GCACCGAAAT TGTCGGCGGC GTACAAAAGT GGGTGATCAA CTGCAACTGG
AAATTCCCGG CAGAGCAGTT CGCCAGTGAC CAGTATCATG CTCTGTTCAG CCATGCTTCT
GCCGTTCAGG TATTAGGGGC GAAAGATGAT GGCAGCGATA AGCGTCTCGG TGATGGACAA
ACTGCCCGCC CGGTGTGGGA AACCGCCAAA GATGCGCTGC AGTTTGGTCA GGACGGTCAC
GGCAGCGGTT TCTTCTTTAC TGAAAAACCG GATGCTAATG TCTGGGTCGA TGGCGCAGTT
TCCAGCTATT ACCGCGAAAC CTATGCCGAA GCAGAACAAC GTTTAGGTGA AGTTCGTGCC
CTGCGCCTGG CGGGTCATAA CAATATTTTC CCCACGCTTT CATGGCTCAA CGGCACTGCC
ACGCTCCGCG TCTGGCATCC GCGCGGCCCT GATCAAGTCG AAGTGTGGGC GTTCTGTATT
ACTGACAAAG CTGCCTCCGA TGAAGTTAAA GCCGCTTTTG AAAACAGCGC CACTCGTGCT
TTTGGTCCTG CTGGTTTTCT CGAGCAGGAT GACTCGGAGA ACTGGTGTGA AATCCAGAAA
TTGCTTAAAG GCCACCGCGC CCGCAACAGC AAACTGTGTC TGGAAATGGG GCTTGGTCAG
GAAAAGCACC GCGACGACGG CATTCCTGGC ATTACTAACT ATATCTTTTC AGAAACGGCC
GCTCGTGGAA TGTACCAACG CTGGGCCGAT CTTCTGAGTA GCGAAAGCTG GCAGGAAGTG
CTCGATAAAA CCGCCGCTTA CCAGCAGGAG GTGATGAAAT GA
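
A GC content figure for a sequence laid out in space-separated blocks of ten can be recomputed along the following lines. This is a minimal sketch, not the method the database itself uses: the function name is hypothetical, and counting only A, C, G and T while ignoring whitespace is an assumption.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Spaces, line breaks and any other characters are skipped
    (an assumption made for this sketch).
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First block of the gene sequence above: 1 G and 4 C out of 10 bases.
    print(gc_percent("ATGACCACAC"))  # 50.0
```

Passing the full gene sequence, with its spaces and line breaks left in place, gives a value that can be compared with the GC content field of the record.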
 
Protein sequence
MTTPSDLNIY QLIDTQNGRV TPRIYTDPDI YQLELERIFG RCWLFLAHES QIPKPGDFFN 
TYMGEDAVVV VRQKDGSIKA FLNQCRHRAM RVSYADCGNT RAFTCPYHGW SYGINGELID
VPLEPRAYPQ GLCKSHWGLN EVPCVESYKG LIFGNWDTSA PGLRDYLGDI AWYLDGMLDR
REGGTEIVGG VQKWVINCNW KFPAEQFASD QYHALFSHAS AVQVLGAKDD GSDKRLGDGQ
TARPVWETAK DALQFGQDGH GSGFFFTEKP DANVWVDGAV SSYYRETYAE AEQRLGEVRA
LRLAGHNNIF PTLSWLNGTA TLRVWHPRGP DQVEVWAFCI TDKAASDEVK AAFENSATRA
FGPAGFLEQD DSENWCEIQK LLKGHRARNS KLCLEMGLGQ EKHRDDGIPG ITNYIFSETA
ARGMYQRWAD LLSSESWQEV LDKTAAYQQE VMK
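
The protein sequence can be checked against the gene sequence by translating the CDS codon by codon. The sketch below builds the codon table from the usual NCBI layout (bases ordered T, C, A, G). Translation table 11 uses the same codon-to-amino-acid assignments as the standard table and differs in its permitted start codons. The sketch does not handle alternative start codons, which is not an issue here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string; whitespace is ignored, '*' marks a stop."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


if __name__ == "__main__":
    # First seven codons and last four codons of the gene sequence above.
    print(translate("ATGACCACAC CCTCAGATTT G"))  # MTTPSDL
    print(translate("GTGATGAAAT GA"))            # VMK*
```

Translating the full gene sequence and removing the trailing `*` should yield a string that can be compared directly with the protein sequence once its spaces and line breaks are removed.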