Gene EcHS_A1474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1474 
SymbolpaaN 
ID5591622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1474790 
End bp1476835 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content54% 
IMG OID640920631 
Productbifunctional aldehyde dehydrogenase/enoyl-CoA hydratase 
Protein accessionYP_001458187 
Protein GI157160869 
COG category[C] Energy production and conversion
[I] Lipid transport and metabolism 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases
[COG2030] Acyl dehydratase 
TIGRFAM ID[TIGR02278] phenylacetic acid degradation protein paaN 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCAGT TAGACAGTTT CTTATCCGGT ACCTGGCAGT CTGGCCGGGG CCGTAGCCGT 
TTGATTCACC ACGCCATTAG CGGCGAGGCA TTATGGGAAG TGACCAGTGA AGGTCTTGAT
ATGGCGGCTG CCCGCCAGTT TGCCATTGAA AAAGGTGCCC CCGCCCTCCG CGCGATGACC
TTTATCGAAC GTGCGGCGAT GCTTAAAGCG GTCGCTAAAC ATCTGCTGAG TGAAAAAGAG
CGCTTCTATG CTCTTTCTGC GCAAACAGGC GCAACGCGGG CAGACAGTTG GGTTGATATT
GAAGGCGGTA TTGGGACGTT ATTTACTTAC GCCAGCCTCG GTAGCCGGGA GCTGCCTGAC
GATACGCTGT GGCCGGAAGA TGAATTGATC CCCTTATCGA AAGAAGGTGG ATTTGCCGCG
CGCCATGTAC TGACCTCAAA GTCAGGCGTG GCAGTGCATA TTAACGCCTT TAACTTCCCC
TGCTGGGGAA TGCTGGAAAA GCTGGCACCA ACGTGGCTGG GCGGAATGCC AGCCATCATC
AAACCAGCTA CCGCGACGGC CCAACTGACT CAGGCGATGG TGAAATCAAT TGTCGATAGT
GGTCTTGTTC CCGAAGGCGC AATTAGTCTG ATCTGCGGTA GTGCGGGCGA CCTTTTGGAT
CATCTGGACA GCCAGGATGT GGTGACTTTC ACGGGGTCCG CGACGACCGG ACAGATGCTG
CGAGTTCAGC CAAATATCGT TGCCAAATCT ATCCCCTTCA CGATGGAAGC TGATTCCCTG
AACTGCTGCG TACTGGGCGA AGATGTCACC CCGGATCAAC CGGAGTTTGC GCTGTTTATT
CGTGAAGTTG TGCGTGAGAT GACCACAAAA GCCGGGCAAA AATGTACGGC AATCCGGCGG
ATTATTGTGC CGCAGGCATT GGTTAATGCT GTCAGTGATG CTCTGGTTGC GCGATTACAG
AAAGTCGTGG TCGGTGATCC TGCACAGGAA GGTGTGAAAA TGGGCGCACT GGTAAATGCT
GAACAGCGTG CTGATGTGCA GGAAAAAGTG AACACATTGC TGGCTGCAGG ATGCGAGATT
CGCCTCGGTG GTCAGGCGGA TTTATCTGCT GCGGGTGCAT TCTTCCCGCC AACCTTATTG
TACTGTCCGC AGCCGGATGA AACACCGGCG GTACATGCAA CAGAAGCCTT TGGCCCTGTC
GCAACGCTGA TGCCAGCACA AAACCAGCAA CATGCTCTGC AACTGGCTTG TGCAGGCGGC
GGTAGCCTTG CGGGAACGCT GGTGACGGCT GATCCGCAAA TTGCGCGTCA GTTTATTGCC
GACGCGGCAC GTACGCATGG GCGAATTCAG ATCCTCAATG AAGAGTCGGC AAAAGAATCC
ACCGGGCATG GCTCCCCACT GCCACAACTG GTACATGGTG GGCCTGGTCG CGCAGGAGGC
GGTGAAGAAT TAGGTGGTTT ACGAGCGGTG AAACATTACA TGCAGCGAAC CGCTATACAG
GGTAGCCCGT CGATGCTTGC CGCTATCAGT AAACAGTGGG TGCGTGGTGC GAAAGTCGAA
GAAGATCGTA TTCATCCGTT CCGCAAATAT TTTGAGGAGC TGCAACCAGG CGACAGCCTG
CTGACTCCCC GCCGCACAAT GACAGAGGCC GATATTGTTA ACTTTGCTTG CCTCAGCGGC
GATCATTTCT ATGCACATAT GGATAAGATT GCTGCTGCCG AATCTATTTT CGGTGAGCGG
GTGGTGCATG GGTATTTTGT GCTTTCTGCG GCTGCGGGTC TGTTTGTCGA TGACGGTGTC
GGTCCGGTCA TTGCTAACTA CGGGCTGGAA AGCTTGCGTT TTATCGAACC CGTAAAGCCA
GGCGATACCA TCCAGGTGCG TCTCACCTGT AAGCGCAAGA CGCTGAAAAA ACAGCGTAGC
GCAGAAGAAA AACCAACAGG TGTGGTGGAA TGGGCTGTAG AGGTATTCAA TCAGCATCAA
ACCCCGGTGG CGCTGTATTC AATTCTGACG CTGGTGGCCA GGCAGCACGG TGATTTTGTC
GATTAA
 
Protein sequence
MQQLDSFLSG TWQSGRGRSR LIHHAISGEA LWEVTSEGLD MAAARQFAIE KGAPALRAMT 
FIERAAMLKA VAKHLLSEKE RFYALSAQTG ATRADSWVDI EGGIGTLFTY ASLGSRELPD
DTLWPEDELI PLSKEGGFAA RHVLTSKSGV AVHINAFNFP CWGMLEKLAP TWLGGMPAII
KPATATAQLT QAMVKSIVDS GLVPEGAISL ICGSAGDLLD HLDSQDVVTF TGSATTGQML
RVQPNIVAKS IPFTMEADSL NCCVLGEDVT PDQPEFALFI REVVREMTTK AGQKCTAIRR
IIVPQALVNA VSDALVARLQ KVVVGDPAQE GVKMGALVNA EQRADVQEKV NTLLAAGCEI
RLGGQADLSA AGAFFPPTLL YCPQPDETPA VHATEAFGPV ATLMPAQNQQ HALQLACAGG
GSLAGTLVTA DPQIARQFIA DAARTHGRIQ ILNEESAKES TGHGSPLPQL VHGGPGRAGG
GEELGGLRAV KHYMQRTAIQ GSPSMLAAIS KQWVRGAKVE EDRIHPFRKY FEELQPGDSL
LTPRRTMTEA DIVNFACLSG DHFYAHMDKI AAAESIFGER VVHGYFVLSA AAGLFVDDGV
GPVIANYGLE SLRFIEPVKP GDTIQVRLTC KRKTLKKQRS AEEKPTGVVE WAVEVFNQHQ
TPVALYSILT LVARQHGDFV D