Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3774 |
Symbol | hcaD |
ID | 6968709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3496665 |
End bp | 3497867 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643387563 |
Product | phenylpropionate dioxygenase ferredoxin reductase subunit |
Protein accession | YP_002272016 |
Protein GI | 209399212 |
COG category | [R] General function prediction only |
COG ID | [COG0446] Uncharacterized NAD(FAD)-dependent dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAAA AAACGATCAT TATTGTCGGG GGCGGGCAAG CGGCGGCAAT GGCTGCGGCC TCGCTACGCC AGCAAGGGTT CACCGGTGAG CTGCATCTGT TTTCCGATGA GCAACATCTT CCTTATGAAC GCCCTCCGCT CTCGAAATCC ATGTTGCTGG AAGATTCCCC ACAGTTGCAG TCTGTGTTAC CCGCTCACTG GTGGCAGGAA AACAATGTTC ATCTGCATTC CGGTGTAACC ATCAAAACAT TGGGCCGCGA CACACGAGAG TTAGTGTTAG CCAACGGCGA AAGCTGGCAC TGGGATCAGC TTTTTATAGC AACCGGCGCG GCAGCTCGAC CGCTGCCGTT GCTTGATGCA CTGGGAGAAC GCTGCTTTAC CCTGCGCCAT GCCGGTGATG CCGCCAGACT GCGAGAAGTT CTGCAGCCCG AACGGTCAGT CGTGATTGTC GGTGCCGGAA CTATTGGTCT GGAACTGGCT GCCAGCGCCA CGCAGCGCAG ATGTAAGGTG ACAGTGATTG AACTGGCGGC AACCGTCATG GGCCGTAATG CACCACCGCC CGTGCAACGC TATCTTTTAC AGCGCCATCA GCAGGCTGGT GTGCGCATTC TGCTCAATAA TGCCATTGAA CATGTGGTCG ATGGTGAAAA AGTAGAACTG ACGCTGCAAA GTGGCGAGAC GCTTCAGGCC GATGTGGTGA TTTACGGTAT TGGTATCAGC GCCAACGACC AACTGGCTCG CGAGGCCAAC CTTGATACTA CCAATGGCAT TGTCATTGAT GAGGCTTGCC GCACCTGCGA TCCCGCGATC TTTGCCGGTG GCGATGTGGC AATCACTCGT CTTGATAATG GTGCACTACA CCGCTGCGAA AGCTGGGAAA ACGCCAATAA CCATGCGCAA ATTGCCGCTG CCGCAATGTT AGGGCTACCG CTTCCGCTAC TGCCGCCGCC GTGGTTCTGG AGTGATCAGT ACAGTGATAA CTTACAGTTT ATTGGCGATA TGCGTGGCGA TGACTGGCTT TGTCGTGGCA ACCCGGAAAC CCAGAAAGCG ATTTGGTTTA ATCTGCAAAA CGGCGTGCTT ATCGGTGCGG TAACGCTGAA TCAGGGACGT GAGATTCGCT CAATCCGCAA ATGGATCCAG AGCGGCAAAA CGTTTGATGC GAAACAGCTG ACAGATGAGA ACATCGCGCT TAAATCACTG TAA
|
Protein sequence | MKEKTIIIVG GGQAAAMAAA SLRQQGFTGE LHLFSDEQHL PYERPPLSKS MLLEDSPQLQ SVLPAHWWQE NNVHLHSGVT IKTLGRDTRE LVLANGESWH WDQLFIATGA AARPLPLLDA LGERCFTLRH AGDAARLREV LQPERSVVIV GAGTIGLELA ASATQRRCKV TVIELAATVM GRNAPPPVQR YLLQRHQQAG VRILLNNAIE HVVDGEKVEL TLQSGETLQA DVVIYGIGIS ANDQLAREAN LDTTNGIVID EACRTCDPAI FAGGDVAITR LDNGALHRCE SWENANNHAQ IAAAAMLGLP LPLLPPPWFW SDQYSDNLQF IGDMRGDDWL CRGNPETQKA IWFNLQNGVL IGAVTLNQGR EIRSIRKWIQ SGKTFDAKQL TDENIALKSL
|
| |