Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2695 |
Symbol | hcaD |
ID | 6142684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2767381 |
End bp | 2768583 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617566 |
Product | phenylpropionate dioxygenase ferredoxin reductase subunit |
Protein accession | YP_001744731 |
Protein GI | 170683500 |
COG category | [R] General function prediction only |
COG ID | [COG0446] Uncharacterized NAD(FAD)-dependent dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAAA AAACGATCAT TATTGTCGGT GGCGGGCAAG CGGCGGCAAT GGCTGCGGCC TCGCTACGCC AGCAAGGGTT CACCGGTGAG CTGCATCTGT TTTCCGATGA GCAACATCTT CCTTATGAAC GCCCCCCGCT CTCGAAATCC ATGTTGCTGG AAGATTCCCC ACAGTTGCAG TCTGTGTTAC CCGCTCACTG GTGGCAGGAA AACAATGTTC ATCTGCATTC CGGTGTAACC ATCAAAACAT TGGGCCGCGA CACACGAGAG TTAGTGTTAG CTAACGGCGA AAGCTGGCAC TGGGATCAGC TTTTTATAGC AACCGGCGCG GCAGCCAGAC CGCTGCCGTT GCTTGATGCA CTGGGAGAAC GCTGCTTTAC TCTGCGCCAT GCCGGCGATG CCGCCAGACT GCGAGAAGTT CTGCAGCCCG AACGGTCAGT CGTGATTGTC GGTGCCGGAA CTATTGGTCT GGAACTGGCT GCCAGCGCCA CGCAGCGTGG ATGTAAGGTG ACAGTGATTG AACTGGCGGC AACCGTCATG GGCCGTAATG CACCACCGCC CGTGCAACAC TATCTTTTAC AGCGCCATCA GCAGGCTGGT GTGCGCATTC TGCTCAATAA TGCCATTGAA CATGTGGTCG ATGGTGAAAA CGTAGAACTG ACGCTGCAAA GTGGCGAAAC GCTTCGGGCC GATGTGGTGA TTTACGGTAT TGGTATCAGC GCCAACGACC AACTGGCTCG CGAGGCCAAC CTTGATACTG CCAATGGCAT TGTCATTGAT GAGGCTTGCC GCACCTGCGA TCCCGCGATC TTTGCCGGTG GCGATGTGGC AATCACCCGT CTTGATAATG GTGCACTACA CCGCTGCGAA AGCTGGGAAA ACGCCAATAA CCAGGCGCAA ATTGCCGCTT CCGCAATGTT GGGGCTACCG CTTCCGCGAC TGCCGCCGCC GTGGTTCTGG AGCGATCAGT ACAGTGATAA CTTACAGTTT ATTGGCGATA TGCATGGCGA TGACTGGCTT TGTCGTGGCA ACCCGGAAAC TCAGAAGGCG ATTTGGTTTA ATCTGCAAAA CGGCGTGCTT ATCGGTGCAG TCACGCTGAA TCAGGGCCGT GAGATTCGCC CAATCCGCAA ATGGATCCAG AGCGGCAAAA CGTTTGATGC GAAACTGCTG ACAGATGAGG ACATCGCGCT TAAATCACTG TAA
|
Protein sequence | MKEKTIIIVG GGQAAAMAAA SLRQQGFTGE LHLFSDEQHL PYERPPLSKS MLLEDSPQLQ SVLPAHWWQE NNVHLHSGVT IKTLGRDTRE LVLANGESWH WDQLFIATGA AARPLPLLDA LGERCFTLRH AGDAARLREV LQPERSVVIV GAGTIGLELA ASATQRGCKV TVIELAATVM GRNAPPPVQH YLLQRHQQAG VRILLNNAIE HVVDGENVEL TLQSGETLRA DVVIYGIGIS ANDQLAREAN LDTANGIVID EACRTCDPAI FAGGDVAITR LDNGALHRCE SWENANNQAQ IAASAMLGLP LPRLPPPWFW SDQYSDNLQF IGDMHGDDWL CRGNPETQKA IWFNLQNGVL IGAVTLNQGR EIRPIRKWIQ SGKTFDAKLL TDEDIALKSL
|
| |