Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3704 |
Symbol | |
ID | 6065740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4054964 |
End bp | 4055815 |
Gene Length | 852 bp |
Protein Length | 283 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641603122 |
Product | 3,4-dihydroxyphenylacetate 2,3-dioxygenase |
Protein accession | YP_001726642 |
Protein GI | 170021688 |
COG category | [S] Function unknown |
COG ID | [COG3384] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02298] 3,4-dihydroxyphenylacetate 2,3-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.17743 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAAGT TAGCGTTAGC CGCCAAGATC ACGCACGTAC CGTCGATGTA TCTCTCTGAG CTGCCTGGGA AAAACCACGG TTGCCGCCAG GGTGCGATCG ACGGGCATAA AGAGATCAGC AAGCGTTGCC GGGAAATGGG CGTCGATACC ATTATCGTTT TCGATACCCA CTGGCTGGTC AACAGTGCTT ATCACATCAA CTGTGCAGAC CATTTTGAAG GCGTCTACAC CAGTAACGAG CTGCCGCATT TTATTCGTGA CATGACCTAC AACTACGAGG GCAACCCGGA GTTGGGGCAG CTTATTGCCG ATGAAGCCTT AAAGCTCGGC GTGCGGGCAA AAGCGCACAA CATTCCCAGC CTGAAACTGG AATACGGCAC GCTGGTGCCG ATGCGCTACA TGAATGAAGA CAAGCACTTC AAAGTGGTCT CCATTTCGGC TTTCTGCACG GTTCACGATT TTGCCGACAG CCGCAAGCTG GGCGAAGCGA TTCTGAAAGC GATCGAACAG TACGACGGCA CCGTGGCGGT CCTTGCCAGC GGTTCGTTAT CGCACCGCTT TATTGACGAT CAGCGTGCAG AAGAAGGGAT GAACAGCTAC ACCCGCGAGT TCGACCGCCA GATGGACGAG CGCGTGGTGA AGTTGTGGCG CGAAGGCCAG TTCAAAGAGT TCTGCAATAT GCTGCCGGAG TACGCCGACT ACTGCTACGG CGAAGGCAAT ATGCACGACA CGGTGATGCT GCTGGGGATG CTCGGCTGGG ATAAATACGA CGGCAAGGTG GAGTTTATTA CCGAGCTGTT CCCAAGCTCT GGCACCGGTC AGGTTAACGC TGTTTTCCCG CTGCCCGCGT AA
|
Protein sequence | MGKLALAAKI THVPSMYLSE LPGKNHGCRQ GAIDGHKEIS KRCREMGVDT IIVFDTHWLV NSAYHINCAD HFEGVYTSNE LPHFIRDMTY NYEGNPELGQ LIADEALKLG VRAKAHNIPS LKLEYGTLVP MRYMNEDKHF KVVSISAFCT VHDFADSRKL GEAILKAIEQ YDGTVAVLAS GSLSHRFIDD QRAEEGMNSY TREFDRQMDE RVVKLWREGQ FKEFCNMLPE YADYCYGEGN MHDTVMLLGM LGWDKYDGKV EFITELFPSS GTGQVNAVFP LPA
|
| |