Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3392 |
Symbol | |
ID | 4075566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 412493 |
End bp | 413338 |
Gene Length | 846 bp |
Protein Length | 281 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638004901 |
Product | 5-carboxymethyl-2-hydroxymuconate delta-isomerase |
Protein accession | YP_611626 |
Protein GI | 99078368 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.401918 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTGC TGCGATATGG CCCTTTGGGT CAGGAAAAAC CCGGACTCCT TGATCAGGAC GGCAATATCC GCGACCTCTC CGGGCAGGTT GCAGATATTG GCGCTGCGAC GCTCGGTGAT GCGGGTCTCG ATGCCCTGCG CGCTCTTGAT CCCCAGAGCT TGCCTCTGGT CGAGGGCACG CCTCGGATCG GCGCTTGTGT CGGACAGGTT GGAAAATTCA TCTGCATCGG TTTGAACTAC GCAGACCATG CCGCCGAAAG CGGCATGAGC CTGCCTGAGG AGCCGGTGAT CTTCTTCAAG GCGACCTCTG CTATCTGCGG GCCCAACGAT GCTGTCGAAA TTCCGCGAAC CTCGGTCAAG ACCGACTGGG AAGTGGAACT GGGCGTGGTG ATCGGAAAGA CGGCGAAATA CATTGGCCGA GATGAGGCGC TGGATCACGT TGCGGGCTAC TGCGTGGTGA ATGATCTCTC GGAGCGTGAC TTTCAACTGC ATCGCTCCGG CCAATGGGTA AAGGGCAAAT CCGCTGATAC ATTTGGTCCT ATCGGCCCCT GGTTGGTGAC CCGCGACGAG GTCCCAGATC CGCAGAACCT GGCGATGTGG CTCGAGGTCA ATGGGCATCG CTACCAGGAC GGGTCCACCC GGACGATGCA TTTTGATGTG GCCACGGTGA TCTCGCATCT GTCGCAATTC ATGAGCTTGC AACCGGGTGA CGTAATTTCA ACCGGTACGC CGCCGGGCGT TGGCATGGGG CAAACGCCCG AGACCTACCT GAAACCCGGT GACGTGATGG AGCTGGGGAT CGCGGGGCTC GGTGTGCAGC GACAGGTGAC AGAGGCAGCA GAATGA
|
Protein sequence | MKLLRYGPLG QEKPGLLDQD GNIRDLSGQV ADIGAATLGD AGLDALRALD PQSLPLVEGT PRIGACVGQV GKFICIGLNY ADHAAESGMS LPEEPVIFFK ATSAICGPND AVEIPRTSVK TDWEVELGVV IGKTAKYIGR DEALDHVAGY CVVNDLSERD FQLHRSGQWV KGKSADTFGP IGPWLVTRDE VPDPQNLAMW LEVNGHRYQD GSTRTMHFDV ATVISHLSQF MSLQPGDVIS TGTPPGVGMG QTPETYLKPG DVMELGIAGL GVQRQVTEAA E
|
| |