Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3468 |
Symbol | |
ID | 4075102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 492552 |
End bp | 494063 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638004977 |
Product | 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
Protein accession | YP_611702 |
Protein GI | 99078444 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACC TCGCCGTCAA CCTCGACAAA CTCGCAGGTT TTCTCGAAGA ATACCGAAAA TCCGGCATCA CCAACCTCAT TGGCGGCGAG GCCCGCCCCG CCGCCTCCGG CCAGACGTTT GAAACGACCT CGCCGGTGGA TGAGAGCGTG ATCTGCTCGG TGGCCCGCGG CGGGTCCGAG GACATAGATG CAGCCGCACA GGCCGCCAAA GCCGCCTTCC CCGCATGGCG CGACCTGCCC GCGACCGAAC GCAAGGCGAT CCTGCACCGC ATCGCAGATG GGATTGTGGA ACGGGCCGAG GAAATCGCGC TCTGCGAGTG CTGGGACACC GGTCAGGCGC TGCGCTTCAT GTCCAAAGCG GCCCTGCGCG GAGCAGAGAA TTTTCGCTTC TTTGCCGACC GTGCGCCTTC GGCCCGCGAT GGTCAGATGC TGCCCTCGCC CACCTTGATG AATGTCACCA CCCGTGTGCC GATCGGCCCT GTGGGTGTCA TCACACCCTG GAATACGCCC TTCATGCTCT CCACATGGAA GATCGCGCCT GCCTTGGCCG CGGGCTGCAC CGTCGTCCAT AAACCGGCAG AGTTCTCGCC CCTGACGGCC CGTATTCTGG CGGAGATCGC CCATGATGCG GGCCTGCCCC CCGGCGTCTG GAACCTCGTC AACGGCCTCG GCGAAGAGGC CGGCAAGGCG CTCACCGAAC ATCCCGACAT CAAGGCCATC GCCTTTGTCG GTGAGAGCAA AACCGGCTCG ATGATCATGG CCCAAGGCGC GCCGACCCTG AAACGTGTGC ATTTTGAACT CGGCGGCAAG AACCCGGTGG TGGTCTTTGA CGATGCCGAT CTCGACCGGG CCTTGGACGC GGCGATCTTC ATGATCTACT CGCTGAATGG CGAGCGCTGC ACCTCTTCCT CGCGCCTGCT CATTCAGGAC AGCATCGCCG AGGCATTCGA GGCCAAACTG CTCGCGCGTG TAAACAGCAT CAAGGTCGGC CACCCGCTTG ACCCCCAAAC CGAGGTCGGC CCGCTCATTC ACAAGACCCA CTTCGACAAG GTGACCTCCT ATTTCGACAT CGCGCGGCAC GATGGCGCCA CCGTCGCCGC AGGTGGCAGC CGCCACGGCG ACACCGGCTG GTTTGTCAAA CCCACGCTCT TTACCGGGGC CACCAATCAG ATGACCATCG CCCGCGAGGA AATCTTCGGC CCGGTCCTCA CCGCCATCCG CTTCACCGAC GAAGACGAGG CGCTGAACAT CGCCAATGAC ACCCAATATG GCCTCACCGC CTATGTCTGG ACCAACGATG TGACCCGCGC CATGCGCTTC ACCAACCAGC TTGAGGCCGG GATGATCTGG GTGAACTCGG AAAACGTCCG CCACCTGCCT ACGCCCTTTG GTGGGGTCAA GGCTTCCGGG ATCGGGCGCG ACGGCGGCGA CTGGAGCTTT GAGTTCTACA TGGAACAGAA ACACATCGGC TTTGCCACCG GCCACCACAA GATCCCCCGC CTCGGCGCCT GA
|
Protein sequence | MSDLAVNLDK LAGFLEEYRK SGITNLIGGE ARPAASGQTF ETTSPVDESV ICSVARGGSE DIDAAAQAAK AAFPAWRDLP ATERKAILHR IADGIVERAE EIALCECWDT GQALRFMSKA ALRGAENFRF FADRAPSARD GQMLPSPTLM NVTTRVPIGP VGVITPWNTP FMLSTWKIAP ALAAGCTVVH KPAEFSPLTA RILAEIAHDA GLPPGVWNLV NGLGEEAGKA LTEHPDIKAI AFVGESKTGS MIMAQGAPTL KRVHFELGGK NPVVVFDDAD LDRALDAAIF MIYSLNGERC TSSSRLLIQD SIAEAFEAKL LARVNSIKVG HPLDPQTEVG PLIHKTHFDK VTSYFDIARH DGATVAAGGS RHGDTGWFVK PTLFTGATNQ MTIAREEIFG PVLTAIRFTD EDEALNIAND TQYGLTAYVW TNDVTRAMRF TNQLEAGMIW VNSENVRHLP TPFGGVKASG IGRDGGDWSF EFYMEQKHIG FATGHHKIPR LGA
|
| |