Gene TM1040_3468 details

Gene Information

Locus tag: TM1040_3468
Symbol:
ID: 4075102
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 492552
End bp: 494063
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 63%
IMG OID: 638004977
Product: 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
Protein accession: YP_611702
Protein GI: 99078444
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
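
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a single stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the database.

```python
# Values copied from the gene record.
start_bp = 492552
end_bp = 494063
gene_length_bp = 1512
protein_length_aa = 503

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which adds no residue to the protein.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values.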


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACC TCGCCGTCAA CCTCGACAAA CTCGCAGGTT TTCTCGAAGA ATACCGAAAA 
TCCGGCATCA CCAACCTCAT TGGCGGCGAG GCCCGCCCCG CCGCCTCCGG CCAGACGTTT
GAAACGACCT CGCCGGTGGA TGAGAGCGTG ATCTGCTCGG TGGCCCGCGG CGGGTCCGAG
GACATAGATG CAGCCGCACA GGCCGCCAAA GCCGCCTTCC CCGCATGGCG CGACCTGCCC
GCGACCGAAC GCAAGGCGAT CCTGCACCGC ATCGCAGATG GGATTGTGGA ACGGGCCGAG
GAAATCGCGC TCTGCGAGTG CTGGGACACC GGTCAGGCGC TGCGCTTCAT GTCCAAAGCG
GCCCTGCGCG GAGCAGAGAA TTTTCGCTTC TTTGCCGACC GTGCGCCTTC GGCCCGCGAT
GGTCAGATGC TGCCCTCGCC CACCTTGATG AATGTCACCA CCCGTGTGCC GATCGGCCCT
GTGGGTGTCA TCACACCCTG GAATACGCCC TTCATGCTCT CCACATGGAA GATCGCGCCT
GCCTTGGCCG CGGGCTGCAC CGTCGTCCAT AAACCGGCAG AGTTCTCGCC CCTGACGGCC
CGTATTCTGG CGGAGATCGC CCATGATGCG GGCCTGCCCC CCGGCGTCTG GAACCTCGTC
AACGGCCTCG GCGAAGAGGC CGGCAAGGCG CTCACCGAAC ATCCCGACAT CAAGGCCATC
GCCTTTGTCG GTGAGAGCAA AACCGGCTCG ATGATCATGG CCCAAGGCGC GCCGACCCTG
AAACGTGTGC ATTTTGAACT CGGCGGCAAG AACCCGGTGG TGGTCTTTGA CGATGCCGAT
CTCGACCGGG CCTTGGACGC GGCGATCTTC ATGATCTACT CGCTGAATGG CGAGCGCTGC
ACCTCTTCCT CGCGCCTGCT CATTCAGGAC AGCATCGCCG AGGCATTCGA GGCCAAACTG
CTCGCGCGTG TAAACAGCAT CAAGGTCGGC CACCCGCTTG ACCCCCAAAC CGAGGTCGGC
CCGCTCATTC ACAAGACCCA CTTCGACAAG GTGACCTCCT ATTTCGACAT CGCGCGGCAC
GATGGCGCCA CCGTCGCCGC AGGTGGCAGC CGCCACGGCG ACACCGGCTG GTTTGTCAAA
CCCACGCTCT TTACCGGGGC CACCAATCAG ATGACCATCG CCCGCGAGGA AATCTTCGGC
CCGGTCCTCA CCGCCATCCG CTTCACCGAC GAAGACGAGG CGCTGAACAT CGCCAATGAC
ACCCAATATG GCCTCACCGC CTATGTCTGG ACCAACGATG TGACCCGCGC CATGCGCTTC
ACCAACCAGC TTGAGGCCGG GATGATCTGG GTGAACTCGG AAAACGTCCG CCACCTGCCT
ACGCCCTTTG GTGGGGTCAA GGCTTCCGGG ATCGGGCGCG ACGGCGGCGA CTGGAGCTTT
GAGTTCTACA TGGAACAGAA ACACATCGGC TTTGCCACCG GCCACCACAA GATCCCCCGC
CTCGGCGCCT GA
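
The GC content reported in the record can be recomputed from a sequence printed in this layout, in blocks of ten bases separated by spaces and line breaks. The following is a minimal sketch; the function name `gc_content` is hypothetical, and it assumes the input contains only A, C, G, T and whitespace.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    # Remove the spaces and newlines used to lay the sequence out in blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence contains no bases")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Usage: the first three blocks of the gene sequence.
first_blocks = "ATGAGCGACC TCGCCGTCAA CCTCGACAAA"
print(f"{gc_content(first_blocks):.2%}")
```

Passing the full gene sequence to the same function gives the fraction to compare against the rounded percentage in the record.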
 
Protein sequence
MSDLAVNLDK LAGFLEEYRK SGITNLIGGE ARPAASGQTF ETTSPVDESV ICSVARGGSE 
DIDAAAQAAK AAFPAWRDLP ATERKAILHR IADGIVERAE EIALCECWDT GQALRFMSKA
ALRGAENFRF FADRAPSARD GQMLPSPTLM NVTTRVPIGP VGVITPWNTP FMLSTWKIAP
ALAAGCTVVH KPAEFSPLTA RILAEIAHDA GLPPGVWNLV NGLGEEAGKA LTEHPDIKAI
AFVGESKTGS MIMAQGAPTL KRVHFELGGK NPVVVFDDAD LDRALDAAIF MIYSLNGERC
TSSSRLLIQD SIAEAFEAKL LARVNSIKVG HPLDPQTEVG PLIHKTHFDK VTSYFDIARH
DGATVAAGGS RHGDTGWFVK PTLFTGATNQ MTIAREEIFG PVLTAIRFTD EDEALNIAND
TQYGLTAYVW TNDVTRAMRF TNQLEAGMIW VNSENVRHLP TPFGGVKASG IGRDGGDWSF
EFYMEQKHIG FATGHHKIPR LGA
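
The protein sequence can be checked against the gene sequence by translating codons. The sketch below relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers, and the function simply stops at the first stop codon.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# Usage: the first 30 bases and the last 12 bases of the gene sequence.
print(translate("ATGAGCGACC TCGCCGTCAA CCTCGACAAA"))  # MSDLAVNLDK
print(translate("CTCGGCGCCT GA"))                     # LGA
```

Both fragments match the corresponding ends of the protein sequence listed above.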