Gene TM1040_2186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2186 
Symbol 
ID4076785 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2295598 
End bp2297073 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content62% 
IMG OID638007508 
Productbifunctional 3-hydroxyacyl-CoA dehydrogenase/thioesterase 
Protein accessionYP_614180 
Protein GI99082026 
COG category[I] Lipid transport and metabolism
[R] General function prediction only 
COG ID[COG0824] Predicted thioesterase
[COG1250] 3-hydroxyacyl-CoA dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.628959 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAAA CAGCAGCAAT CATTGGCGGT GGCGTTATCG GCGGCGGTTG GGCCGCGCGG 
TTCTTGCTCA ATGGCTGGGA TGTGCGGGTC TTTGACCCGG ATCCCGAAGC CGAGCGCAAG
ATTGGCGATG TTCTTGCCAA TGCCCGCCGC AGCCTGCCCG GGCTTGGCAA TGTGGCGCTG
CCACCCGAAG GAAGTCTGAG CTATCATGAG ACCCTCGCGG AAACCGTGCA GGGCGTCGAC
TGGGTACAGG AGAGCGTGCC CGAGCGGCTC GATCTGAAAC AGAAAGTATA CGCAGAGCTC
GAGGCGCATG CGCCCGGGGG CGCGGTGATT GGTTCGTCGA CCTCGGGCTA CAAGCCCTCG
CAATTGCAGG ACGGGTTCAC CAACGCCGCA CAGATCGTTG TGGCGCATCC TTTCAATCCG
GTCTACCTCA TGCCGCTGGT GGAGGTGGTC ACAACTGATG TGAACACGCC CGAGATGATT
GCCAAGGCCA AGGCGATCAT CACCGAAATC GGCATGTATC CCTTGCACTT GAAAAAGGAA
ATCGACGCTC ATGTGGCGGA CCGGTTCCTT GAGGCGGTCT GGCGCGAGGC GCTGTGGCTG
GTAAAAGATG GCATCGCCAC CACCGAAGAG ATCGACAACG CGATCCGCTA TGGCTTTGGC
ATCCGCTGGG CGCAGATGGG GCTGTTTGAA ACCTACCGCG TTGCGGGCGG CGAGGCGGGC
ATGAAGCATT TCATGGCGCA GTTCGGCCCG GCTCTCAGCT GGCCCTGGAC CAAACTGATG
GATGTACCCG AGTATAATGA TGCGTTGGTC GACCTGATCG CAGGCCAGTC GGACGCACAG
TCAGGGCAGT ATTCCATCCG CGAGCTCGAG CGTATCCGCG ATGACAACCT CGTGGGGATG
ATGCGGGCGC TGGGCGCAAA CAACTGGGGT GCCGGGGCGC TTCAGAATGC TCATGACGCC
GCGCGCCTGT CAGAGGCGGG GCTTGCGCGC AGCATGGAGG ACGTTGCCGA TCTGTCGCAG
CCCATCCTCA CCCAGAGCCG CGTCGTGCCG CTCGATTGGA CCGACTACAA CGGCCACATG
ACCGAGAGCC GCTATCTGGA TGCCTTTGCC CAATCCACTG ACCGGCTGAT GGAGATCATC
GGCTGTGACG CCGACTATAT CGCCTCAGGC GGCAGCTACT TCACCGCCGA GACCCATATC
CGCCATATCG ACGAGGTCCA CGCGGGCCAC CCGGTCCAAG TCCGAACGCG GGTCATAATG
GGCGCGGGTA AGAAGATGCA CCTCTGGCAC GAGATGTATG AAGGCGACCG CCTCCTGGCG
ACCGGCGAGC ATATGCTCTT GCATGTGGAT CTGAAAACCC GCCGCTCGGC ACCGCCGGCC
GCGCATATCG AGGCCAATCT TGTGAAGCTC GCCGAGGCAC ATGCGGCGTT GCCCGCACCC
GAAGGGCTGG GTCGCGCCAT CGGTGCGCCT CGCTAA
 
Protein sequence
MTKTAAIIGG GVIGGGWAAR FLLNGWDVRV FDPDPEAERK IGDVLANARR SLPGLGNVAL 
PPEGSLSYHE TLAETVQGVD WVQESVPERL DLKQKVYAEL EAHAPGGAVI GSSTSGYKPS
QLQDGFTNAA QIVVAHPFNP VYLMPLVEVV TTDVNTPEMI AKAKAIITEI GMYPLHLKKE
IDAHVADRFL EAVWREALWL VKDGIATTEE IDNAIRYGFG IRWAQMGLFE TYRVAGGEAG
MKHFMAQFGP ALSWPWTKLM DVPEYNDALV DLIAGQSDAQ SGQYSIRELE RIRDDNLVGM
MRALGANNWG AGALQNAHDA ARLSEAGLAR SMEDVADLSQ PILTQSRVVP LDWTDYNGHM
TESRYLDAFA QSTDRLMEII GCDADYIASG GSYFTAETHI RHIDEVHAGH PVQVRTRVIM
GAGKKMHLWH EMYEGDRLLA TGEHMLLHVD LKTRRSAPPA AHIEANLVKL AEAHAALPAP
EGLGRAIGAP R