Gene TM1040_3026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3026 
Symbol 
ID4076599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp3194450 
End bp3195625 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content60% 
IMG OID638008355 
Productbeta-ketothiolase 
Protein accessionYP_615020 
Protein GI99082866 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.345239 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.537231 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGATA TTGTGATTCT GGATGGCGCC CGCACTGCGA TCGGAACCTT TGGTGGCGCG 
CTGGCCCAGA CGGCGCCGAT AGATCTTGGA GCGACGGTCG CCAAAGCTGC GATGGAGCGG
TCTGGTGTCG ATCCTGCTCA GATCGGCACT GTTGTCTATG GTCATGTGAT CAACACCGAA
CCACGTGATA TGTATCTGTC GCGGGTGGCA GCGATGCAGG CGGGAATTCC AGAAAGCACG
CCTGCGATGA ATGTGAACCG CCTCTGCGGG TCCGGCGCGC AAGCCATCGT ATCAGGAATC
CAAGCCTTGA TGCTGGGAGA TGCTGAATAC GCTCTGACCG GCGGAGCAGA AAGCATGTCG
CGCAGTCCAT TCATCACGCC TTCGACCCGC TGGGGGCAAA AGATGGGTGA CGTGAAATCG
CTCGACATGA TGCTGGGCGC TCTGAATTGC CCGTTTGGGA CTGGCCACAT GGGAGTGACG
GCAGAGAATG TCGCAGATGA GCATGAGATC ACGCGCGCTC AGATGGATGA GTTCGCGCTG
GTTAGCCAGA CCCGGGCCGC TGCCGCCATC GAGGCCGGCT ACTTCCAAAG CCAGATCGTG
CCGGTTGATG TCAAGGTGAA GCGGGACATG GTTCCGTTCG AAGTCGATGA GCATCCAAAG
GGCACATCGA TGGAGGCGCT CTCCGGGCTG CGTCCGGTGT TCAAGAAGGA TGGGCGTGTG
ACGGCTGGCA ATGCGTCGGG AATCAACGAT GGCGCGGCTG CATTGGTGCT CGCCACAGCC
GAGGCGGCTG AGAAATCCGG TCTGAAACCC AAGGCCCGTA TCCTCGGATA TGCCCATGCA
GGCGTTCGTC CGGAGGTCAT GGGCGTAGGT CCGATTCCGG CTGTAGAGCA GCTTCTGAAG
CGGATTGATA TGACTGTTGG TGACTTTGAC CTCATTGAAT CCAACGAGGC CTTTGCGGCG
CAGGCTCTGG CCGTCAACAA GGCACTTGGG TTGGACAGTG CCAAGGTAAA TCCGAATGGC
GGCGCAATTG CCCTGGGCCA TCCGGTCGGC GCAACCGGCG CCATCATCAC TGTCAAAGCG
CTCTATGAGC TGGAGCGCAC TGGCGGGCGC CGTGCGATCA TCACCATGTG TATCGGGGGC
GGGCAGGGGA TCGCCCTCGC GATCGAACGG ATCTGA
 
Protein sequence
MTDIVILDGA RTAIGTFGGA LAQTAPIDLG ATVAKAAMER SGVDPAQIGT VVYGHVINTE 
PRDMYLSRVA AMQAGIPEST PAMNVNRLCG SGAQAIVSGI QALMLGDAEY ALTGGAESMS
RSPFITPSTR WGQKMGDVKS LDMMLGALNC PFGTGHMGVT AENVADEHEI TRAQMDEFAL
VSQTRAAAAI EAGYFQSQIV PVDVKVKRDM VPFEVDEHPK GTSMEALSGL RPVFKKDGRV
TAGNASGIND GAAALVLATA EAAEKSGLKP KARILGYAHA GVRPEVMGVG PIPAVEQLLK
RIDMTVGDFD LIESNEAFAA QALAVNKALG LDSAKVNPNG GAIALGHPVG ATGAIITVKA
LYELERTGGR RAIITMCIGG GQGIALAIER I