Gene TM1040_3735 details


Gene Information

Locus tag: TM1040_3735
Symbol:
ID: 4075442
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 793512
End bp: 794687
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 63%
IMG OID: 638005255
Product: acetyl-CoA acetyltransferase
Protein accession: YP_611964
Protein GI: 99078706
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
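
The coordinates and lengths listed above agree with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS includes its stop codon. Neither assumption is stated in the record, and the function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp, has_stop_codon=True):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates (an assumption, not stated
    in the record) and, by default, a terminal stop codon that does
    not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    protein_len = codons - 1 if has_stop_codon else codons
    return gene_len, protein_len


print(cds_lengths(793512, 794687))  # (1176, 391)
```

Under these assumptions the result matches the record's Gene Length (1176 bp) and Protein Length (391 aa).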


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.902632
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAACG TAGTAATTGC ATCCGCAGCC CGCACCGCTG TCGGCAGCTT TGGCGGCGCC 
TTCGCCACCA CACCGGCCCA TGATCTGGGC GCCGCTGTGC TCGAGGCGGT TGTGGCCCGT
GCAGGCATCG ACAAATCCGA GGTCTCGGAA ACCATCCTGG GCCAGGTGCT GACCGCGGCA
CAGGGCCAGA ACCCCGCCCG GCAGGCCCAT ATCAATGCAG GTCTGCCCAA AGAATCCGCC
GCCTGGGGCA TCAACCAGGT CTGTGGCTCC GGTCTGCGCG CTGTTGCGCT TGGCGCGCAG
CACATCCAAC TGGGCGACGC CTCCATCGTT GCCGCAGGTG GTCAGGAAAA CATGACTCTC
AGCCCCCATG CTGCCAACCT GCGTGCCGGT CACAAGATGG GTGACATGAG CTACATCGAC
ACGATGATCC GCGACGGGCT GTGGGACGCG TTCAACGGCT ATCACATGGG TCAGACCGCC
GAGAACGTGG CAGAGCAATG GCAGATCTCG CGCGACATGC AGGACGAGTT TGCGGTTGCC
TCTCAAAACA AAGCCGAAGC CGCCCAGAAA GCGGGCAAGT TTGCCGATGA GATCACGCCC
TTTGTGGTGA AGCATCGCAA GGGCGACATC ACCGTGGACG CGGATGAATA CATCCGTCAC
GGCGCAACCA TTGAGGCGAT GCAGAAACTG CGCCCCGCCT TCACGCGCGA TGGCTCGGTC
ACCGCGGCGA ACGCGTCCGG TCTGAATGAT GGCGCGGCCG CGACCCTCTT GATGAGCGCG
GATGAAGCCG AAAAACGCGG AATCGAGCCG CTTGCCCGCA TCGCGTCCTA CGCAACCGCC
GGCCTTGATC CGTCGATCAT GGGTGTTGGC CCGGTCTTTG CCTCCCGCAA GGCACTCGAC
AAGGCGGGTT GGAGCGTGGA CGATCTGGAT CTGGTCGAAG CCAACGAGGC CTTTGCCGCG
CAGGCCTGTG CAGTGAACAA GGACATGGGC TGGAACCCTG AAATCGTGAA CGTCAACGGC
GGCGCGATTG CAATCGGCCA CCCGATTGGC GCGTCTGGGT GCCGCGTTCT GAACACCCTG
CTGTTTGAAA TGAAACGCCG TGGCGCCAAG AAAGGCCTCG CGACACTCTG CATCGGTGGG
GGCATGGGCG TCGCTATGTG CGTAGAGCGC CCGTAA
 
Protein sequence
MTNVVIASAA RTAVGSFGGA FATTPAHDLG AAVLEAVVAR AGIDKSEVSE TILGQVLTAA 
QGQNPARQAH INAGLPKESA AWGINQVCGS GLRAVALGAQ HIQLGDASIV AAGGQENMTL
SPHAANLRAG HKMGDMSYID TMIRDGLWDA FNGYHMGQTA ENVAEQWQIS RDMQDEFAVA
SQNKAEAAQK AGKFADEITP FVVKHRKGDI TVDADEYIRH GATIEAMQKL RPAFTRDGSV
TAANASGLND GAAATLLMSA DEAEKRGIEP LARIASYATA GLDPSIMGVG PVFASRKALD
KAGWSVDDLD LVEANEAFAA QACAVNKDMG WNPEIVNVNG GAIAIGHPIG ASGCRVLNTL
LFEMKRRGAK KGLATLCIGG GMGVAMCVER P
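
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how one might strip that layout, estimate GC content, and translate the gene with translation table 11 follows. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard NCBI amino-acid string, which table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model).

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(block_text):
    """Remove the spaces and line breaks used for display."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in an already cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First display line of the gene sequence from this record.
first_line = clean(
    "ATGACCAACG TAGTAATTGC ATCCGCAGCC CGCACCGCTG TCGGCAGCTT TGGCGGCGCC"
)
print(translate(first_line))  # MTNVVIASAARTAVGSFGGA
print(gc_fraction(first_line))
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the whole gene sequence through `clean` and `translate` should give the full protein under the same assumptions.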