Gene Dshi_0074 details


Gene Information

Locus tag: Dshi_0074
Symbol: thlA
ID: 5711696
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 73694
End bp: 74872
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 70%
IMG OID: 641265968
Product: acetyl-CoA acetyltransferase
Protein accession: YP_001531424
Protein GI: 159042630
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
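
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with one stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 73694
end_bp = 74872

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the sketch reproduces the listed 1179 bp and 392 aa, with no leftover bases after division into codons.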


Plasmid Coverage information

Num covering plasmid clones: 52
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0231112
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGACTG TCGCGATTTG TGGGGCGGCC CGGACGCCCA TGGGGGGCTT TCAGGGCGTG 
TTTTCCGATG TCAGCGCGGC GCAACTGGGC GGTGCAGCCA TCGCCGGGGC GCTGGCGGAT
GCTGGCGTGG CCCCGGCGCA GGTGAATGAG CTGCTGATGG GCTGCGTGCT GCCGGCGGGA
CAGGGGCAAG CCCCGGCGCG GCAGGCCGGA TATGCGGCGG GACTGGGCGA CGCGGTGCCT
GCCACGACGC TCAACAAGAT GTGCGGCTCT GGCATGAAGG CGGCGATGAT CGCCTGCGAC
CAGATCGCGC TCGGCCAGTC CGACCTGGTG GTCGCCGGCG GCATGGAGAG CATGACCAAC
GCGCCCTACC TGCTGGACAA GATGCGGGGC GGGGCGCGGA TCGGCCATGG GCAGGTGATC
GATCACATGT TTCTCGATGG GCTGGAGGAT GCCTATGACA AGGGCCGCCT GATGGGCACC
TTTGCCGAGG ACTGCGCCGA GGCGTTCCAG TTCACGCGTG CGGCCCAGGA CACCTATGCG
CTGGGCTCGC TGGAAAATGC GCTGGCGGCG GAGGCGTCCG AGGCTTTCGC GATGGAACTG
GTGCCGGTGA CCGTTTCCGG GCGCAAAGGC GAGACCGTGG TGATACGGGA TGAACAACCC
GCCGCGGCCC GGCCCGAGAA GATCCCCCAT CTCAAGCCCG CCTTCCGCAA GGACGGGACC
GTCACGGCGG CGAATTCCTC GTCGATCTCG GACGGGGCGG CGGCGCTGGT TCTGGCCGAC
GCCGGACAGG CCGAGGCCCA TGGCCTGCCG GTGCGGGCCC GGGTGCTCGG GCATGCGAGC
CATGCCCAGA AGCCCGCGCT TTTCCCGACG GCCCCGGTGC CGGCGGCGCG CAAACTGCTC
GACCGGCTCG GCTGGTGCGT GGCGGACGTG GATCTGTGGG AGGTCAACGA GGCCTTCGCG
GTCGTGCCCA TGGCCTTCAT GCACGAGATG GGCGTGCCGC GGGAGAAGAT GAACGTAAAC
GGCGGGGCCT GTGCCTTGGG TCACCCGATC GGGGCCTCCG GCGCGCGGAT CCTGGTGACG
TTGCTCAACG CCATGGAGGC GCGGGACCTG AAACGGGGCG TGGCCGCGAT CTGCATCGGG
GGCGGGGAAG GCACTGCCAT CGCGCTGGAG CGCGACTAA
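
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence length. The function name `gc_percent` is hypothetical, and the 10-base spacing and line breaks of the listing are stripped before counting.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example input: the first line of the gene sequence as listed above.
first_line = (
    "ATGAGGACTG TCGCGATTTG TGGGGCGGCC CGGACGCCCA TGGGGGGCTT TCAGGGCGTG"
)
print(round(gc_percent(first_line), 1))
```

Passing the whole gene sequence instead of the first line gives a value that can be compared with the rounded figure in the record.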
 
Protein sequence
MRTVAICGAA RTPMGGFQGV FSDVSAAQLG GAAIAGALAD AGVAPAQVNE LLMGCVLPAG 
QGQAPARQAG YAAGLGDAVP ATTLNKMCGS GMKAAMIACD QIALGQSDLV VAGGMESMTN
APYLLDKMRG GARIGHGQVI DHMFLDGLED AYDKGRLMGT FAEDCAEAFQ FTRAAQDTYA
LGSLENALAA EASEAFAMEL VPVTVSGRKG ETVVIRDEQP AAARPEKIPH LKPAFRKDGT
VTAANSSSIS DGAAALVLAD AGQAEAHGLP VRARVLGHAS HAQKPALFPT APVPAARKLL
DRLGWCVADV DLWEVNEAFA VVPMAFMHEM GVPREKMNVN GGACALGHPI GASGARILVT
LLNAMEARDL KRGVAAICIG GGEGTAIALE RD
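
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts; since this gene begins with ATG, no special start handling is included. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid map in the conventional TCAG ordering.
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# The first 30 bases of the gene sequence as listed above.
print(translate("ATGAGGACTG TCGCGATTTG TGGGGCGGCC"))
```

The first ten residues produced this way, MRTVAICGAA, match the start of the listed protein sequence, and the final codons CGC GAC TAA give the terminal RD followed by a stop.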