Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0074 |
Symbol | thlA |
ID | 5711696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 73694 |
End bp | 74872 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641265968 |
Product | acetyl-CoA acetyltransferase |
Protein accession | YP_001531424 |
Protein GI | 159042630 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | [TIGR01930] acetyl-CoA acetyltransferases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 52 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0231112 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGACTG TCGCGATTTG TGGGGCGGCC CGGACGCCCA TGGGGGGCTT TCAGGGCGTG TTTTCCGATG TCAGCGCGGC GCAACTGGGC GGTGCAGCCA TCGCCGGGGC GCTGGCGGAT GCTGGCGTGG CCCCGGCGCA GGTGAATGAG CTGCTGATGG GCTGCGTGCT GCCGGCGGGA CAGGGGCAAG CCCCGGCGCG GCAGGCCGGA TATGCGGCGG GACTGGGCGA CGCGGTGCCT GCCACGACGC TCAACAAGAT GTGCGGCTCT GGCATGAAGG CGGCGATGAT CGCCTGCGAC CAGATCGCGC TCGGCCAGTC CGACCTGGTG GTCGCCGGCG GCATGGAGAG CATGACCAAC GCGCCCTACC TGCTGGACAA GATGCGGGGC GGGGCGCGGA TCGGCCATGG GCAGGTGATC GATCACATGT TTCTCGATGG GCTGGAGGAT GCCTATGACA AGGGCCGCCT GATGGGCACC TTTGCCGAGG ACTGCGCCGA GGCGTTCCAG TTCACGCGTG CGGCCCAGGA CACCTATGCG CTGGGCTCGC TGGAAAATGC GCTGGCGGCG GAGGCGTCCG AGGCTTTCGC GATGGAACTG GTGCCGGTGA CCGTTTCCGG GCGCAAAGGC GAGACCGTGG TGATACGGGA TGAACAACCC GCCGCGGCCC GGCCCGAGAA GATCCCCCAT CTCAAGCCCG CCTTCCGCAA GGACGGGACC GTCACGGCGG CGAATTCCTC GTCGATCTCG GACGGGGCGG CGGCGCTGGT TCTGGCCGAC GCCGGACAGG CCGAGGCCCA TGGCCTGCCG GTGCGGGCCC GGGTGCTCGG GCATGCGAGC CATGCCCAGA AGCCCGCGCT TTTCCCGACG GCCCCGGTGC CGGCGGCGCG CAAACTGCTC GACCGGCTCG GCTGGTGCGT GGCGGACGTG GATCTGTGGG AGGTCAACGA GGCCTTCGCG GTCGTGCCCA TGGCCTTCAT GCACGAGATG GGCGTGCCGC GGGAGAAGAT GAACGTAAAC GGCGGGGCCT GTGCCTTGGG TCACCCGATC GGGGCCTCCG GCGCGCGGAT CCTGGTGACG TTGCTCAACG CCATGGAGGC GCGGGACCTG AAACGGGGCG TGGCCGCGAT CTGCATCGGG GGCGGGGAAG GCACTGCCAT CGCGCTGGAG CGCGACTAA
|
Protein sequence | MRTVAICGAA RTPMGGFQGV FSDVSAAQLG GAAIAGALAD AGVAPAQVNE LLMGCVLPAG QGQAPARQAG YAAGLGDAVP ATTLNKMCGS GMKAAMIACD QIALGQSDLV VAGGMESMTN APYLLDKMRG GARIGHGQVI DHMFLDGLED AYDKGRLMGT FAEDCAEAFQ FTRAAQDTYA LGSLENALAA EASEAFAMEL VPVTVSGRKG ETVVIRDEQP AAARPEKIPH LKPAFRKDGT VTAANSSSIS DGAAALVLAD AGQAEAHGLP VRARVLGHAS HAQKPALFPT APVPAARKLL DRLGWCVADV DLWEVNEAFA VVPMAFMHEM GVPREKMNVN GGACALGHPI GASGARILVT LLNAMEARDL KRGVAAICIG GGEGTAIALE RD
|
| |