Gene Dshi_3066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3066 
SymbolatoB 
ID5710918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp3234503 
End bp3235678 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content67% 
IMG OID641268993 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001534400 
Protein GI159045606 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAATG TCGTAATCGC ATCCGCCGCG CGTACTGCCG TCGGCAGCTT CGGCGGATCC 
TTTGCCAACA CGCCTGCCCA TGACCTGGGC TCCGCCGTGC TCGAAGCGCT GGTAGCGCGC
GCGGGGATCG AGAAGGGAGA AGTCTCCGAG ACCATCCTCG GCCAGGTGTT GACCGGCGGC
CAGGGCCAGA ACCCGGCGCG CCAGGCGCAT ATCAACGCAG GCCTGCCGCA GGAAAGCGCG
GCCTGGGGTC TCAACCAGGT GTGCGGCTCG GGCCTGCGCG CGGTCGCCCT CGGCGCCCAG
CACATCCAGC TCGGCGATGC GGAGATCGTC TGCGCCGGCG GCCAGGAAAA CATGACGCTC
AGCCCCCATG TGGCCAACCT GCGCGCGGGC CAGAAGATGG GCGACATGAA GTTCATCGAC
TCGATGATCC GCGACGGCCT CTGGGACGCG TTCAACGGCT ACCACATGGG CCAGACCGCC
GAAAACGTCG CCGAGAAGTG GCAGATCAGC CGCGAGATGC AGGACGAGTT CGCCGTCGCC
AGCCAGAACA AGGCCGAGGC CGCCCAGAAG GCGGGCAAGT TCGATGACGA GGTGGTGGCC
TTCACCATCA AGACCCGCAA GGGCGACATC GTCGTGGACA AGGACGAGTA CATCCGCCAC
GGCGCGACCA TGGAGGCCAT GCAGAAACTG CGCCCGGCCT TCACCAAGGA CGGCTCGGTC
ACGGCGGCCA ATGCGTCGGG GCTGAACGAC GGCGCGGCCG GCGTTCTGCT GATGTCGGCG
GAAAATGCCG AGAAGCGCGG GATCACCCCG ATGGCGCGCA TCGCGTCCTA CGCCACCGCC
GGGCTCGACC CGTCGATCAT GGGCGTCGGG CCGATCTATG CCTCGCGCAA GGCGCTGGAG
AAGGCCGGGT GGAAGGTCGA CGACCTGGAC CTGGTGGAAG CCAACGAAGC CTTCGCCGCC
CAGGCCTGTG CCGTGAACAA GGACATGGGC TGGGATCCGG CGATCGTGAA CGTGAACGGC
GGCGCAATCG CCATCGGTCA CCCGATCGGC GCCTCCGGCG CGCGGGTTCT CAACACCCTG
CTGTTCGAAA TGCAGCGGCG GGATGCCAAG AAGGGCCTTG CCACGCTGTG CATCGGCGGC
GGCATGGGCG TGGCGCTCTG CGTCGAGCGC CCCTGA
 
Protein sequence
MTNVVIASAA RTAVGSFGGS FANTPAHDLG SAVLEALVAR AGIEKGEVSE TILGQVLTGG 
QGQNPARQAH INAGLPQESA AWGLNQVCGS GLRAVALGAQ HIQLGDAEIV CAGGQENMTL
SPHVANLRAG QKMGDMKFID SMIRDGLWDA FNGYHMGQTA ENVAEKWQIS REMQDEFAVA
SQNKAEAAQK AGKFDDEVVA FTIKTRKGDI VVDKDEYIRH GATMEAMQKL RPAFTKDGSV
TAANASGLND GAAGVLLMSA ENAEKRGITP MARIASYATA GLDPSIMGVG PIYASRKALE
KAGWKVDDLD LVEANEAFAA QACAVNKDMG WDPAIVNVNG GAIAIGHPIG ASGARVLNTL
LFEMQRRDAK KGLATLCIGG GMGVALCVER P