Gene Caul_4515 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4515 
Symbol 
ID5901976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4887714 
End bp4888889 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content70% 
IMG OID641565034 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001686133 
Protein GI167648470 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.657914 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA TCGTCATCGT TTCCGCCGCC CGCACGCCGG TCGGATCGTT CCTCGGCGCC 
CTGTCGAGCC TGCCGGCGCA CGAACTGGGC AAGCTGGCCA TCCAGGCCGC GGTCGAACGG
GCCGGCATCA GCGCCGCCGA TGTTGACGAG GTGATCCTGG GCCAGGTGCT GCAGGCCGCC
GCCGGCCAGG GCCCGGCCCG CCAGGCCTCG GTCAACGCCG GCATTCCGGT CGAAAGCCCG
GCCTGGAGCC TCAACCAGCT GTGCGGCTCG GGCCTGCGCG CCGTGGCCCT GGGCGCCCAG
CAGATCGCCG CCGGCGACGC CAACATCGTG GTGGTGGGCG GCCAGGAGAG CATGAGCCAG
GCCCCGCACG CGCAAGGCTT ACGCAACGGC CAGAAGATGG GCGACCTGTC GTTCGTCGAC
ACCATGATCA AGGACGGCCT GTGGGACGCC TTCCACGGCT ACCACATGGG CCAGACCGCC
GAGAACATCG CCAGCCGCTG GCAGATCACC CGCGAGGCCC AGGACCGCTT CGCCGTCGCC
AGCCAGAACA AGGCCGAGGC CGCCCAGAAG GCCGGCAAGT TCGACAGCGA GATCGTCCCG
GTCACCATCA AGGGCCGCAA GGGCGACACG GTGGTCAAGG ACGACGAGTT CATCCGCCAC
GGCGCGACCT ATGAAGGGAT CTCCGGCCTC AAGCCCGCCT TCACCAAGGA CGGCTCGGTC
ACCGCCGCCA ACGCCTCGGG CCTCAATGAC GGGGCCGCCG CCCTGGTGCT GATGAGCGCC
GACGAGGCCA AGAAGCGCGG CCTGACCCCG CTGGCCCGCA TCGCCTCGTG GGCCAATGCC
GGCGTCGAGC CCGAGATCAT GGGCACCGGC CCGATCCCGG CCAGCAAGAA GGCCCTGGAA
AAGGCCGGCT GGACGGTCGC CGACCTCGAC CTGATCGAGA GCAACGAGGC CTTCGCCGCC
CAGTCGCTGT GCGTGGTGCA GGAGCTGGGC CTCGACCCGG CCAAGGTCAA TGTCAACGGC
GGGGCCATCG CCATCGGCCA TCCGATCGGC GCCTCGGGGG CGCGCATCCT GACCACCCTG
GTCCACGAGA TGAAGCGGTC CGGCGCCAAG AAAGGCCTGG CCACGCTGTG CATCGGCGGC
GGCATGGGCG TGGCCATGTG CGTCGAGGCG GTCTGA
 
Protein sequence
MSDIVIVSAA RTPVGSFLGA LSSLPAHELG KLAIQAAVER AGISAADVDE VILGQVLQAA 
AGQGPARQAS VNAGIPVESP AWSLNQLCGS GLRAVALGAQ QIAAGDANIV VVGGQESMSQ
APHAQGLRNG QKMGDLSFVD TMIKDGLWDA FHGYHMGQTA ENIASRWQIT REAQDRFAVA
SQNKAEAAQK AGKFDSEIVP VTIKGRKGDT VVKDDEFIRH GATYEGISGL KPAFTKDGSV
TAANASGLND GAAALVLMSA DEAKKRGLTP LARIASWANA GVEPEIMGTG PIPASKKALE
KAGWTVADLD LIESNEAFAA QSLCVVQELG LDPAKVNVNG GAIAIGHPIG ASGARILTTL
VHEMKRSGAK KGLATLCIGG GMGVAMCVEA V